Gene Rleg_6068 details

Gene Information

Locus tag: Rleg_6068
Symbol:
ID: 8016330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 103514
End bp: 104641
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 59%
IMG OID: 644827376
Product: protein of unknown function UPF0052 and CofD
Protein accession: YP_002978576
Protein GI: 241258692
COG category: [S] Function unknown
COG ID: [COG0391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01826] conserved hypothetical protein, cofD-related
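
The start, end, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(103514, 104641)
print(length, "bp")                  # gene length
print(protein_length(length), "aa")  # protein length
```

With the coordinates listed in this record, the sketch reproduces the stated 1128 bp and 375 aa.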


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGGA TTGTATTATT CTCGGGCGGC AGCGCCTGTC GATCGATCAA TGTGGCGCTG 
GGCCAACGTG GAGCCGATGT CACCCGCGTC GTGCCTGCTT GGGATAGCGG CGGCAGCTCG
AAAGTTATTC GCGAGCGGCT CTCCATCCTC TCAGTGGGAG ACATCCGCCA GGCTCTGATG
ACGATGGCAC ATGGAGAAGG CTGCGCCGGT GACGTCGTCA AGATCTGCAA CGCCCGGGTA
TCGGCCAATC TTGGCTTTGA CGATGCCCGC AAGGAATTCC TCTTCTACGC CGAAGGCCGT
CATCCGTTGC TCGAGAGAAT GGAGCCGGGC CTGCGCGGTG CAATCCTGAA TTATCTGAAC
ACATTCGCCA CTTCTGTTGG TCGAGATTTC GACTTCAGGC ACGGCAGCGT CGGGAACTTC
ATTCTGACGG GCGCCTGCGT GGCGCATAAT GGAGACGTCA ACACGGCGAT CTTCGTGTTC
AGAAAGCTGT GCGGCATCGT CGGAAACGTA TGGCCTTCGT CCTGTGACAA CGACTTGGTT
CTCTCCGCAA CTCTGAGGGA CGGTCGGAGG CTTGCGCCGC AGGACGTCAT CACATCGATG
GGGGCAGGGG ATGCAAAGGT TGGGATCGCA GAGGTCGAGC TGGCTGGCGC CGATCAGGCC
CTGCCGGTTG CCAACTCGGC TGTGCTGGAC GCCGTTGCCC GGGCGGACCT GATTGCCATC
GGTCCGGGCA GTCTCTACAC GAGTATCCTT CCTCATCTTC TGGTGACGGG CCTCGTTTCA
GCTATCGAGA AGGCCAACTG CCCGAAAGTC TTCATCGGCA ACATCCTGCA ATGCCGGGAG
ACCATGGGAC TAACGCTCGA AGACGTCCTG GCGGCCGCGG ATCTGCATGT GCGCAAGGGC
GGCGGAACCT CGAATCTGTT CACCTTCGTT CTCGCAAACA GGATGCTGTT TCCCTTCGAG
AAGACGGTCG GAACGTTTCC CTATCTCAGA GAGAAACCGC AGGAAAAGAA CGGGCGTCAC
ATCATCAAGG GCGAATTCGA AGATGCCTGG AGCCGCGGTC AGCATGACGG TGATGCAACC
GCTGCAGCCT TGCTGAAAAT CGCATCCGGA GACGCTTCAG ATGCATGA
 
Protein sequence
MKRIVLFSGG SACRSINVAL GQRGADVTRV VPAWDSGGSS KVIRERLSIL SVGDIRQALM 
TMAHGEGCAG DVVKICNARV SANLGFDDAR KEFLFYAEGR HPLLERMEPG LRGAILNYLN
TFATSVGRDF DFRHGSVGNF ILTGACVAHN GDVNTAIFVF RKLCGIVGNV WPSSCDNDLV
LSATLRDGRR LAPQDVITSM GAGDAKVGIA EVELAGADQA LPVANSAVLD AVARADLIAI
GPGSLYTSIL PHLLVTGLVS AIEKANCPKV FIGNILQCRE TMGLTLEDVL AAADLHVRKG
GGTSNLFTFV LANRMLFPFE KTVGTFPYLR EKPQEKNGRH IIKGEFEDAW SRGQHDGDAT
AAALLKIASG DASDA
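
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases only. It uses the standard codon assignments, which translation table 11 shares for amino acid decoding (the tables differ in permitted start codons); the `translate` helper is a hypothetical name written for this illustration, not a library function.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # tolerate the 10-base spacing used above
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACGGA TTGTATTATT CTCGGGCGGC"))
```

The output matches the first ten residues of the protein sequence, MKRIVLFSGG.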