Gene Rleg_1899 details

Gene Information

Locus tag: Rleg_1899
Symbol:
ID: 8012948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1884182
End bp: 1885162
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 60%
IMG OID: 644824488
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_002975720
Protein GI: 241204624
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
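
The length fields above follow from the coordinates. Assuming Start bp and End bp are 1-based inclusive positions (an assumption that matches the reported values but is not stated in the record), the gene length is End - Start + 1, and the protein length is the codon count minus one stop codon. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1884182, 1885162)
print(length, protein_length(length))  # 981 326
```

Both results agree with the Gene Length and Protein Length fields of this record.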


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.317199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.179225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAAA CGCAGGCTGT TTCATTTTCG AGAACGGGTG GGCCGGAGGT TTTCGACTAT 
GTCGAGATCG ATCTTCCCTC ACCCTCGACG GGCGAAGTGC AGATCAGGCA GGCGGCGGTT
GGGCTTAATT TCATCGACGT CTATTTCCGC AACGGCACCT ACAAGGCGCC GCATCTGCCC
TTCGTCACCG GCAAGGAGGG CGCCGGCACC GTGACATCGG TCGGTCCCGG CGTCGAGGAT
TTCAAGGTCG GCGACCGTGT CGCCTATGCC AGTGCCGATG GTGCCTATAG CGCCGAGCGC
AATGTCGAGA CGCGCCATCT GGTGCATGTT CCCGAGGGAA TCGAGCTCGA AACCGCAGCG
GCGATGATGC TGAAGGGCAT GACCGCCGAA TATCTCTTGA ACCGCACCTT CAAGGTCGGC
CCGCAGACCG TCCTGCTGTT CCACGCCGCT GCCGGCGGCG TCGGCCTGAT CGCCGGCCAA
TGGGCTAAGG CGCTGGGCGC CACCGTCATC GGCACGGCGG GCTCTGAAGA CAAGATCGAG
CTGGCGCTCG CCCATGGCTA CGATCATGTG ATCAACTACA AGAGCGACAG CTTCGTCGAC
CGTGTCCGCG ACATCACCGG CGGCAAGGGC GTGGATGTCG TCTACGATTC GATCGGCCGC
GATACTTTTC CACAGTCGCT TGACTGCCTG AAGCCGCGGG GCCTTTTTGC CTCCTTCGGC
CAATCCTCCG GACCGATCGA GAATTTCACC CTTGCGGCTC TGGCGCAAAG GGGCTCGCTC
TTTGCGACGC GGCCGACGCT CTTCACCTAT ATCGCCACGC GTCAGGAACT GATCGACAGT
GCGAAAGCGC TATTTGATAT TGTGCAAAGC AACAAAGTGC GTATCAATAT CAATCAAACC
TATCCGCTGC GTGAGGTTGG GCGGGCTCAT GCGGATCTGG AGACAAGAAA AACAACAGGA
ACGACGCTGC TGATTCCATG A
 
Protein sequence
MTKTQAVSFS RTGGPEVFDY VEIDLPSPST GEVQIRQAAV GLNFIDVYFR NGTYKAPHLP 
FVTGKEGAGT VTSVGPGVED FKVGDRVAYA SADGAYSAER NVETRHLVHV PEGIELETAA
AMMLKGMTAE YLLNRTFKVG PQTVLLFHAA AGGVGLIAGQ WAKALGATVI GTAGSEDKIE
LALAHGYDHV INYKSDSFVD RVRDITGGKG VDVVYDSIGR DTFPQSLDCL KPRGLFASFG
QSSGPIENFT LAALAQRGSL FATRPTLFTY IATRQELIDS AKALFDIVQS NKVRININQT
YPLREVGRAH ADLETRKTTG TTLLIP
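
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below is a minimal, hand-rolled translator rather than the database's own tooling. It builds the codon table from the NCBI base ordering (T, C, A, G), relying on the fact that table 11 assigns the same amino acids to codons as the standard code. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11;
# codons are ordered by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 bases of the gene sequence listed above.
print(translate("ATGACCAAAA CGCAGGCTGT TTCATTTTCG"))  # MTKTQAVSFS
print(translate("ACGACGCTGC TGATTCCATG A"))           # TTLLIP*
```

The two fragments reproduce the start (MTKTQAVSFS) and end (TTLLIP, followed by the TGA stop codon) of the protein sequence. Passing the full gene sequence to `translate` in the same way would check the remaining residues.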