Gene Rleg_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3068 
Symbol 
ID8013979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3064462 
End bp3065442 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID644825636 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002976864 
Protein GI241205768 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0181294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.830882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCG TCCAATTCAA TCGCTTCGGT CCGCCAGATG TCCTCGAGCT CGTGGAACTG 
CCGGTCCCCG AGCCGGGACC GGACGAGGTG CTTGTCCGCG TCCACGCGGC GGGCGTCAAC
TTCTTCGAGG TGTTGATGCG GGCCGACCGC TATGCCGTGA CGCCCGACCT GCCGATGTTT
CCCGGTGTCG AGGTCGCGGG TACGATCGAG CGAGCAGGGC CTGGTGCCGA CCGCTCGCTC
ATTGGTACGC GCGTTGCCGT TCCTCTCTTT GCGATGGGGC GCGGTTCGGG CGGTTATGCC
GAGTTCGTTG CGGTCGATGG CGGGGCGGTG GTGCAACTGC CCGGTGCGGT TTCTTTCGAG
GCGGCCGCCG CGCTGATGGT GCAGGGGTTG ACGGCGCTGC ACCTCCTACG CCGCAGTCCA
GTGAAAGGCA AAAACGTTCT CGTCAATGCG GCAGCCGGCG GTGTCGGTTC GCTCCTCCTG
CAGCTGGCGA GGCGCGACGG GACGAAGATG GTGATCGCGG CGGCGAGCAG TGACGAGAAG
AGGGCGCTTT CCCTGTCGCT TGGCGCCGAT CATGCGGTCG ATTATACGGC GCCCGGCTGG
CAGGAGGATG TCAAGAGGGT GACCGGAGGG CCCGGCGCGG ATGTCATCTA TGAAACCGTC
GGCGGCGCGT TTTCAAGGGC GGCGCTCGAT GCGCTGGCGC CTTGCGGGGA ACTGGTGCTG
GCGGCGATGG GGCGGTTCGG GCTCGGGGCC GCAGATGTCG AGGGCATGCT TGATCACAAC
CAGTCGATCA AGGGATTTTC GTTGTTGGCG CTGCTGACGC CTCAGGGGGT GCGTGAGGAT
CTTGCAGCGC TCTTCGAGCT TGCCGCGACG GGCGCCCTGA CGGTTATCGA CGGCGGTCGT
TTCCCACTGC ATCAAGCGGC GGAAGCGCAT CGCGCCATCG AAGATCGGCG GGCGGTCGGC
AAGGTGGTGC TGGTGCCTTA G
 
Protein sequence
MKAVQFNRFG PPDVLELVEL PVPEPGPDEV LVRVHAAGVN FFEVLMRADR YAVTPDLPMF 
PGVEVAGTIE RAGPGADRSL IGTRVAVPLF AMGRGSGGYA EFVAVDGGAV VQLPGAVSFE
AAAALMVQGL TALHLLRRSP VKGKNVLVNA AAGGVGSLLL QLARRDGTKM VIAAASSDEK
RALSLSLGAD HAVDYTAPGW QEDVKRVTGG PGADVIYETV GGAFSRAALD ALAPCGELVL
AAMGRFGLGA ADVEGMLDHN QSIKGFSLLA LLTPQGVRED LAALFELAAT GALTVIDGGR
FPLHQAAEAH RAIEDRRAVG KVVLVP