Gene Rleg2_4821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4821 
Symbol 
ID6977915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp464251 
End bp465285 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content66% 
IMG OID643393983 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002278801 
Protein GI209546883 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.111851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.477759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAC GTCTGGCACG GCTTTACCAG AAGGGCGATC TCAGGATCGA GACCGACACC 
GTTCCGGTCC CCGGTCCAGG CGAGGTGCTC CTCAAAATGG CAGCCGCAGG CATCTGCGGC
TCCGACCTGC ATTATTACCA GGACGGCGGC TTCGGGCCGG TCAGGGTGCG TGAGCCGATC
ATCCCGGGCC ACGAGGCCTC GGGAACGGTA AGCCAGCCAG GCGAGGGCGT CGACCTCAAG
GCCGGCGTGC TGGTGGCGAT CAATCCCAGC CAGCCCTGCG GACATTGCGA ATATTGCGCA
AAGGGACTGC CGATCCACTG CCTGGAGATG CGCTTCATGG GCAGCGCAAT GCGCCTGCCG
CATGAGCAGG GGATGTTCCG GGAATGGCTG GTGGTGCCGG CCAAACAGTG CTTCGAAGTG
GGGCCTGCGA CGACGGCCGC GGAGGCCGCC TGCAGCGAGC CCTTGTCCGT CTGCCTGCAC
GCCGCATCAC GTGCCGGAGA GATACCGGGC AAACGCGTCC TCATCACCGG CGCCGGCCCG
ATCGGCGCGC TGATGGTTGC TGTTGCTGCT TATCACGGTG CCACCGAGAT CGTGGTGACC
GATCTTGCGG ACGCGGCGCT GGAGAGGGCG AAGGCGATGG GCGCCAGCCG CGCGATCAAT
GTTTCCCGGG ATGCCGCAGC ACTTGCCGCA TTCGAGGCGG GCAAAGGCTA TTTCGATCTC
GTCTTCGAAT GCTCGGCCGC CGCTCCGGCC ATCCGCAGCG CGATCGCGGC CATCCGGCCG
CGCGGCACGA TCGTCCAGGT CGGCGTCACC GGTGAGATCC CGATCCCGCT CAACGCCATT
GTCGGCAAGG AGCTTCATAT TCACGGCACG CAGCGCTTCC ACGAGGAGTT CGCCACCGCC
GTCGCGCTGA TCTCCAGCCG CAAGATCGAT GTCCGGCCGA TCATCAGCCA CAGCCTGCCG
CTGGAAGAGG CGAACGCAGC CTTCGCGCTT GCCGGCGACC GCACGACCGC CTGTAAGGTG
CAGCTTACGT TTTAG
 
Protein sequence
MQTRLARLYQ KGDLRIETDT VPVPGPGEVL LKMAAAGICG SDLHYYQDGG FGPVRVREPI 
IPGHEASGTV SQPGEGVDLK AGVLVAINPS QPCGHCEYCA KGLPIHCLEM RFMGSAMRLP
HEQGMFREWL VVPAKQCFEV GPATTAAEAA CSEPLSVCLH AASRAGEIPG KRVLITGAGP
IGALMVAVAA YHGATEIVVT DLADAALERA KAMGASRAIN VSRDAAALAA FEAGKGYFDL
VFECSAAAPA IRSAIAAIRP RGTIVQVGVT GEIPIPLNAI VGKELHIHGT QRFHEEFATA
VALISSRKID VRPIISHSLP LEEANAAFAL AGDRTTACKV QLTF