Gene Rleg2_4457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4457 
Symbol 
ID6977551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp89730 
End bp91136 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content63% 
IMG OID643393635 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002278453 
Protein GI209546535 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.399001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0106481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCAGT CTACCCTGCT CATCGGTGGC GAAACTGTCG CCACCGCCCA GCATGCGCCG 
GTCACAAATC CGTCGAACGG CGAAATTGTC GGCTATATGC CGCTTGCCGG GCAAGACGAT
CTCGACCGTG CCGTTGCCGC CGCAGCCGCG GCTTTCAAAA GCTGGTCGCA GACCTCGAAT
GAACAGCGCG CCGGAGCCTG CCGCGCCATA GCGGAAAAGA TCAGCGAGCA CGCCGAGGAA
TTGGCGCAGC TCCTGACCCG GGAGCAGGGT AAGCCGCTCA ACGGCCTCGG GTCGCGTTTT
GAAATCGGCG GCGCACTTGC CTGGACGCGC CATACGGCGG AACTCGATCT GCCGGTCGAG
ATCCTGCAGG ATGACAATGA GGGCCGCGTC GAGCTGCACC GCAAGCCGAT TGGCGTTGTC
GGTTCGATCA CACCCTGGAA CTGGCCCGTC ATGATCGCCT GCTGGCACAT CGTGCCGGCG
GTGCGGGCCG GCAATACCGT GGTCATCAAG CCATCGCCCC TGACGCCGCT CTCGACCATC
CGCCTGGTCG AGATCATCAA CCAGGTGCTG CCGGCCGGCG TCGTCAATGT GATCACTGGG
GAAAACAGCA TTGGAGCCGC GCTTTCGGCC CATCCCGGTA TTGCCAAGAT GACCTTCACC
GGCTCGACCG AGACGGGCAA GAAAATCATG GCCTCGGCCG TCGCCACCTT GAAGCGGCTG
ACGCTGGAGC TCGGCGGCAA TGATGCGGGC ATCGTGCTGC CCGACGTCGA TCCGAAAGGC
GTCGCCGAGG GTCTGTTCTG GGGCGCCTTC ATCAATAACG GCCAGACCTG CGCGGCGCTG
AAACGCCTCT ATGTGCATGA CAGCATCTAT GAGGAGGTCT GCGCGGCACT TGCCGATTAC
GCCGGAAAGA TCACCGTCGG CGACGGCCTG GATGAGGCCA GCATCCTCGG GCCGATACAG
AACGAAATAC AGTTCAACAA AGTGCGCGAT CTCGTCGACG ATGCGCGCAC TCAGGGCGGC
CGCATCCTGA CCGGCGGCGC GCCGCTGGAC CGGCCCGGCT ATTTCTATCC GATCACCCTC
GTTGCCGATG TCGATCATGG TGTGCGCCTG GTCGATGAGG AGCAGTTCGG CCCGGCCCTG
CCGATCATTC GCTACAGCGA TCTCGACGAG GTGATCGCCC GCGCCAACCA GAATCCGGCC
GGTCTCGGCG GCTCGGTCTG GTCTGCCGAC GTCGAGAAGG CCAAGCGTTA TGCGAGGCAG
CTCGAATGCG GCTCGGTCTG GATCAACAAA CACGGCGCGA TCCAGCCCAA CGCGCCCTTC
GGCGGCGTCA AACAATCCGG CATCGGCGTC GAATTCGGCG CCGAAGGCCT GAAGGAATTC
ACCACGATCC AGACGGTGTT GAGCTGA
 
Protein sequence
MKQSTLLIGG ETVATAQHAP VTNPSNGEIV GYMPLAGQDD LDRAVAAAAA AFKSWSQTSN 
EQRAGACRAI AEKISEHAEE LAQLLTREQG KPLNGLGSRF EIGGALAWTR HTAELDLPVE
ILQDDNEGRV ELHRKPIGVV GSITPWNWPV MIACWHIVPA VRAGNTVVIK PSPLTPLSTI
RLVEIINQVL PAGVVNVITG ENSIGAALSA HPGIAKMTFT GSTETGKKIM ASAVATLKRL
TLELGGNDAG IVLPDVDPKG VAEGLFWGAF INNGQTCAAL KRLYVHDSIY EEVCAALADY
AGKITVGDGL DEASILGPIQ NEIQFNKVRD LVDDARTQGG RILTGGAPLD RPGYFYPITL
VADVDHGVRL VDEEQFGPAL PIIRYSDLDE VIARANQNPA GLGGSVWSAD VEKAKRYARQ
LECGSVWINK HGAIQPNAPF GGVKQSGIGV EFGAEGLKEF TTIQTVLS