Gene Rleg2_4960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4960 
Symbol 
ID6978054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp604787 
End bp606196 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content61% 
IMG OID643394112 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002278930 
Protein GI209547012 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.613851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTG CCAAACTTCA TATCACGACG AGCTTGGCTC CGATCACCGT TCACAACCCC 
TATGACGGGG CAATCCTGGG AACGGTCGAG GCCACGGATG CGAGCGACGT CAATGCCATT
CTTGGACGTG CCCGGCGCGG CGCGCAGATT TCGCGCAGCC TGCCGCGGCA TCAGCGTGCG
AGCATCCTGG AAAGGGCGGC TAACATCATC GAGAGCCGCC GCGACGCCTT TGCAGAAACC
ATCGTTCGCG AAGCCGGAAA GACAATTGTT CAGGCGCGCA AGGAAGTGCT GCGTTGCGTT
AATACGATAA AGCTCTCCGC AGAAGAAGCA AAGCGCAATG CGGGCGAAGT CGTGCCGTTC
GATGCATATA ACGGCTCTGA ACAACGGCAG GGGTGGTTCA CCCGCGAACC GCTCGGCATC
ATCACGGCGA TCACGCCCTA CAACGATCCT CTGAACCTGG TTGCGCACAA GCTTGGCCCG
GCCATCGCCG GCGGCAATGC GGTCCTGCTC AAGCCGTCGA ACCTGACGCC TTTCTCTGCC
ATCAAGCTGG TGGAAGCACT GCGTGAGGCG GGATTGCCTG AGGAAGTCAT CACGGTCGCG
CACGGTGACC GAGAACTGGT CACCGCGATG ATCGCCGCTC GCGAGGTGCG GATGGTGTCA
TTTACCGGCG GCTTTGCCAC CGCCGAGGCG ATCAGCCGCG CCGCTGGGCT AAAGAAGCTC
GCCATGGAGC TCGGTGGCAA TGCGCCGGTG ATCGTCATGA ACGACTGCGA CTTCGACAAA
GCCGTCGAAG GTTGCGTCTC CGGTGCCTTT TGGGCCGCAG GCCAAAACTG CATCGGTGCG
CAGCGCATTC TTATCCAGGG GAAGCTTTAC GATCGTTTCC GCGATGCATT CGTCTCAGCG
ACACAGAGGC TCAAGGCCGG CGACCCTCTG CAGGAAGATA CCGACGTCGG CCCGATGATC
TCCACCCAAG TCGCCGAACG CACCGAATCC GTCGTCAGCG ACGTCATCAA AGCAGGCGCA
AAGCTGCTCT GCGGCAATAG TCGCGAAGGA TCCCTCTATC ATCCGACGGT GCTCGAAGGC
ACGCCGGTGA CCTGCAAGCT ATGGCATGAG GAAGTGTTCG CACCCGTGGT CATGCTGGCA
CCGTTCCACA CGCTCGATCA GGCGATCGAG ATGGCCAACG ATCCGGATTA CAGCCTCCAT
GCCGGCATCT ACACCAGCGA CCTCAACGTT GCGCTTGACG CAGCCAACCG CATCGAGGCT
GGCGGCGTGA TGATCAATGA CTCCTCTGAC TACCGCTTCG ACGCCATGCC CTTCGGTGGT
TTCAAGTACG GCAGCATGGG CCGCGAGGGC GTCCGCTTCG CTTACGAAGA CATGACCCAG
CCGAAGGTCG TTTGCATCAA TCGGGGATAA
 
Protein sequence
MTAAKLHITT SLAPITVHNP YDGAILGTVE ATDASDVNAI LGRARRGAQI SRSLPRHQRA 
SILERAANII ESRRDAFAET IVREAGKTIV QARKEVLRCV NTIKLSAEEA KRNAGEVVPF
DAYNGSEQRQ GWFTREPLGI ITAITPYNDP LNLVAHKLGP AIAGGNAVLL KPSNLTPFSA
IKLVEALREA GLPEEVITVA HGDRELVTAM IAAREVRMVS FTGGFATAEA ISRAAGLKKL
AMELGGNAPV IVMNDCDFDK AVEGCVSGAF WAAGQNCIGA QRILIQGKLY DRFRDAFVSA
TQRLKAGDPL QEDTDVGPMI STQVAERTES VVSDVIKAGA KLLCGNSREG SLYHPTVLEG
TPVTCKLWHE EVFAPVVMLA PFHTLDQAIE MANDPDYSLH AGIYTSDLNV ALDAANRIEA
GGVMINDSSD YRFDAMPFGG FKYGSMGREG VRFAYEDMTQ PKVVCINRG