Gene TM1040_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3381 
Symbol 
ID4075280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp396121 
End bp397569 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content59% 
IMG OID638004889 
Productaldehyde dehydrogenase 
Protein accessionYP_611615 
Protein GI99078357 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.464458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA TTCAAAAGAA CCTGATCGCT GGCGAATGGC TGACCGGCGA AGGCGAAATC 
GAAAATCGCA ACCCTTCGGA CCTTTCGGAT CTGGTCGGGA TCTTCGCTCA GGCCAGCAGC
GACCAGCTTG AGGCCACCCT AGATCAAGCA AAGGTCGCAC AACGCGAGTG GGCTGCCTAT
GGTCTCGAAC GCAAGCAGGC GGTCTTGATG GCCATCGGCA ATGAGTTGAT CGCCCGTTCT
GAGGAGCTTG GCACGCTGTT GTCGCGCGAA GAGGGCAAAC CTTTCGCTGA GGGCAAGGGC
GAAGTCTACC GTGCCGGTCA GTTCTTCACC TATTACGCCG CCGAATGCCT GCGCCAGATT
GGCGAGAACG CTGACTCGGT GCGTCCTGAC ATCGAAATCG ACGTGCGCCG CGAGGCCGTG
GGCACGGTCG CCATCATCAG CCCTTGGAAT TTTCCGACCG CCACCGCCTC GTGGAAAATT
GCGCCCGCTC TGTGCTACGG CAACGCAGTC GTGTGGAAAC CTGCCAATGT GACGCCTGCC
TCGGCGGTTG CGCTGGCAGA AATCATCAAC CGCCAAGACA TCCCCAAAGG GCTGTTCAGC
CTTGTGATGG GTGCGGGTCG CACCGTAGGT CAGCGCCTGG TCGAGAGCCC GAAAGTCAAT
GCGATTTCCT TTACTGGTTC AGTGCCAGTC GGCAAAGGCA TTGCCGCAGC GGCAATCCAG
AATCTGACTA AAGTACAGAT GGAGATGGGC TCGAAGAATG CGTTGGCCGT GATGGACGAC
GCGGATCTGA ACCTCGCGGT GAGCCTTGCT CTGGGTGGCG CTTTTGGCGG CACGGGTCAG
AAATGCACCG CGTCCTCTCG GCTTGTCGTC CACGCTGCAG TGCATGATGC CTTTGTTGAA
AAGCTGGTCG CCGGCGCACA AGCGATGAAG GTGGGCCACG CGTTGCATGA CGGCACACAG
ATGGGACCAG TGGTGAGCGC ACAGCAGCTC GAGGAAAACC TCGCCTATGT GGATCTGGGC
CGTTCAGAAG GGGCAGAACT GGCCTGTGGA GGAACGCGAT TGGAGATGCC GCACGACGGT
TTTTACATGT CGCCAGGCGT ATTTCTGAAC ACCAGCAACG ATATGCGCAT CAACCGCGAG
GAGATGTTTG CGCCGCTGAC CTCTGTAATC AAGGTCGACA GCTATGATGA AGCGCTGGCA
ACGGTGAACG ACACCAACTT TGGCCTGACC TCAGGCATTG TGACCAAATC GCTCGCCCGC
GCCACGCATT TCCGTCGCAA CGCACAGACT GGCGTTGTCA CCGTGAATCT GCCGACCGCG
GGCACCGATT ACCACGTTCC CTTTGGTGGG CGCGGCGACA GCTCTTACGG CCCACGCGAG
CAAGGCAAGG CCGCGGCAGA ATTCTACACA ACGGTCAAGA CGGCCTACAT CAGCGCAGGC
CCCGTCTGA
 
Protein sequence
MTDIQKNLIA GEWLTGEGEI ENRNPSDLSD LVGIFAQASS DQLEATLDQA KVAQREWAAY 
GLERKQAVLM AIGNELIARS EELGTLLSRE EGKPFAEGKG EVYRAGQFFT YYAAECLRQI
GENADSVRPD IEIDVRREAV GTVAIISPWN FPTATASWKI APALCYGNAV VWKPANVTPA
SAVALAEIIN RQDIPKGLFS LVMGAGRTVG QRLVESPKVN AISFTGSVPV GKGIAAAAIQ
NLTKVQMEMG SKNALAVMDD ADLNLAVSLA LGGAFGGTGQ KCTASSRLVV HAAVHDAFVE
KLVAGAQAMK VGHALHDGTQ MGPVVSAQQL EENLAYVDLG RSEGAELACG GTRLEMPHDG
FYMSPGVFLN TSNDMRINRE EMFAPLTSVI KVDSYDEALA TVNDTNFGLT SGIVTKSLAR
ATHFRRNAQT GVVTVNLPTA GTDYHVPFGG RGDSSYGPRE QGKAAAEFYT TVKTAYISAG
PV