Gene TM1040_2655 details

Gene Information

Locus tag: TM1040_2655
Symbol:
ID: 4077958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2789859
End bp: 2791367
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 65%
IMG OID: 638007979
Product: aldehyde dehydrogenase
Protein accession: YP_614649
Protein GI: 99082495
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
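
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2789859
end_bp = 2791367
gene_length_bp = 1509
protein_length_aa = 502


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1509
print(expected_protein_length(gene_length_bp))  # 502
```

Both results agree with the listed gene length (1509 bp) and protein length (502 aa).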


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCACGAA CAGCAGACGA TATCCTGAAG AACCTGGGCC TGACGGCGGC AGAGCTGAGC 
GGTGGCAGCC GCGCGGTGCG CTCGCCCATT GATGGCAGCA CGCTGGCCGA GGTCCACGAC
ACACCCGCAG GCGAGATGCC CGCGATTCTG GACCGCGCAC AGTCCGCGTT CAAGGCGTGG
CGCGTGGTGC CCGCCCCGCG GCGCGGTGAG CTCATTCGCC TGCTCGGTGA AGAGCTGCGC
GCGGCCAAGG AGGACCTTGG CGCGCTGGTC AGCTGGGAAG CGGGCAAGAT CACCTCTGAA
GGCCTTGGCG AAGTGCAGGA GATGATCGAC ATCTGCGACT TTGCGGTGGG TCTGTCGCGC
CAGCTTTATG GTCTGACGAT TGCCTCGGAA CGTCCCGGCC ACAGCATGCG CGAGACATGG
CATCCGGCGG GGCCTGTGGG CGTGATCTCG GCGTTCAACT TTCCGGTTGC GGTCTGGTCG
TGGAACGCGG CGCTGGCGAT TGTCTGCGGC GATCCGGTGA TCTGGAAACC GTCGGAGAAA
ACACCGCTGA CGGCGCTGGC CTGCACCAAG ATCTTTGAAC GCGCGGTGAA ACGCTTTGGC
GAGGATGCGC CCGAGGGCCT GCTGCAGATC CTGATCGGCG ATGCGGAGCT GGGCAAGGAG
CTGGTCGCCA GCCCATCCGT GCCGGTGATT TCGGCCACGG GGTCGACCCG CATGGGCCGC
GCGGTGGCAC CTGTGGTGGC AGAGCGGTTT GGCAAATGCA TTCTCGAACT GGGCGGCAAT
AACGCGATGA TCGTGGCCCC GTCGGCGGAT CTGGAAATGG CAGTGCGGGC GATCGTGTTC
TCTGCCGTGG GCACCGCCGG TCAGCGCTGC ACCTCGCTGC GCCGCCTGAT CGTGCACAAC
TCCATCCGCG CGGATCTGGT GAAGCGCCTG AAGGCCGCCT ATGCCGGTCT GCCGATTGGT
GATCCGCAGG CCGCGGGCAC GCTGGTTGGC CCGCTCGTGG ACGAGGCCGC AGGGGATGCG
ATGATCTCTG CGCTGAAGGC GGCCGAGAGC GAGGGCGGCA CGGTGCATGG TGGCGCGCGT
GTCACCGAGG GCGTGCCCGC AGGCGGTGTC TATATGGCGC CCGCCATCGT CGAGATGCCC
GGTCAGAGCG CGTCGGTCAA AGAAGAGACC TTTGCGCCGA TCCTATACGT CATGGGCTAT
GACGATTTTG AGGACGCGGT GGAGATGCAG AACGACGTGC CGCAAGGGCT GAGCTCCTGC
GTGTTCACGC TCAATATGCG CGAGGCGGAG AGTTTCCTCA CCGCAGCCGG GTCCGATTGT
GGCATTGCCA ATGTGAACAT TGGGCCGTCG GGCGCGGAAA TCGGCGGTGC CTTTGGCGGC
GAGAAGGAAA CCGGTGGTGG GCGCGAGAGC GGCTCTGACG CGTGGAAATC CTACATGCGC
CGCCAGACCA ACACCGTGAA TTATTCGGCG GAATTGCCGC TGGCGCAGGG TGTGAAGTTC
GACATCTAA
 
Protein sequence
MARTADDILK NLGLTAAELS GGSRAVRSPI DGSTLAEVHD TPAGEMPAIL DRAQSAFKAW 
RVVPAPRRGE LIRLLGEELR AAKEDLGALV SWEAGKITSE GLGEVQEMID ICDFAVGLSR
QLYGLTIASE RPGHSMRETW HPAGPVGVIS AFNFPVAVWS WNAALAIVCG DPVIWKPSEK
TPLTALACTK IFERAVKRFG EDAPEGLLQI LIGDAELGKE LVASPSVPVI SATGSTRMGR
AVAPVVAERF GKCILELGGN NAMIVAPSAD LEMAVRAIVF SAVGTAGQRC TSLRRLIVHN
SIRADLVKRL KAAYAGLPIG DPQAAGTLVG PLVDEAAGDA MISALKAAES EGGTVHGGAR
VTEGVPAGGV YMAPAIVEMP GQSASVKEET FAPILYVMGY DDFEDAVEMQ NDVPQGLSSC
VFTLNMREAE SFLTAAGSDC GIANVNIGPS GAEIGGAFGG EKETGGGRES GSDAWKSYMR
RQTNTVNYSA ELPLAQGVKF DI
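
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the sequence is already in frame and written in the coding direction; the helper names are illustrative. The codon table is written out by hand, since translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in the permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGGCACGAA CAGCAGACGA TATCCTGAAG")
print(translate(fragment))  # MARTADDILK
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared with the reported GC content.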