Gene TM1040_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0847 
Symbol 
ID4076022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp898671 
End bp900155 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID638006145 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_612842 
Protein GI99080688 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00938434 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTG ACGCCTTAAT CTCTACCTAT TTCGCCAACG GGCGCCTGTC GGATCTGCCG 
CGCGATCACT TTATCGACGG TCGCCTTGTT CCGGCTGAAA CCGGCAACCG GATGGAGAGC
TTTGATCCCG GCAGAGCCAC CGCCTTTGGG GATTTTGCCG CTGGCACCGC TGTGGATGTG
GACCGAGCGG TCAGCAGCGC TGTGGCGGGG TTTGCAATCT GGCGCGACAC CGCGCCGGCG
ACGCGCTGTG CGGTGTTGAT GGAGGCCGCA CGCCTGATGC GCGCCGAAGC TGACTGGCTG
GCTGTAATCG AGAGCCTTGA TAGCGGCAAA ACGCTGGCCG AAGCCTACGG CGACGTGCAG
GGCTCGGCGC GGCTCTTTGA GTATTATGCC GGCGCGGCAG ACAAGCTCGA TGGGCGCAGC
GTGAACCTCG GCAATGACAA CGCGGCCTTC ACCCTGCGCG AACCAGTCGG TGTCACGGCG
CATATCGTGC CGTGGAACTA TCCCACCTCG ACGCTGGTGC GCGGGATCGC GCCGGCGCTT
GCAGCGGGGT GTTCGGCGGT GGTGAAACCG GCCGAAACCA CGCCCTATAC GGCACTGATG
ATCGCAGAGC TGTTGATCCG CGCCGGGCTG CCTGCGGGCG TGGTGAATGT TGTGACCGGC
ACCGGACCGG AAGCAGGCGC GCCCTTGGTG CGCGATCCCC GGGTGCGCCA TGTGACCTTT
ACCGGTTCGG TGCAGACCGG CGTTGGCGTG ATGCAGGCCG TCGCCCCCAA TGTCACCGGA
CTGACGCTGG AACTCGGCGG CAAATCCCCG TTGGTGGCCT TTGCAGATGC AGATGCCGAG
AAAGTGGCCG AGGGCGCGCT CTGGGCGATC TTTTCCAACG CGGGGCAGAT CTGTTCGGCG
GGTTCACGGC TGGTGGTTCA CCGCGACCTC CATGCGCAGG TGCTCGAACG TCTGGTGAAG
AAGGCCACGA CCCTGCGGTT CGGTCATGGG CTGCACAACC CGGATATGGG GGCACTCAAC
TCCGAACGCC ATCAGGCGGC AATCAGCGGC CACGTTGAGC GCGCGCGCGC CCGAGGCGTC
GAAATCCTTT GCGGCGGTCA GGCAACCACA GACCCTGCCA CGGGCAAGGG TTGGTTCTTT
GAACCGACAA TTCTGGACGC GCTCTCTGCG GATGACCCGG CCATCCAGCA GGAAATCTTC
GGTCCCGTGC TCGGCGTGCA GGTTTTCGAC GACGAAGACG AAGCACTTGC TCTGGCCAAT
GGTACAGAGT TTGCGCTGGC GGCGGGGGTC TACACCAAGG ACACTTCAAC CGCCCTGCGC
ATGGCGCGCC GCATTGATGC GGGGCAGGTG ACGGTGAACG ACTATTGGGC AGGGGGCATC
GAACTGCCCT TCGGCGGCAA TCGCAAGTCT GGCTTTGGTC GCGAAAAAGG GCTGGAAGGT
GTTGATGCCT ACACCCGCGC CAAGGCAATC ACCCTCGCGG TATAA
 
Protein sequence
MSVDALISTY FANGRLSDLP RDHFIDGRLV PAETGNRMES FDPGRATAFG DFAAGTAVDV 
DRAVSSAVAG FAIWRDTAPA TRCAVLMEAA RLMRAEADWL AVIESLDSGK TLAEAYGDVQ
GSARLFEYYA GAADKLDGRS VNLGNDNAAF TLREPVGVTA HIVPWNYPTS TLVRGIAPAL
AAGCSAVVKP AETTPYTALM IAELLIRAGL PAGVVNVVTG TGPEAGAPLV RDPRVRHVTF
TGSVQTGVGV MQAVAPNVTG LTLELGGKSP LVAFADADAE KVAEGALWAI FSNAGQICSA
GSRLVVHRDL HAQVLERLVK KATTLRFGHG LHNPDMGALN SERHQAAISG HVERARARGV
EILCGGQATT DPATGKGWFF EPTILDALSA DDPAIQQEIF GPVLGVQVFD DEDEALALAN
GTEFALAAGV YTKDTSTALR MARRIDAGQV TVNDYWAGGI ELPFGGNRKS GFGREKGLEG
VDAYTRAKAI TLAV