Gene TM1040_1883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1883 
Symbol 
ID4077380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1982575 
End bp1984032 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content61% 
IMG OID638007199 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_613878 
Protein GI99081724 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.86306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACATC CGGCCCAGCC CAAAGCCAGC CATTTCATCA ATGGCGAATA TGTCGAAGAC 
ACCTCCGGCA CCCCGATCCC GGTGATCTAT GCTGCCACTG GCGAGACCAT CGCCACCGTT
TACGCGGCCA CACCTGCAAT CGTCGAGCAG GCGCTGACCG CCGCTGAGGC TGCGCAGAAA
GAATGGGCCA AGATGTCCGG CACCGAGCGC GGCCGGGTGC TGCGCCGCGC CGCCGACATC
ATGCGTGAAC GCAATCACGA ACTGTCGGTG CTGGAAACCT ATGACACCGG CAAGCCCCTG
CAGGAAACCC TGGTCGCGGA TGCCACCTCT GGTGCGGATG CACTTGAATA TTTCGGGGGC
CTCGCGGGCA GCCTCACCGG CGAGCACATC CAGCTTGGCG AAGACTGGGT CTATACCGTG
CGCGAGCCGC TTGGCCTCTG TGTGGGGATC GGCGCGTGGA ACTACCCCAC CCAGATCGCC
TGCTGGAAGG GCGCACCGGC ATTGGCCTGT GGCAACGCCA TGGTGTTCAA ACCTTCCGAG
ACCACGCCGC TCTGCGCGCT CAAGGTGGCC GAGATCCTGA TGGAGGCCGG CGCGCCAGCC
GGGCTTTACA ACGTCATTCA GGGCATGGGC GAAGTGGGCG GCAAACTGGT CACGGACCCG
CGCGTCGACA AGGTCTCGCT CACCGGCTCC GTTCCGACCG GCAAAAAGGT CTATGCGGCT
GCGGCTGAGG GCATGAAACA TGTAACCATG GAGCTGGGTG GCAAATCCCC GATGATCGTC
TTTGAAGACG CCGACATCGA GAGCGCCATC TCCGGTGCCA TCAACGGCAA CTTCTATTCC
TCGGGTCAGG TCTGCTCGAA CGGCACCCGC GTCTTTGTTC ACAAGGACAT CAAGGAGAAG
TTCCTCGCCC GTCTGGCCGA GCGCACTGCG ACAGCCAAAC TGGGCGATCC GATGGATGAG
AGCGTGAACT TTGGCCCGAT GGTCAGCGAG CGCCAGATGG AGATCGTGCT CGGTTATATC
GAGAAGGGCA AATCCGAGGG CGCGCGCCTG ATTTGTGGTG GCAACCGCGC TGATATGGAT
GGCTTTTTTG TGGAGCCCAC GGTCTTTGCC GATGTCAAAG ACGATATGGT GATCGCCCGC
GAAGAGATCT TTGGCCCGGT CATGTGCGTT TTGGATTTCG AGAGCGAAGA CGAGGTGATC
GCCCGCGCCA ATGATACCGA ATTTGGTCTC TCGGCAGGGG TCTTCACCAA GGATTTCACC
CGCGCGCACC GGGTGATCGG TCAGCTTGAG GCCGGCAGCT GCTTTATCAA TTCCTACAAT
GACGCACCCG TCGAGGCGCC CTTTGGTGGC GTCAAAGCCT CTGGCGTGGG GCGTGAAAAT
TCCAAGGAAG CGATCAAGCA CTTCAGCCAG GTGAAGTCGG TCTACGTGCG CATGGGCGAC
TGCGAAGCAG CCTTCTAA
 
Protein sequence
MTHPAQPKAS HFINGEYVED TSGTPIPVIY AATGETIATV YAATPAIVEQ ALTAAEAAQK 
EWAKMSGTER GRVLRRAADI MRERNHELSV LETYDTGKPL QETLVADATS GADALEYFGG
LAGSLTGEHI QLGEDWVYTV REPLGLCVGI GAWNYPTQIA CWKGAPALAC GNAMVFKPSE
TTPLCALKVA EILMEAGAPA GLYNVIQGMG EVGGKLVTDP RVDKVSLTGS VPTGKKVYAA
AAEGMKHVTM ELGGKSPMIV FEDADIESAI SGAINGNFYS SGQVCSNGTR VFVHKDIKEK
FLARLAERTA TAKLGDPMDE SVNFGPMVSE RQMEIVLGYI EKGKSEGARL ICGGNRADMD
GFFVEPTVFA DVKDDMVIAR EEIFGPVMCV LDFESEDEVI ARANDTEFGL SAGVFTKDFT
RAHRVIGQLE AGSCFINSYN DAPVEAPFGG VKASGVGREN SKEAIKHFSQ VKSVYVRMGD
CEAAF