Gene TM1040_0096 details

Gene Information

Locus tag: TM1040_0096
Symbol:
ID: 4078762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 100018
End bp: 102360
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table: 11
GC content: 63%
IMG OID: 638005383
Product: aldehyde dehydrogenase
Protein accession: YP_612091
Protein GI: 99079937
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
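
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 100018
end_bp = 102360

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2343 780
```

Both results agree with the Gene Length and Protein Length fields in the record.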


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.493554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCA AAGAGATCTT CGAGACCATG GACTATGGCC CCGCCCCGGA AAATGCCGCA 
GAAGCCCTTG CTTGGTTGGT CGACCAGGGC AGCCAGTTTG GTCACTTCAT CAACGGTGAT
TTCACCGCCC CCGGCACAGT GTTCGAGAGC AAGAACCCCG CAACCGGCGA GGTTCTGGCA
GAGCTGACAC AAGCCACGCA AAAGGACATT GATGCGGCTG TCAAAGCCGC CCGCGCCGCG
CAAGAAGGCT GGGCCGCGAT GGGCGGTTCC GGTCGCGCCA CATACCTCTA TGCCATCGCC
CGCCTCCTGC AGAAACACAG CCGCCTCTTT GCGGTGCTAG AGACGCTCGA CAACGGCAAA
CCGATCCGCG AGAGCCGCGA TATCGACGTG CCGCTGGTCC AGCGCCATTT CTACCACCAC
GCGGGCCACG CGCAGCTGAT GGATGACGAG ATGAGCGACC GCGCCGCACT TGGCGTTTGC
GGTCAGATCA TCCCGTGGAA TTTCCCGCTC TTGATGCTCG CGTGGAAGGT CGCGCCTGCA
CTCGCCATGG GTAACACGGT GGTGCTGAAA CCCGCCGAAT ATACCTCGCT CACCGCGCTG
CTCTTTGCCG ACATCTGCCG TCAGGCGGGC CTGCCCAAAG GGGTCGTGAA TATCGTAACC
GGCGACGGGG CCGTGGGCGA GATGATCGTC AATGCAGAGG TCGACAAGAT CGCCTTTACC
GGCTCCACTT CGGTGGGGCG CAAGATCCGC GAAGCCACTG CAGGGTCCGG CAAGGCGCTG
ACGCTCGAGC TTGGCGGCAA AAGCCCCTAC ATCGTCTTTG ACGACGCTGA TCTTGATAGC
GCCATCGAAG GCCTCGTGGA TGCGATCTGG TTCAATCAGG GGCAGGTCTG CTGCGCGGGC
TCCCGGCTCT TGGTGCAAGA GGGCGTCTCT GAGCGGTTCC ACCAGAAACT GCGCGCACGG
ATGAAAACCC TGCGTCTGGG CGATCCGCTC GACAAATGCA TCGACATCGG CGCTGTGGTC
GACCCCGTTC AACACGCCGA GATCAGCCGT CTGGTGGCCT CGGCCACAAA CTGCACCGTG
CATCAATCGG CAGTCAACAT GCCCGCAAAG GGCTGTTTCT TCCCGCCGAC CCTGATCGAG
GGCCTCTCGC CCTCTGATCC CTTGATGCAG GAAGAGATCT TTGGCCCGGT TCTGGTCTCT
ACCACCTTCC GCACCCCAGC AGAGGCTGTG GAGCTCGCAA ACAACACCCG CTACGGGCTT
GCAGCGACGC TCTGGACAGA GAATGTGAAC CTCGCGCTGG ATGTCGCACC CAAGCTCGTT
GCCGGCGTGG TCTGGGTCAA TGGAACCAAT ATGTTTGATG CCGCTGCCCC CTTTGGCGGC
GTGCGCGAGA GCGGCTTTGG CCGCGAGGGC GGAGTTGAGG GCCTGATGGC CTATACCAAG
CCCAAGGCAC AGTCTGAGGC GCTCCAGCCG GTTGTGGCCT TTGAAGGCAA CGCCAAAAGC
GCGACCCCGG AAGGGATTGA CCGCACCGCG AAGATGTTTG TCGGCGGCAA GCAAGCGCGC
CCCGACAGCG GCTATTCCAA ACCCGTGTAT GGCCCCAAAG GCGACCTATT GGGCCACGTC
GGCCTTGGCA GCCGCAAGGA TGTCAGAAAT GCCGTCGAGG CCGCTAATGC GGCCAAGGGC
TGGGCCAAGA CCACCGGCCA TCTACGCGCG CAGATCCTTT ATTATCTGGC CGAGAACCTC
GCCGCTCGCG CGGGGGAGTT TGCGGCCCGC ATTGACGCAA TGACCGGCAA AGGAGAGGGC
GCCGCAGAGG TCGAGGCGTC GCTGCAACGC CTCTTCTCTG CCGCAGCCTG GGCCGACAAA
TACGACGGTC TCGCCCATGG CGTGCCGATC CGTGGCGTGG CTCTTGGCAT GAAAGAACCC
GTGGGCACCA TTGGCGTCCT CTGCGCCGAT GAGGCGCCGC TCCTGGGGCT CGTCTCGGCC
ATGGCCCCCG CCATTGCCAT GGGCAACCGC GTGGTGCTCG CGGCCTCAGA GGCCTTTCCT
CTGGCGGCTA CGGATCTTTA TCAGGTGCTC GAAACCTCTG ATGTGCCCGC TGGCGTGGTC
AATATCCTCA CCGGCCCGCA CAAGGACCTC GGTGACACAA TGGCCAAGCA CCTCGATATC
GACGCGGTCT GGAGTTTCTC TTCCAGCGAC CTCTCCAAGA TGATCGAGGC CGCTTCTGCC
GGGAACCTCA AGCGCACCTG GGTCAACAAC GGCCACGCCT TCGATTGGTC GCGCGATCAG
TCAAAGCGCT TCTTGCAGGC CGCGACAGAG GTCAAGACCG TCTGGATCCC CTACGGCGAG
TGA
 
Protein sequence
MTIKEIFETM DYGPAPENAA EALAWLVDQG SQFGHFINGD FTAPGTVFES KNPATGEVLA 
ELTQATQKDI DAAVKAARAA QEGWAAMGGS GRATYLYAIA RLLQKHSRLF AVLETLDNGK
PIRESRDIDV PLVQRHFYHH AGHAQLMDDE MSDRAALGVC GQIIPWNFPL LMLAWKVAPA
LAMGNTVVLK PAEYTSLTAL LFADICRQAG LPKGVVNIVT GDGAVGEMIV NAEVDKIAFT
GSTSVGRKIR EATAGSGKAL TLELGGKSPY IVFDDADLDS AIEGLVDAIW FNQGQVCCAG
SRLLVQEGVS ERFHQKLRAR MKTLRLGDPL DKCIDIGAVV DPVQHAEISR LVASATNCTV
HQSAVNMPAK GCFFPPTLIE GLSPSDPLMQ EEIFGPVLVS TTFRTPAEAV ELANNTRYGL
AATLWTENVN LALDVAPKLV AGVVWVNGTN MFDAAAPFGG VRESGFGREG GVEGLMAYTK
PKAQSEALQP VVAFEGNAKS ATPEGIDRTA KMFVGGKQAR PDSGYSKPVY GPKGDLLGHV
GLGSRKDVRN AVEAANAAKG WAKTTGHLRA QILYYLAENL AARAGEFAAR IDAMTGKGEG
AAEVEASLQR LFSAAAWADK YDGLAHGVPI RGVALGMKEP VGTIGVLCAD EAPLLGLVSA
MAPAIAMGNR VVLAASEAFP LAATDLYQVL ETSDVPAGVV NILTGPHKDL GDTMAKHLDI
DAVWSFSSSD LSKMIEAASA GNLKRTWVNN GHAFDWSRDQ SKRFLQAATE VKTVWIPYGE
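
Both sequences are printed in space-separated groups of ten residues, so they need to be joined before any analysis. The following is a rough sketch of that cleanup, with a simple GC-fraction helper, using only the first line of the gene sequence as a short demonstration; the function names `clean_sequence` and `gc_fraction` are hypothetical and not part of the record.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (demonstration only;
# the GC content field refers to the full 2343 bp gene, not this fragment).
first_line = "ATGACAATCA AAGAGATCTT CGAGACCATG GACTATGGCC CCGCCCCGGA AAATGCCGCA "

seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to the full gene sequence block should yield a string whose length matches the Gene Length field.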