Gene TM1040_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0074 
Symbol 
ID4075971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp76421 
End bp77863 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content62% 
IMG OID638005361 
Productaldehyde dehydrogenase 
Protein accessionYP_612069 
Protein GI99079915 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.68258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGAGA AACGCGAATT CTACATCAAT GGCCAGTGGG TTGAACCCTC GGCGGCCAAT 
GACTGCGCCG TTATCGACCC CTCCACCGAA GAGCCCTGCA CGGTGATCTC GCTCGGCAGT
CAGGCCGACA CCGATGCTGC CGTTGCCGCC GCCAAAGCCG CGCTGCCCGG CTGGATGGCG
ACGCCACCAG CAGAGCGCAT CGCACTGGTG GAGAAGCTCG TCGAGATCTA CAACAGTCGC
GCAGAAGATC TGGCGCAGGC CATGTCCAGC GAGATGGGCG CCCCCATCGA CATGTCGCGC
TCCAGTCAGG TAGGCGCGGG CAGCTGGCAC CTTAACGGTT TTATCGAGGC GGCAAAGAAT
TTCCAGTTCG AGCGCCCACT CGGCGATCAT GCCCCCAACG ATCGCATCAT CTATGAGGCC
GTAGGCGTTG CCGCGCTGAT CACCCCGTGG AACTGGCCGA TGAACCAGAT CACGCTGAAG
TTCGGCGCCG CTGCGATTGC GGGCTGCACC ATGGTGCTGA AACCCTCCGA GCAGAGCCCG
CTCAATGCGA TGATCTTTGC CGAACTGGTG CACGAAGCCG GCTTCCCGCC CGGTGTTTTC
AACCTCGTGA ACGGCGATGG CGCGGGCGTG GGCACGCAAC TGTCGTCGCA TCCGGATGTG
GACATGGTAT CCTTTACCGG CTCGACCCGC GCGGGTACGG CGATCTCCAA GGCTGCGGCA
GATACCCTGA AAAAGGTGCA TCTGGAGCTG GGTGGCAAAG GCGCCAACCT CGTCTTTGAA
GACGCCGATG AAAAGGCCGT GAAACGCGGC GTGCTGCATA TGATGCAGAA CACCGGTCAG
AGCTGCAACG CACCGTCGCG GATGCTGGTT CAAAAGAGTA TCTACGACCG CGTGGTTGAA
GAGGCCGCTG CGGTTGCCAA CAAGGTCGAG GTGGGCCCCG CCTCGCAAGA AGGCCGCCAT
ATCGGCCCCG TCGTCAACGA ACTGCAGTGG ACCAAGATCC AGGATCTGAT CCAGAAGGGC
ATCGACGAGG GCGCGCGCCT TGTGGCCGGG GGCACCGGTC GCCCGGACGG TCTGAACAAG
GGCTACTATG TGAAGCCCAC GGTGTTTGCA GATGTAAACA ACCAGATGAC CATCGCGCGC
GAGGAAATCT TTGGCCCAGT GATGGCAATC ATCCCCTTCG AGACCGAAGA AGAAGCTGTC
GAGATCGCCA ATGACACCCC CTATGGCCTG ACCAACTACG TGCAGACACA GGATGGTGCG
CGCGCCAACC GTCTGGCGCG GGTGCTGCGC TCGGGCATGG TGGAAATGAA CGGTAAATCC
CGCAGCGCCG GGTCGCCGTT TGGCGGCATG AAACAGTCCG GCAACGGCCG TGAAGGCGGC
GTCTGGGGGC TTGAGGACTT TATGGAAGTC AAAGCCGTAG GGGGCTGGAC GCCCGACGCC
TAA
 
Protein sequence
MLEKREFYIN GQWVEPSAAN DCAVIDPSTE EPCTVISLGS QADTDAAVAA AKAALPGWMA 
TPPAERIALV EKLVEIYNSR AEDLAQAMSS EMGAPIDMSR SSQVGAGSWH LNGFIEAAKN
FQFERPLGDH APNDRIIYEA VGVAALITPW NWPMNQITLK FGAAAIAGCT MVLKPSEQSP
LNAMIFAELV HEAGFPPGVF NLVNGDGAGV GTQLSSHPDV DMVSFTGSTR AGTAISKAAA
DTLKKVHLEL GGKGANLVFE DADEKAVKRG VLHMMQNTGQ SCNAPSRMLV QKSIYDRVVE
EAAAVANKVE VGPASQEGRH IGPVVNELQW TKIQDLIQKG IDEGARLVAG GTGRPDGLNK
GYYVKPTVFA DVNNQMTIAR EEIFGPVMAI IPFETEEEAV EIANDTPYGL TNYVQTQDGA
RANRLARVLR SGMVEMNGKS RSAGSPFGGM KQSGNGREGG VWGLEDFMEV KAVGGWTPDA