Gene TM1040_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1106 
Symbol 
ID4077813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1188053 
End bp1189552 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content61% 
IMG OID638006410 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_613101 
Protein GI99080947 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAC TGACCCATTA CGTGAATGGT GAAAAAGTCG CCGGGACCTC CGGCCGTTTT 
GCCGACGTTT TGAACCCCGC CACCGGTGAA GTGCAGGCCA AGGTGCCGCT TGCCACCAAG
GCGGAAATGG ACGCAATCAT CGCCAAAGCC GCCGAAGCAC AGGTGGAATG GGCGGCAACC
AACCCGCAAA AACGCGCCCG TGTGATGATG AAGTTCGGCC AACTCATCAA CGAACACATG
GACACGCTCG CAGAACTGGT TGCCCGTGAA CACGGCAAGA CCCTGCCTGA TGCGCGCGGC
GACGTGCAGC GCGGCCTTGA AGTTGTCGAG GTCTGCATGG GCACGCCGAG CCTGCTGAAA
GGCGAATTCA CCGACAGCGG CGGACCGGGC ATCGACCTTT ACTCCATGCG CCAGCCTCTG
GGTGTGGTTG CGGGCATCAC CCCCTTCAAC TTCCCGGCGA TGATCCCCTT GTGGAAAATG
GCCCCTGCCC TCTCGTGCGG CAACGCCATG ATCCTGAAAC CTTCCGAGCG CGTGCCGTCC
ACCTCGCTCT ATCTGGCGGA GCTTCTGAAA GAAGCCGGTC TGCCTGATGG TGTGCTGCAG
GTTGTGAACG GCGACAAGGA AGCCGTGGAC GCGATCCTCG ACAACGAGAC CATTCAGGCC
GTGGGCTTTG TGGGTTCCAC CCCGATTGCG CAGTATATCT ATGGCCGCGC GGCGACCAAC
GGCAAGCGCG CGCAGTGCTT TGGCGGCGCC AAGAACCACA TGCTGATCAT GCCTGATGCG
GATCTCGACA AGGCGGCAGA CGCGCTGGTT GGTGCCGGGT TTGGCGCAGC GGGCGAACGC
TGCATGGCGA TCTCCGTTGC GGTGCCGGTC GGCAAAGAAA CCGCCGATGG CCTCATTGAG
CGTCTGGTGC CCCGCATCGA GAAACTCAAG GTCGGCCCCT ACACCGCCGG TGAGGACATC
GACTACGGCC CCGTGATCAC CCCGCAGGCC AAGGCGCGCA TCGAGGGTCT CATTGACAGC
GGCGTCGAGC AGGGCGCAAC CCTTGTGACC GATGGCCGTG GCCTGACGCT GCAGGGGTAT
GAGAACGGCT ATTTTGTTGG CCCGACCCTC TTTGACAATG TCACCGCCGA GATGGACATT
TACAAAGAAG AGATCTTTGG CCCGGTTCTG TCGACAGTCC GCATGGACAA CTACGAGGAC
GCACTGAACC TTGTCAAAGA CAACGCCTAT GGCAACGGCA CCGCGATCTA CACTGCCGAT
GGTGACACCG CGCGTGACTT TGCCAACCGC GTGAACGTGG GCATGGTCGG TATCAACTTC
CCGATCCCGG TCCCGCTCAG CTACCACACC TTTGGCGGCT GGAAGAAATC GGCCTTTGGC
GATCTGAACC AATATGGCCC CGACGCCTTC CGCTTCTACA CCCGCACCAA GACTGTGACC
CAGCGCTGGT TCTCGGGCAT CAAAGAAGGC GGCGAATTCA ACTTCAAAGC CATGGACTGA
 
Protein sequence
MQELTHYVNG EKVAGTSGRF ADVLNPATGE VQAKVPLATK AEMDAIIAKA AEAQVEWAAT 
NPQKRARVMM KFGQLINEHM DTLAELVARE HGKTLPDARG DVQRGLEVVE VCMGTPSLLK
GEFTDSGGPG IDLYSMRQPL GVVAGITPFN FPAMIPLWKM APALSCGNAM ILKPSERVPS
TSLYLAELLK EAGLPDGVLQ VVNGDKEAVD AILDNETIQA VGFVGSTPIA QYIYGRAATN
GKRAQCFGGA KNHMLIMPDA DLDKAADALV GAGFGAAGER CMAISVAVPV GKETADGLIE
RLVPRIEKLK VGPYTAGEDI DYGPVITPQA KARIEGLIDS GVEQGATLVT DGRGLTLQGY
ENGYFVGPTL FDNVTAEMDI YKEEIFGPVL STVRMDNYED ALNLVKDNAY GNGTAIYTAD
GDTARDFANR VNVGMVGINF PIPVPLSYHT FGGWKKSAFG DLNQYGPDAF RFYTRTKTVT
QRWFSGIKEG GEFNFKAMD