Gene Information |
Locus tag | TM1040_0155 |
Symbol | |
ID | 4078822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 169615 |
End bp | 171393 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005449 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_612150 |
Protein GI | 99079996 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
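The coordinates and lengths listed in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is a minimal Python illustration. It assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the protein length excludes the stop codon. The function and constant names are illustrative only and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 169615
END_BP = 171393
GENE_LENGTH_BP = 1779
PROTEIN_LENGTH_AA = 592


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 171393 - 169615 + 1 = 1779 bp, matching the listed gene length.
    print(span_length(START_BP, END_BP))
    # 1779 / 3 = 593 codons, minus one stop codon = 592 aa.
    print(expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, both computed values agree with the table: 1779 bp and 592 aa.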
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.316464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGA CCACCGAAGA AGCCTTTGTA AAAACACTGC AAATGCATGG TATCGAGCAT GCGTTTGGCA TTATCGGTTC CGCGATGATG CCGATTTCAG ATCTGTTTCC CAAGGCTGGG ATCACGTTCT GGGACTGCGC GCATGAGGGC TCCGCCGGGA TGATGGCCGA TGGCTACACG CGGGTGACGG GCAACATGTC GATGATGATT GCGCAGAACG GCCCGGGCAT CACCAACTTC GTCACCGCCG TAAAAACCGC CTATTGGAAC CACACGCCAC TGCTGCTGGT GACGCCACAG GCTGCCAACA AGACCATCGG CCAAGGGGGC TTTCAGGAGG TCGAACAGAT GAAACTGTTC GAGGATATGG TGGCCTATCA AGAAGAGGTT CGCGACCCGA GCCGCGTTGC GGAAGTTCTG AATCGCGTGA TCTGCAACGC GAAACGCGCC TCGGCCCCGG CGCAGATCAA CATTCCTCGC GACATGTGGA CTCAGGTCAT CGATATCGCC CTCCCCGAAA TCCTCACGTT CGAACGTCCT GCAGGAGGGG CGCAGGCCGT CTCAGAAGCC GCCTCGCTCC TCTCTGAGGC CCAAAATCCG GTGATCCTCA ATGGCGCAGG CGTGGTGCTC TCTGGCGGGG GTATCGCCGC CTCCATGGCG CTGGCGGAGC GGCTCGATGC GCCGGTATGC GTTGGCTATC AGCACAATGA CGCCTTTCCC GGCACGCATC CTCTTTTTGT GGGTCCCTTG GGGTACAATG GCTCTAAAGC GGCGATGGAG CTGATCCGCG ATGCCGATGT CGTGCTTTGC CTCGGCACCC GACTCAACCC TTTTTCCACT CTGCCCGGCT ATGGGATCGA CTATTGGCCA ACGGAAGCCG ATATCATTCA GGTGGACATC AACCCAGATC GGATCGGGCT GACGAAACCG GTGCGTGTGG GGATTGTGGG GGACGCGGCC AAAGTGGCCG ATGGCATTCT GCAACAGCTC TCAGAGGACG CCGGAGATGC CGGGCGCTCC GCACGCAAGG CCCATATCGC AGCAACAAAA TCCCGCTGGG CGCAGCAGCT TGCCGCGATG GACCATGAAG ACGACGACCC CGGCACCACC TGGAACGCCC GCGCGCGGTC CGCAAAACCC GACTGGATGA GCCCACGCAT GGCCTGGCGC GCGATACAGT CCGCACTCCC CTCAGACGCC ATCATTTCGT CCGACATTGG TAACAACTGC GCCATCGGCA ACGCCTATCC CACCTTCGAA GCAGGCCGCA AATATCTTGC CCCGGGGCTG TTTGGCCCAT GCGGGTATGG TTTGCCAGCG ATCATGGGCG CCAAGATCGG CTGCCCCGAG ACCCCGGTCG TGGGCTTTGC CGGAGACGGT GCGTTTGGCA TCTCGGTAAA CGAGCTGACC GCTATTGGAC GCGCAGAGTG GCCCGCAATT ACGCAGGTGG TGTTTCGCAA CTACCAATGG GGTGCCGAAA AACGCAACTC GACGCTCTGG TTTGACGACA ATTTCGTGGG CACAGAACTC GACACCAAGG TCTCTTATGC GGGGATTGCA CAAGCCTGCG GCCTGAAAGG CGTTGTCGCG CGCACCATGG AAGAGCTGAC CGCGACGTTG CGCCACGCGA TCGAGGATCA GGGCCGAGGC ATCACCACGC TGATCGAAGC CCTGATCAAT CAGGAGCTGG GAGAGCCCTT TCGCCGCGAC GCCATGAAAA AGCCGGTGGC GATTGCCGGG ATCTCGGCCG CAGACATGCG TCCACAGGCG CGCGCCTGA
|
Protein sequence | MRMTTEEAFV KTLQMHGIEH AFGIIGSAMM PISDLFPKAG ITFWDCAHEG SAGMMADGYT RVTGNMSMMI AQNGPGITNF VTAVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEVEQMKLF EDMVAYQEEV RDPSRVAEVL NRVICNAKRA SAPAQINIPR DMWTQVIDIA LPEILTFERP AGGAQAVSEA ASLLSEAQNP VILNGAGVVL SGGGIAASMA LAERLDAPVC VGYQHNDAFP GTHPLFVGPL GYNGSKAAME LIRDADVVLC LGTRLNPFST LPGYGIDYWP TEADIIQVDI NPDRIGLTKP VRVGIVGDAA KVADGILQQL SEDAGDAGRS ARKAHIAATK SRWAQQLAAM DHEDDDPGTT WNARARSAKP DWMSPRMAWR AIQSALPSDA IISSDIGNNC AIGNAYPTFE AGRKYLAPGL FGPCGYGLPA IMGAKIGCPE TPVVGFAGDG AFGISVNELT AIGRAEWPAI TQVVFRNYQW GAEKRNSTLW FDDNFVGTEL DTKVSYAGIA QACGLKGVVA RTMEELTATL RHAIEDQGRG ITTLIEALIN QELGEPFRRD AMKKPVAIAG ISAADMRPQA RA
|
| |
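The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The relationship between the gene sequence and the protein sequence can be sketched in Python as below. This is an illustration rather than the database's own procedure. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code, and it does not handle the alternative start codons that table 11 permits, which does not matter here because the first codon is ATG. All function names are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these
# assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    first_blocks = "ATGCGCATGA CCACCGAAGA AGCCTTTGTA"
    print(translate(first_blocks))  # MRMTTEEAFV
```

The output for the first three blocks matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison with the listed protein sequence and GC content.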