Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1828 |
Symbol | xcs |
ID | 5712819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1907402 |
End bp | 1909183 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641267751 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_001533171 |
Protein GI | 159044377 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.902523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA CGACCGAAGA GGCCTTCGTG AAGGTTCTGC AGATGCACGG TATCGACAAC GCCTTCGGCA TTATCGGTTC CGCCATGATG CCGATCTCCG ACCTGTTCCC GCAGGCGGGC ATCAAGTTTT GGGACTGTGC CCATGAAACC TCCGGCGGCA TGATCGCCGA CGGCTACACC CGCGCCACGG GCAAGATGTC GATGATGATC GCCCAGAACG GCCCCGGCAT CACCAACTTC GTGACGCCCG TGAAGACCGC CTACTGGAAC CACACGCCGC TTCTGCTGGT CACGCCCCAG GCGGCCAACA AGACCATCGG CCAGGGCGGC TTCCAGGAAA TCGAGCAGAT GAAACTGTTC GAGGACATGG TGTGCTACCA GGAAGAGGTC CGCGACCCCT CGCGCATGGC CGAAGTCCTG AACCGGGTGA TCGAGAAAGC CTGGCGCGGC TCCGCCCCGG CGCAGATCAA CATCCCGCGC GACTACTGGA CCCAGGTGAT CGACATCGAG CTGCCCCAGA TCATCCGTCT GGAGCGCCCG CAGGGTGGCG AGCAGGCCGT CAAGGACGCC GCCAAGCTGC TCTCCGAAGC CGAGTTCCCG GTGATCCTGA ACGGCGCGGG CGTGGTCCTG TCCGGCGGGA TCGAGGCGTC CGCCAAACTG GCCGAAGCGC TGGATGCGCC CGTGGCCTGC AACTACCAGC ACAACGATGC CTTCCCCGGC TCGCACCCGC TGGGCGTTGG CCCGCTGGGC TATAACGGCT CCAAGGCGGC GATGGAGATC ATCCAGAAGG CCGACGTGGT CCTGGCCCTC GGTAACCGTC TGAACCCGTT CAGCACGCTG CCCGGCTACG GCATCGACTA CTGGCCGAAG GACGCCAAGA TCATCCAGGT CGACATCAAC TCCGACCGCA TCGGTCTGAC CAAGAAGGTC GACGTGGCCA TCCAGGGCGA CGCCAAGCGC GTGGCCGAGC AGCTTCTGGA GAACCTCTCC GACGGGGCGG GCGACAAGGG CCGCAAGGCG CGCAAGGAAC TGATCGCGCT GACCAAATCC CGCTGGGCGC AGGAACTGTC TTCGATGGAT CACGAGGACG ATTCCGACGA GGGCATCGAC TGGAACGAGC GCGCCCGCAA GGCCAAGCCC GATCACATGT CGCCGCGCCA GGCCTGGCGT GCGATCATGT CCGCGCTGCC CAAGGACGCG ATCATCAGCT CCGACATTGG CAACAACTGC GCCATCGGCA ACGCCTATCC GTCCTTCGAG AAGGGCCGCA AGTACCTCGC GCCCGGCCTC TTCGGACCCT GCGGCTACGG CCTGCCGGCG ATCCTCGGCG CCAAGATCGG CTGCCCGGAC GTCCCCGTGG TGGGCTTTGC CGGGGACGGC GCCTTCGGCA TCTCGATGAA CGAGATGACC GCCTGCGGGC GCGGCGACTG GCCGGCGATC ACCATGGTGG TGTTCCGCAA CTACCAGTGG GGCGCGGAAA AGCGCAACAC GACCCTGTGG TTCGAGGACA ACTTCGTCGG CACCGAGCTG AACGAGGGCG TCAACTATGC CGAGATCGCC AAGGGCTGCG GCCTCAAGGG CGTGCAATGC ACCGGCATGG AAGAGCTGAC CGATGCGCTC AACACCGCTG TGCGCGAGCA GATGAACGAT GGCGTGACCA CCTTCATCGA AGTCGTGCTG AACCAGGAAC TGGGCGAGCC TTTCCGCCGC GACGCGATGA AAAAGCCGGT CTCGGTTGCC GGGATCAATC GCGAGGACAT GCGCCCGCAG CAGGTCGTCT GA
|
Protein sequence | MKMTTEEAFV KVLQMHGIDN AFGIIGSAMM PISDLFPQAG IKFWDCAHET SGGMIADGYT RATGKMSMMI AQNGPGITNF VTPVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEIEQMKLF EDMVCYQEEV RDPSRMAEVL NRVIEKAWRG SAPAQINIPR DYWTQVIDIE LPQIIRLERP QGGEQAVKDA AKLLSEAEFP VILNGAGVVL SGGIEASAKL AEALDAPVAC NYQHNDAFPG SHPLGVGPLG YNGSKAAMEI IQKADVVLAL GNRLNPFSTL PGYGIDYWPK DAKIIQVDIN SDRIGLTKKV DVAIQGDAKR VAEQLLENLS DGAGDKGRKA RKELIALTKS RWAQELSSMD HEDDSDEGID WNERARKAKP DHMSPRQAWR AIMSALPKDA IISSDIGNNC AIGNAYPSFE KGRKYLAPGL FGPCGYGLPA ILGAKIGCPD VPVVGFAGDG AFGISMNEMT ACGRGDWPAI TMVVFRNYQW GAEKRNTTLW FEDNFVGTEL NEGVNYAEIA KGCGLKGVQC TGMEELTDAL NTAVREQMND GVTTFIEVVL NQELGEPFRR DAMKKPVSVA GINREDMRPQ QVV
|
| |