Gene Information |
Locus tag | Dshi_0805 |
Symbol | |
ID | 5711241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 812303 |
End bp | 814081 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641266714 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_001532151 |
Protein GI | 159043357 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
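
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(812303, 814081)
print(length_bp, protein_length(length_bp))  # prints "1779 592"
```

Under these assumptions the result agrees with the Gene Length (1779 bp) and Protein Length (592 aa) fields.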
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.153828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATGA CGACGGAAGA GGCGTTTATC AAGGTTTTGC AGCGGCATGG TGTGGACCAT GCGTTCGGGA TCATCGGGTC TGCGATGATG CCGATTTCGG ACCTGTTTCC GGAAGCGGGG ATCACCTTCT GGGACTGTGC GCACGAGGGC TCTGCGGGGA TGATGGCCGA TGGCTTCACC CGGGCGTCGG GGCGGATGTC GATGATGATC GCCCAGAACG GCCCCGGCAT CACCAATTTC GTCACCGCCG TGAAGACCGC CTACTGGAAC CACACGCCGC TTCTGCTGGT CACGCCCCAG GCGGCCAACA AGACCATCGG CCAGGGCGGC TTCCAGGAGG TCGCGCAGAT GAAGCTTTTC GAGGACATGG TCGCCTACCA GGAGGAGGTC CGCGATCCCT CGCGCATGGC CGAGGTGCTG ACCCGGGTGA TCTCCAAGGC GAAAACCCTC TCGGGGCCCG CGCAGATCAA CATCCCGCGC GATTTCTGGA CCCAGGTGAT CGATATCGAG ATCCCCGAGC CCATCGAATT CGAGCGCTCC CCGGGCGGCG AGGCGTCGGT GGCGCGCGCC GCGGCGCTGT TGTCGGAGGC GAAAAACCCG GTGATCCTGA ACGGCGCGGG CGTGGTGCTG TCGGAGGGCG GGATCGCGGC CAGCAAGGCG CTGGCCGAGC GGCTCGATGC GCCCGTTTGC GTGGGATATC AACACAATGA CGCATTTCCG GGGGGCCATC CGCTGTTCGC GGGGCCGCTG GGATACAACG GCTCGAAGGC GGCGATGGAG CTGATTTCCG AGGCCGATGT GGTGCTGGCA CTCGGCACCC GGCTTAACCC GTTCTCAACG CTTCCGGGCT ACGGAATGGA GTACTGGCCG GCGGATGCGA AAATCATCCA GGTCGACATC AATTCCGACC GGATCGGGCT GACCAAGAAG ATCAGCGTCG GCATCGTCGG CGACGCGGCC AAGGTGGCGC GCGGCATCCT GGGCCAGCTG GCCGAGGATG CGGGCGATGC GGGCCGCCAG GAACGGCGCG ACCGGATCGC GCAGGTGAAA TCCCGCTGGG CCCAGCAGCT CAGCGCCATG GACCATGAGG AGGACGACCC CGGCACCACC TGGAACGCCC GCGCCCGTGC CGCGAAACCC GACTGGATGA GCCCGCGCAT GGCCTGGCGC GCGATCACCG CCGCCCTGCC GCGCGACGCG ATCATCAGCT CGGATATCGG CAACAACTGT GCCATCGGCA ACGCCTATCC GGACTTCGAC GCGCCGCGCA AATACCTCGC GCCGGGGCTC TTTGGCCCCT GCGGCTACGG GTTGCCGGCG ATCGTGGGCG CCAAGATCGC CCAACCCGAC ACGCCGGTGG TGGGGTTTGC CGGGGACGGC GCGTTCGGCA TCGCGGTGAA CGAGCTGACG GCGATCGGCC GCGGCGACTG GCCCGCGATC ACGCAAGTGG TGTTCCGCAA CTACCAGTGG GGCGCGGAGA AGCGCAATTC CACGCTCTGG TTCGACGACA ACTTCGTGGG CACCGAGCTC GACGAGGAGG TCTCCTATGC CGGCATCGCG CGCGCCTGCG GGCTCGACGG CGTGGTCGTG CGCACCATGC AGGAGTTGAC CGACACCCTC GCAACCGCCA TCAAGGCCCA GATGACCGAG GGCAAAACAA CCCTCATCGA GGTCCTGCTC AACCAGGAAC TCGGAGAACC ATTCAGAAGA GACGCAATGA AAAAACCAAA CAAGGTCGCA GGCATCAACA AAAACGACAT GCTCGTAAAG GCAGACTGA
Protein sequence | MRMTTEEAFI KVLQRHGVDH AFGIIGSAMM PISDLFPEAG ITFWDCAHEG SAGMMADGFT RASGRMSMMI AQNGPGITNF VTAVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEVAQMKLF EDMVAYQEEV RDPSRMAEVL TRVISKAKTL SGPAQINIPR DFWTQVIDIE IPEPIEFERS PGGEASVARA AALLSEAKNP VILNGAGVVL SEGGIAASKA LAERLDAPVC VGYQHNDAFP GGHPLFAGPL GYNGSKAAME LISEADVVLA LGTRLNPFST LPGYGMEYWP ADAKIIQVDI NSDRIGLTKK ISVGIVGDAA KVARGILGQL AEDAGDAGRQ ERRDRIAQVK SRWAQQLSAM DHEEDDPGTT WNARARAAKP DWMSPRMAWR AITAALPRDA IISSDIGNNC AIGNAYPDFD APRKYLAPGL FGPCGYGLPA IVGAKIAQPD TPVVGFAGDG AFGIAVNELT AIGRGDWPAI TQVVFRNYQW GAEKRNSTLW FDDNFVGTEL DEEVSYAGIA RACGLDGVVV RTMQELTDTL ATAIKAQMTE GKTTLIEVLL NQELGEPFRR DAMKKPNKVA GINKNDMLVK AD
| |
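
The sequence fields above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute the GC fraction and check the terminal codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11. Only short fragments copied from the record are used here; the full Gene sequence field would be passed in the same way.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


first_block = normalize("ATGCGGATGA")  # first block of the gene sequence
last_block = normalize("GCAGACTGA")    # last block of the gene sequence
print(gc_fraction(first_block))        # prints 0.5
print(ends_with_stop(last_block))      # prints True
```

Applied to the complete gene sequence, `gc_fraction` can be compared against the GC content field of the record.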