Gene Dshi_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1828 
Symbolxcs 
ID5712819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1907402 
End bp1909183 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content65% 
IMG OID641267751 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_001533171 
Protein GI159044377 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.902523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA CGACCGAAGA GGCCTTCGTG AAGGTTCTGC AGATGCACGG TATCGACAAC 
GCCTTCGGCA TTATCGGTTC CGCCATGATG CCGATCTCCG ACCTGTTCCC GCAGGCGGGC
ATCAAGTTTT GGGACTGTGC CCATGAAACC TCCGGCGGCA TGATCGCCGA CGGCTACACC
CGCGCCACGG GCAAGATGTC GATGATGATC GCCCAGAACG GCCCCGGCAT CACCAACTTC
GTGACGCCCG TGAAGACCGC CTACTGGAAC CACACGCCGC TTCTGCTGGT CACGCCCCAG
GCGGCCAACA AGACCATCGG CCAGGGCGGC TTCCAGGAAA TCGAGCAGAT GAAACTGTTC
GAGGACATGG TGTGCTACCA GGAAGAGGTC CGCGACCCCT CGCGCATGGC CGAAGTCCTG
AACCGGGTGA TCGAGAAAGC CTGGCGCGGC TCCGCCCCGG CGCAGATCAA CATCCCGCGC
GACTACTGGA CCCAGGTGAT CGACATCGAG CTGCCCCAGA TCATCCGTCT GGAGCGCCCG
CAGGGTGGCG AGCAGGCCGT CAAGGACGCC GCCAAGCTGC TCTCCGAAGC CGAGTTCCCG
GTGATCCTGA ACGGCGCGGG CGTGGTCCTG TCCGGCGGGA TCGAGGCGTC CGCCAAACTG
GCCGAAGCGC TGGATGCGCC CGTGGCCTGC AACTACCAGC ACAACGATGC CTTCCCCGGC
TCGCACCCGC TGGGCGTTGG CCCGCTGGGC TATAACGGCT CCAAGGCGGC GATGGAGATC
ATCCAGAAGG CCGACGTGGT CCTGGCCCTC GGTAACCGTC TGAACCCGTT CAGCACGCTG
CCCGGCTACG GCATCGACTA CTGGCCGAAG GACGCCAAGA TCATCCAGGT CGACATCAAC
TCCGACCGCA TCGGTCTGAC CAAGAAGGTC GACGTGGCCA TCCAGGGCGA CGCCAAGCGC
GTGGCCGAGC AGCTTCTGGA GAACCTCTCC GACGGGGCGG GCGACAAGGG CCGCAAGGCG
CGCAAGGAAC TGATCGCGCT GACCAAATCC CGCTGGGCGC AGGAACTGTC TTCGATGGAT
CACGAGGACG ATTCCGACGA GGGCATCGAC TGGAACGAGC GCGCCCGCAA GGCCAAGCCC
GATCACATGT CGCCGCGCCA GGCCTGGCGT GCGATCATGT CCGCGCTGCC CAAGGACGCG
ATCATCAGCT CCGACATTGG CAACAACTGC GCCATCGGCA ACGCCTATCC GTCCTTCGAG
AAGGGCCGCA AGTACCTCGC GCCCGGCCTC TTCGGACCCT GCGGCTACGG CCTGCCGGCG
ATCCTCGGCG CCAAGATCGG CTGCCCGGAC GTCCCCGTGG TGGGCTTTGC CGGGGACGGC
GCCTTCGGCA TCTCGATGAA CGAGATGACC GCCTGCGGGC GCGGCGACTG GCCGGCGATC
ACCATGGTGG TGTTCCGCAA CTACCAGTGG GGCGCGGAAA AGCGCAACAC GACCCTGTGG
TTCGAGGACA ACTTCGTCGG CACCGAGCTG AACGAGGGCG TCAACTATGC CGAGATCGCC
AAGGGCTGCG GCCTCAAGGG CGTGCAATGC ACCGGCATGG AAGAGCTGAC CGATGCGCTC
AACACCGCTG TGCGCGAGCA GATGAACGAT GGCGTGACCA CCTTCATCGA AGTCGTGCTG
AACCAGGAAC TGGGCGAGCC TTTCCGCCGC GACGCGATGA AAAAGCCGGT CTCGGTTGCC
GGGATCAATC GCGAGGACAT GCGCCCGCAG CAGGTCGTCT GA
 
Protein sequence
MKMTTEEAFV KVLQMHGIDN AFGIIGSAMM PISDLFPQAG IKFWDCAHET SGGMIADGYT 
RATGKMSMMI AQNGPGITNF VTPVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEIEQMKLF
EDMVCYQEEV RDPSRMAEVL NRVIEKAWRG SAPAQINIPR DYWTQVIDIE LPQIIRLERP
QGGEQAVKDA AKLLSEAEFP VILNGAGVVL SGGIEASAKL AEALDAPVAC NYQHNDAFPG
SHPLGVGPLG YNGSKAAMEI IQKADVVLAL GNRLNPFSTL PGYGIDYWPK DAKIIQVDIN
SDRIGLTKKV DVAIQGDAKR VAEQLLENLS DGAGDKGRKA RKELIALTKS RWAQELSSMD
HEDDSDEGID WNERARKAKP DHMSPRQAWR AIMSALPKDA IISSDIGNNC AIGNAYPSFE
KGRKYLAPGL FGPCGYGLPA ILGAKIGCPD VPVVGFAGDG AFGISMNEMT ACGRGDWPAI
TMVVFRNYQW GAEKRNTTLW FEDNFVGTEL NEGVNYAEIA KGCGLKGVQC TGMEELTDAL
NTAVREQMND GVTTFIEVVL NQELGEPFRR DAMKKPVSVA GINREDMRPQ QVV