Gene Dshi_2045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2045 
Symbolxsc 
ID5713040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2164196 
End bp2165974 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content66% 
IMG OID641267968 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_001533384 
Protein GI159044590 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.460294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATGA CGACGGAAGA GGCGTTTATC AAGGTTTTGC AGCGGCATGG TGTGGACCAT 
GCGTTCGGGA TCATCGGGTC TGCGATGATG CCGATTTCGG ACCTGTTTCC GGAAGCGGGG
ATCACCTTCT GGGACTGTGC GCACGAGGGC TCTGCGGGGA TGATGGCCGA TGGCTTCACC
CGGGCGTCGG GGCGGATGTC GATGATGATC GCCCAGAACG GCCCCGGCAT CACCAATTTC
GTCACCGCCG TGAAGACCGC CTACTGGAAC CACACGCCGC TTCTGCTGGT CACGCCCCAG
GCGGCCAACA AGACCATCGG CCAGGGCGGC TTCCAGGAGG TCGCGCAGAT GAAGCTTTTC
GAGGACATGG TCGCCTACCA GGAGGAGGTC CGCGATCCCT CGCGCATGGC CGAGGTGCTG
ACCCGGGTGA TCTCCAAGGC GAAAACCCTC TCGGGGCCCG CGCAGATCAA CATCCCGCGC
GATTTCTGGA CCCAGGTGAT CGATATCGAG ATCCCCGAGC CCATCGAATT CGAGCGCTCC
CCGGGCGGCG AGGCGTCGGT GGCGCGCGCC GCGGCGCTGT TGTCGGAGGC GAAAAACCCG
GTGATCCTGA ACGGCGCGGG CGTGGTGCTG TCGGAGGGCG GGATCGCGGC CAGCAAGGCG
CTGGCCGAGC GGCTCGATGC GCCCGTTTGC GTGGGATATC AACACAATGA CGCATTTCCG
GGGGGCCATC CGCTGTTCGC GGGGCCGCTG GGATACAACG GCTCGAAGGC GGCGATGGAG
CTGATTTCCG AGGCCGATGT GGTGCTGGCA CTCGGCACCC GGCTTAACCC GTTCTCAACG
CTTCCGGGCT ACGGAATGGA GTACTGGCCG GCGGATGCGA AAATCATCCA GGTCGACATC
AATTCCGACC GGATCGGGCT GACCAAGAAG ATCAGCGTCG GCATCGTCGG CGACGCGGCC
AAGGTGGCGC GCGGCATCCT GGGCCAGCTG GCCGAGGATG CGGGCGATGC GGGCCGCCAG
GAACGGCGCG ACCGGATCGC GCAGGTGAAA TCCCGCTGGG CCCAGCAGCT CAGCGCCATG
GACCATGAGG AGGACGACCC CGGCACCACC TGGAACGCCC GCGCCCGTGC CGCGAAACCC
GACTGGATGA GCCCGCGCAT GGCCTGGCGC GCGATCACCG CCGCCCTGCC GCGCGACGCG
ATCATCAGCT CGGATATCGG CAACAACTGT GCCATCGGCA ACGCCTATCC GGACTTCGAC
GCGCCGCGCA AATACCTCGC GCCGGGGCTC TTTGGCCCCT GCGGCTACGG GTTGCCGGCG
ATCGTGGGCG CCAAGATCGC CCAACCCGAC ACGCCGGTGG TGGGGTTTGC CGGGGACGGC
GCGTTCGGCA TCGCGGTGAA CGAGCTGACG GCGATCGGCC GCGGCGACTG GCCCGCGATC
ACGCAAGTGG TGTTCCGCAA CTACCAGTGG GGCGCGGAGA AGCGCAATTC CACGCTCTGG
TTCGACGACA ACTTCGTGGG CACCGAGCTC GACGAGGAGG TCTCCTATGC CGGCATCGCG
CGCGCCTGCG GGCTCGACGG CGTGGTCGTG CGCACCATGC AGGAGTTGAC CGACACCCTC
GCAACCGCCA TCAAGGCCCA GATGACCGAG GGCAAAACAA CCCTCATCGA GGTCCTGCTC
AACCAGGAAC TCGGAGAACC ATTCAGAAGA GACGCAATGA AAAAACCAAA CAAGGTCGCA
GGTGTCTCGA AACAGGATAT GATGGTCGAG GCTGGCTGA
 
Protein sequence
MRMTTEEAFI KVLQRHGVDH AFGIIGSAMM PISDLFPEAG ITFWDCAHEG SAGMMADGFT 
RASGRMSMMI AQNGPGITNF VTAVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEVAQMKLF
EDMVAYQEEV RDPSRMAEVL TRVISKAKTL SGPAQINIPR DFWTQVIDIE IPEPIEFERS
PGGEASVARA AALLSEAKNP VILNGAGVVL SEGGIAASKA LAERLDAPVC VGYQHNDAFP
GGHPLFAGPL GYNGSKAAME LISEADVVLA LGTRLNPFST LPGYGMEYWP ADAKIIQVDI
NSDRIGLTKK ISVGIVGDAA KVARGILGQL AEDAGDAGRQ ERRDRIAQVK SRWAQQLSAM
DHEEDDPGTT WNARARAAKP DWMSPRMAWR AITAALPRDA IISSDIGNNC AIGNAYPDFD
APRKYLAPGL FGPCGYGLPA IVGAKIAQPD TPVVGFAGDG AFGIAVNELT AIGRGDWPAI
TQVVFRNYQW GAEKRNSTLW FDDNFVGTEL DEEVSYAGIA RACGLDGVVV RTMQELTDTL
ATAIKAQMTE GKTTLIEVLL NQELGEPFRR DAMKKPNKVA GVSKQDMMVE AG