Gene Dtox_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0235 
Symbol 
ID8427159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp255161 
End bp256345 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content45% 
IMG OID645032622 
Productdihydropteroate synthase 
Protein accessionYP_003189811 
Protein GI258513589 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR01496] dihydropteroate synthase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.683276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTAA CAATTCGTAA TGTAATTGTA CATAATAAAA AGGAGTCTTT GGCGGAGATA 
ATATCTACCG GTGCTGATAT ATCCGGGTGT CGCTTAATGG CTCCTAAAGG AGTGCACAGG
CTAATTAAGT TAACCGGCTT AAGTCCCAAG CAGGCTAATA TAATTAAGCA GCAGATGTTG
AGCAGAGGTG GGGAGGCCGC AGTATCCAGA GGAGTTATAG ACTGCTCTGC CTCTGAGGCA
ATAGTATTGA TTATGGGTAC CTTAAAGCAG TTTGACGGAC TTGTGAATAT ATTAAAAATG
CAGCCCTTTG GTCTGCCCAA AATCGGTGCA CGGATTAAAC AAGTATTAAA AAACCTGGAA
GGCAGATCTC CTGTGGAAAT AGACTGCCGG GGAAAGATCC TCCTTCTGGG TGAACGCACA
TTAATCATGG GAATTTTGAA TGTGACTCCG GATTCTTTTT CGGATGGGGG AAGTTTTTAT
AACCATGAAG TTGCAATTGA ACATGCCAGG GCAATGGTGT CCGAAGGAGC GGACATAATA
GACTTGGGAG GAGAGTCAAC ACGACCGGGT CATGAATCCA TCAGTGTGGA AGAAGAATTA
GACCGGGTAA TACCTGTTTT GGAAAAACTG GTTAGAGAAA TTGATGTTCC TGTTTCCATT
GATACAACCA AGGCAGAAGT AGCGCGTCGC GCTTTGCTTG CCGGTGCTCA TCTGATTAAC
GATCAGTGGG CTTTAAGGGC AGACCCGGAT ATGGCTCATG TAGTAGCGGA ATACGATGTT
CCTTTAATTA TCATGCATAA CCAAAAAGGT ACAGAGTACA ATGATCTCAT GGGTGATATG
GTGCAGTTTT TTGAAGAAAG TATAGATACA GCGGTAAATG CCGGGTTAGC CAGAGAAAAG
ATAATTGTTG ATCCCGGCAT TGGTTTTGGC AAAACGCTGG AACAGAATCT TGAAACCATG
CGAAGATTAG ATGAATTGTC TTGTTTGGGT TGTCCGGTAC TCCTGGGTAC TTCGCGTAAG
TCAATGATCG GCAAGGTTCT TGATCTGCCT GTTGATGAGC GGGTGGAGGG TACTGCGGCT
ACGGTAACAC TGGGCATAGC TAACGGTGCT GATATTGTGC GGGTGCATAA TGTTAAAGAA
ATGGTTCGTG TCGCCAGAAT GACTGATGCG ATGGTGAGAA GATAA
 
Protein sequence
MSVTIRNVIV HNKKESLAEI ISTGADISGC RLMAPKGVHR LIKLTGLSPK QANIIKQQML 
SRGGEAAVSR GVIDCSASEA IVLIMGTLKQ FDGLVNILKM QPFGLPKIGA RIKQVLKNLE
GRSPVEIDCR GKILLLGERT LIMGILNVTP DSFSDGGSFY NHEVAIEHAR AMVSEGADII
DLGGESTRPG HESISVEEEL DRVIPVLEKL VREIDVPVSI DTTKAEVARR ALLAGAHLIN
DQWALRADPD MAHVVAEYDV PLIIMHNQKG TEYNDLMGDM VQFFEESIDT AVNAGLAREK
IIVDPGIGFG KTLEQNLETM RRLDELSCLG CPVLLGTSRK SMIGKVLDLP VDERVEGTAA
TVTLGIANGA DIVRVHNVKE MVRVARMTDA MVRR