Gene Dtox_4188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4188 
Symbol 
ID8431202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4362810 
End bp4364297 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content50% 
IMG OID645036381 
Productpara-aminobenzoate synthase component I 
Protein accessionYP_003193479 
Protein GI258517257 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
[TIGR01824] aminodeoxychorismate synthase, component I, clade 2 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAACCC TGCCTTTGCT TAAAAAAATA GATGTCGCCG CCGACGCGGT TCTCTTGTAT 
GAGCGTTTGC AATCCGGTTC TTATAGTTTT TTTCTCGATA CCGGCATGAA ATCACCGGGC
CTCGGTCACC ATTCTTTTGT AGGCGCAGAT CCCTTTTTGC TGTTCGAAAC TAAAGACGAT
CTAATTACGA TAACCAGCAA CGGACGGCGG CAAAACATTA CCGGATCTCC TTTAAAGGAA
TTGAAAAGAC TTTTAGCCGA ATATCAAATG CCTCCGGTTG ACACGGGACT GCCTTTCAAC
GGGGGTGCCG TGGGCTTCTT CAGTTACGAT TTAGGCAGAC AGATAGAAGT TATACCTGAC
CGGGCCGTGG ATGACCTGGA ACTGCCCGAC TGCCAACTGG GCTTTTATGA TGTCCTGGCA
GCCGTCAATC ACCTGACCGG GGAAGTATTT GTTGTTTCCA CCGGGCTGCC CGAAAAAGAC
CCGGAACTGG CTTTCCGCCG GGCGGAAAAA AGACTGGCAG AAACGGAAAA GCTTTTGTGC
GGCCCGGAAT CTCAAAAGGA CGCCCGGCCC GCGCCTGCCG GGATCGCGAC ACCCGGCAGG
ATTTTTTCCT ATCCCGAGCG GCAGGAAATC AGTTTGCCTC AATCCCATTT TACCCAGGAG
AGTTATTGTT CCGCAGTACA AAAAGCTATT GATTATATTG CCTCAGGCGA TATTTTTCAG
GTTAACCTGT CCCAGCGCTT CTCTATACGC CAGACCACTG ATTCATGGAA ACTGTATAAG
AAATTGCGGG AGATTAATCC CGCTCCTTTT GCCGCCTTCC TGTCTTTTGC TGATGTAGAG
GTAATAAGCG CTTCCCCCGA ACGGTTTTTA AAAGTTACCG GCAAGCAGGT GGAAACCAGG
CCTATTAAAG GGACCAGGCC CAGGGGCAAA ACAAAAGCTG AGGATGCTCT TATGCGGCGT
GAACTCTGGG AGAGCATCAA GGACAGGGCG GAACTGGTTA TGATTGTGGA TATGGAAAGA
AACGATCTGG GACGGGTCTG TAAAATAGGC TCGGTAAAGG TGCCCGAGCT TTACCGGCTG
GAAGAATACG CGACCGTATT TCACCTGGTT TCCACGGTAG TCGGCGAACT GCCGGAAGAT
AAAACCACCA TAGATTTACT GGAGGCGGCT TTCCCCGGCG GCTCAATCAG CGGAGCACCT
AAAATCCGCT CCATGGAAAT CATTGAAGAA CTGGAACCTG TGCGGCGGGG AATCTATACC
GGGTCTATCG GCTATATCGG CTTTGACGGG GATGCTGACT TGAATATTGT TATTCGCACT
ATTATCGCCA GGCACGGCCG TTTTTACTTC CAGGTGGGCG GTGGTATTAC GGCTGACTCA
AATCCTTATG CCGAGTATAT TGAGACGCTG GATAAGGCGA GAGCTTTGAT GAAAGCACTG
GGATTAGAGG AGAAGGAGGA GTATTCTTGG AGCGTTTCGT CCAGATAA
 
Protein sequence
MKTLPLLKKI DVAADAVLLY ERLQSGSYSF FLDTGMKSPG LGHHSFVGAD PFLLFETKDD 
LITITSNGRR QNITGSPLKE LKRLLAEYQM PPVDTGLPFN GGAVGFFSYD LGRQIEVIPD
RAVDDLELPD CQLGFYDVLA AVNHLTGEVF VVSTGLPEKD PELAFRRAEK RLAETEKLLC
GPESQKDARP APAGIATPGR IFSYPERQEI SLPQSHFTQE SYCSAVQKAI DYIASGDIFQ
VNLSQRFSIR QTTDSWKLYK KLREINPAPF AAFLSFADVE VISASPERFL KVTGKQVETR
PIKGTRPRGK TKAEDALMRR ELWESIKDRA ELVMIVDMER NDLGRVCKIG SVKVPELYRL
EEYATVFHLV STVVGELPED KTTIDLLEAA FPGGSISGAP KIRSMEIIEE LEPVRRGIYT
GSIGYIGFDG DADLNIVIRT IIARHGRFYF QVGGGITADS NPYAEYIETL DKARALMKAL
GLEEKEEYSW SVSSR