Gene Dtox_3877 details

Gene Information

Locus tag: Dtox_3877
Symbol:
ID: 8430891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4057263
End bp: 4058414
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 42%
IMG OID: 645036096
Product: helix-turn-helix domain protein
Protein accession: YP_003193195
Protein GI: 258516973
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2856] Predicted Zn peptidase
TIGRFAM ID:
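
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 4057263
end_bp = 4058414

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1152 383
```

Both values agree with the Gene Length and Protein Length fields.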


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.862285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAAA AAATTATTGG TGCGAACCTT CGCCGGATTC GCGAGGCCAA GGGTTGGACT 
CAATCCCAGG TAGCCGATCT GGCTGGGATT TCAAGGGTTG CTTATCGAAA TATTGAAAAC
GGTAATACAA CCCCCAAAGT ATCGACCCTG CAAAACATCG CTTCCGCTGT TGGAGTAAAA
CTCCAGGATT TATTCATTCC GGTTCGCACC TTAAAGGGAG TCCGATTTCG AGCATCGAAA
AAGATGAACA GCCGGGACAA TATTTTGACT GAAGTAGCAC ATTGGCTTGA TGATTTTAAT
TACCTGGAAA GGTTACTAAA TGATCACAAA GACTACCAGT TCGAAGATCT TACCAGGGAG
TTGTCTTCAA TGCCTCCTGG AGATGACAGA GCTAAGCATG CAGCCGAACG AGTAAGAAAA
AAATTGAAAC TCAAAGAAAA AGAGCCTATC CGCGATATTG CTGGTCTGCT GGAAGCATGC
GGGATAAAAG TATACCCTCT GAGTCTTGTA TCAGACGGTT TTTTCGGCTT ATCTGTTGCC
GGAGAAGATG GCGGCCCTGC AGTCATTGTT AATGTTTGGG GAAGAATATC CGTTGAGCGA
TGGATTTTTA GCGCTGCTCA CGAACTAGGG CATTTACTTC TTCATTTAGA TACCTATAAC
ATAGAAGAAA GTTTTGAAGA CAAAGACCAA GAAAATGAAG CAAATGTCTT TGCTTCCCAT
TTTTTGATGC CAGAAAAAGC TTTTCAAGCT GAATGGATAG ATACTTACGG CTTGTCCTTT
GTCGACCGGG TTTTTAAAGT TAAGCAAATA TTCCTGGTAA GCTACAAAAC TGTTCTATAT
CGCCTTTCTG AAAGTCTGGG AAATTCCGTG TGGAAAAAAT TTCAGATTGC TTATAAGATG
AAAACTGGCA AAACATTGAG TATTGCGGAT GAGCCGGAGG CTTTGTCCCC AGATAAATTT
CAACAATCAT CGCCAGAAGT ATTGCGTTCC AGAGAACCTG ACTCTCTATC CCCCTCACAC
TTTATTGAAG ATCGTTTATC TAGATTGGTT CGTAAAGCTA TCGAAAAGGA TGAAATCACC
ATGAGTCGTG GGGCAGAAAT TCTTAGATTA GATCTTGAGG CCATGCGAGA AATGGTCTCT
TCATGGGTGT GA
 
Protein sequence
MDQKIIGANL RRIREAKGWT QSQVADLAGI SRVAYRNIEN GNTTPKVSTL QNIASAVGVK 
LQDLFIPVRT LKGVRFRASK KMNSRDNILT EVAHWLDDFN YLERLLNDHK DYQFEDLTRE
LSSMPPGDDR AKHAAERVRK KLKLKEKEPI RDIAGLLEAC GIKVYPLSLV SDGFFGLSVA
GEDGGPAVIV NVWGRISVER WIFSAAHELG HLLLHLDTYN IEESFEDKDQ ENEANVFASH
FLMPEKAFQA EWIDTYGLSF VDRVFKVKQI FLVSYKTVLY RLSESLGNSV WKKFQIAYKM
KTGKTLSIAD EPEALSPDKF QQSSPEVLRS REPDSLSPSH FIEDRLSRLV RKAIEKDEIT
MSRGAEILRL DLEAMREMVS SWV
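
As a cross-check between the two sequences, the following Python sketch translates a nucleotide string codon by codon and computes GC content. It is a minimal illustration, not the database's own pipeline. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table holds the standard amino acid assignments, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGATCAAA AAATTATTGG TGCGAACCTT CGCCGGATTC GCGAGGCCAA GGGTTGGACT"
print(translate(first_line))  # prints: MDQKIIGANLRRIREAKGWT
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` should allow the same comparison against the GC content field and the complete protein.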