Gene Dtox_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1806 
Symbol 
ID8428778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1907886 
End bp1909064 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content47% 
IMG OID645034141 
Productputative transmembrane anti-sigma factor 
Protein accessionYP_003191282 
Protein GI258515060 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.841957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCCTCA AGGAAGGTGT TTTACAGGCT TACCTGGATC AGGAATTGAG CGCGGCACAG 
AGAGCTGAGG TAGAAGCTTG CCTGACCGGC TGTGAAGCAT GCAGGCAGTT GCTGGAGGTG
CTTAAGAGCA ATGATGCTTT TGTTGCAGCA AAGCTTGAGG CATATGCCTC CTGGTTGGAA
AATCCCTCTA TGGCCACCAA TGCGGCTTGG GCCAGATTTT ACGGTGATTT ACCGGCTAAA
GAGAGCAAGC ATAAATTTGG GGAGGTTTGG AAAGTTATGA AAAGATACAG GTTGGTGGCT
GTAGCCGGTG TTGTGGTTAT GGCGCTGGCG GCATCGTTTA GTTTCAGTTC TGTACGCAGT
TTTGCCGGAG AAATCTTAAC GGTTTTCCGA GTCAATAAGG TACAGACAAT AAATATCGAT
CCACAGGAAA TAGCCAGACT GCAGGAGTCT ATTGAAAAGG GTACCGGACA AGTAAATATA
GATAACTTTG GGAAAATTGA GCTGAGCAAT CAGCAAACCT CTGAAGAAGT CAGTCTGGAG
CAAGCCGGGC AGGCGGTAGA TTTTCCTGTC AAATTGCCTG TGCTGGCGGG AGATTATAAA
GGACCGGTTT TACAAAAGAG TTCGGCTGGT AGAGCCAGTT TTACTTTAGA TATAGTTAAG
GCCAATAGCC TGCTCAGCTC TTTGGGGAGC CAAAAACTTT TGCCCGAAGA ACTGGACCAG
AAAACTTTTA CTTTAACCAT GCCGGTAATA ATCGGGGCTC AGTATACGTT TGGCAGCGAG
GATAGCCGTT TATTGGTAGC TCAGGCCAGA AGCCCTGAAA TGTCAGTGCC GGAAGGTGTT
TCGGTTGAGC AGATACGTGA TACGCTTCTT TCTGTACCTG TTTTGCCGGA TAGTTTAAGA
CAGCAGTTAG CCTCGGTTAA TGACTGGCAG CATACGATTT TGATACCGAA TATTGATGGC
AACTCTCAGG AGGTAATGGT AAACGGTGTG CAGGGTGTGT TTACCACTGC TGTTGGGCAT
AGCGGTCATA AAGCTAATGA GAGAAATGAG CGGAGCCTGG ATGAGCGAGG TCATAACAGC
AGTGCGCTGA TTTGGCAAAA AGACGGCGTC ATATACGCGA TTGCCGGTAA CGAACTGACT
CTGGATCAGT CTTTGAGCAT GGCTTCCTCT ATGAAATAA
 
Protein sequence
MCLKEGVLQA YLDQELSAAQ RAEVEACLTG CEACRQLLEV LKSNDAFVAA KLEAYASWLE 
NPSMATNAAW ARFYGDLPAK ESKHKFGEVW KVMKRYRLVA VAGVVVMALA ASFSFSSVRS
FAGEILTVFR VNKVQTINID PQEIARLQES IEKGTGQVNI DNFGKIELSN QQTSEEVSLE
QAGQAVDFPV KLPVLAGDYK GPVLQKSSAG RASFTLDIVK ANSLLSSLGS QKLLPEELDQ
KTFTLTMPVI IGAQYTFGSE DSRLLVAQAR SPEMSVPEGV SVEQIRDTLL SVPVLPDSLR
QQLASVNDWQ HTILIPNIDG NSQEVMVNGV QGVFTTAVGH SGHKANERNE RSLDERGHNS
SALIWQKDGV IYAIAGNELT LDQSLSMASS MK