Gene Dtox_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4156 
Symbol 
ID8431170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4331509 
End bp4332609 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID645036349 
Productputative transmembrane anti-sigma factor 
Protein accessionYP_003193447 
Protein GI258517225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0909715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000583924 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACTGCC AAAAAACAAT GAATTTATTA TCACTCTACG TAGACGGCAG TCTCGACAAT 
AAACCAGACG ACCGTGCTAT AAAAGATCAC CTGTCGGCTT GCAAAGCCTG CAGCAGCGAG
TTTGCTTTGC AAAAAAGGCT TTCCACCGCT ATGAATAGTT TTAAGAGTGA GGATATAACA
GCACCGCCTG ATTTATGCGC CAATATCATG GGGCAACTAA AACAGGAGCG CAAAAAAGTT
TTCCACCTGC TTCCTGCCGC CTGGCGCAGA ACAATTGCCG CGGTTGCCGC AATACTGCTT
ATGGCAGGCA TGTCCTCAGG GATTACAAGC AGCTTGCTGC CGGTAGCCAA CAATGATAAG
CCGAATGCGG CGCGGCCCTC ACAAGTTGCC TCAACTGATA ACGGCGCTGC GGCAGAAGTT
AAACCTGAGA CGCATGATCC AAACCGGTCC AAAGATGCCG AGCAACAGCC GGACAGTAAC
GCAAATGAAT CAAAAGTTGA TGTTTCATCT AAAGAAACCA CGTCAGGCAA CGATGTCAAA
AAGAACGGCA CGGCTATGAC AAACACGGAA AGCAACACCG GAGAGGTTCC GTCAGGCACA
ACCAAGAAGC CTACAGAAGT TGGTCAAAGT TCCCCTTCTG TTAAAGCAAC ACCTTCATAT
GCAGAGAAAA CCGCATTTCT CAGTAAAAAT ATGGTGATAA CCAGTACCGT CTTAAAAATC
TCGGTAAATG ACTTGTCCGA AGCCAAGATA AAAGCAGTAG CCTTGGCCGC CGGCGCAGGA
GCCTCAAACC AGCTGTTTCC CGAGGAAGGC GGCTTGCTCA TGAGATTGGC TACTCCGGCT
GAGCAGGCAC AACAGCTCAT CAACGGGCTG TCCGGATTAG GCACGACGAT GGACAGACAA
GATGAAAACA GGGACATAAC TTCTTCTTAC AACAAAGCTT CTGTACAATA TGCCGAACTG
CAAGCCAGAA TAAGTGCATC GACTGATACA GAAGAGCGCA GGCAATTAGA AAACCAGGCG
GCAGGTTTTA AGAGGACTAT GGATTCATAT GAAGCTGATG CCGGTAAGAG GGTAATAGTT
TTATGGATAG AAAAAAAATA G
 
Protein sequence
MDCQKTMNLL SLYVDGSLDN KPDDRAIKDH LSACKACSSE FALQKRLSTA MNSFKSEDIT 
APPDLCANIM GQLKQERKKV FHLLPAAWRR TIAAVAAILL MAGMSSGITS SLLPVANNDK
PNAARPSQVA STDNGAAAEV KPETHDPNRS KDAEQQPDSN ANESKVDVSS KETTSGNDVK
KNGTAMTNTE SNTGEVPSGT TKKPTEVGQS SPSVKATPSY AEKTAFLSKN MVITSTVLKI
SVNDLSEAKI KAVALAAGAG ASNQLFPEEG GLLMRLATPA EQAQQLINGL SGLGTTMDRQ
DENRDITSSY NKASVQYAEL QARISASTDT EERRQLENQA AGFKRTMDSY EADAGKRVIV
LWIEKK