Gene Dtox_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3938 
Symbol 
ID8430953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4113068 
End bp4114486 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content45% 
IMG OID645036156 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_003193254 
Protein GI258517032 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000403843 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.477567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGG GTTATGGCTT GCATGTTGAG CAGACACAGA AATTAATCAT GACCCCGGAA 
TTGCGCCAGG CCATTACCGT TTTACAGTTG TCTTCCCTTG AGTTGAGCAT GTATATAGAC
CAGCAGTTGC AGGAAAACCC TATGCTGGAG GTTCGAGAAG ATGATTTAGA CCGGATTGAG
GAAAATGAGG GAGCAGAAGG CGACGCTAGC GGGGGGGAAG AGGAATTATC CCGGCAGGAA
TGTGATGTGG ACTGGGAAGA ATATTTTCAC GACAGTGACT TGGATTTGGG CCGCAGGGAA
AAATTAGCGG AGCAGTCCGG GAGCGGCTAT GAAAATTTTT TAACGCAGGC TCCAAATCTA
ACTGAATATT TGATGATGCA GTTAAATTTG AGCCGGTGCG GGGATTATTT AAAAGCTATC
GGTGAGTATA TTATAGGTAA TGTTGACCAC AACGGCTACC TGCACGTGTC AGTGAAGGAG
ATTGCCGAGC AATTAGAGGT AAGCCAGTCT AAGGTGGAGC AGGCTTTATC TGTTATTCAA
TCCTTTGATC CGCTTGGTGT CGGCGCTTCG TCTTTGCAGG AATGTTTGCT TATTCAGGTG
CGGTATTTGA ATATAAAAAA TAAGCTGGTT GCGGAACTGA TTGAAAAATA CTTGCCGGAT
ATTGCTAAAG GCAGGCTTAA CCAGATAGCT CAGCAGTTAG GAGTTGCAGT GACGGAGGTG
CAGGAGGCAG CAGATATTAT TAAGACGCTG GATCCCAAGC CCGGGCGCAA TTTTAGCGCT
ACGAACGATG TTCGCTATAT TGTTCCGGAT GTGATAGTAG AGAGAGTCGA AGGAGAATAT
ATTATTCTGG TTAACGATTC ATCGGTGCCT CGCCTGACTA TAAATACTGC ATATCGCTCT
GTTTTAACAC AGGATAAGTT TGATCTTCAG ACTCGCCGTT TTGTAGAAAG TAAACTCAAT
TCCGCTGCCT GGTTGTTGAA AAGTATTGAG CAGAGGCGCC TGACTTTATA TAAAGTAGCC
AGTTGCTTAG TTGATTTGCA GAAGGATTTT ATGGAATACG GTGTCAAGCA TTTGAAACCG
CTTAATTTAA AAACTGTAGC GGAAATAGTT GGCTTGCATG AATCGACTGT GAGCAGGGCC
ACCTCCAATA AGTACATTCA AACCCCGCAA GGCGTGTTTG AGATGAAATT TTTCTTTTCT
ACCGGCCTGA CTTCCGCCGG TGGAGGAATG ACTTCGGCTG AGAGTATTAA GAAGACACTC
AGAGAATTGA TTGCGTCTGA AGATGCCAGA AAACCGCTTA ATGATCAGAA GATCTCGGAT
ATTTTTGCCG AGCGAGGGAT AAAGATTTCT CGCAGGACGG TGGCTAAATA CAGGGATGAA
CTGAATATTC CTCCTTTGAA GCAGAGAAAA CGCTATTAA
 
Protein sequence
MRMGYGLHVE QTQKLIMTPE LRQAITVLQL SSLELSMYID QQLQENPMLE VREDDLDRIE 
ENEGAEGDAS GGEEELSRQE CDVDWEEYFH DSDLDLGRRE KLAEQSGSGY ENFLTQAPNL
TEYLMMQLNL SRCGDYLKAI GEYIIGNVDH NGYLHVSVKE IAEQLEVSQS KVEQALSVIQ
SFDPLGVGAS SLQECLLIQV RYLNIKNKLV AELIEKYLPD IAKGRLNQIA QQLGVAVTEV
QEAADIIKTL DPKPGRNFSA TNDVRYIVPD VIVERVEGEY IILVNDSSVP RLTINTAYRS
VLTQDKFDLQ TRRFVESKLN SAAWLLKSIE QRRLTLYKVA SCLVDLQKDF MEYGVKHLKP
LNLKTVAEIV GLHESTVSRA TSNKYIQTPQ GVFEMKFFFS TGLTSAGGGM TSAESIKKTL
RELIASEDAR KPLNDQKISD IFAERGIKIS RRTVAKYRDE LNIPPLKQRK RY