Gene Dtox_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3843 
Symbol 
ID8430857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4029515 
End bp4030642 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content36% 
IMG OID645036071 
ProductS-layer domain protein 
Protein accessionYP_003193170 
Protein GI258516948 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TGTTTATTTT AATATTGCTT ACCGTTTTTG TATCAGCCTA TGGCGGCAAA 
TCTTTCGCGC AAGATTCACA GTTTGTTGAT TTATCCTCTG AGCACTGGGC TCATAATATT
ATTAATACCA TGGTTTTACA TGGTTATATG TTAGGTTACC CAGATAATAG TTTTAAACCT
GAAAAAGCAA TGTCCCGCGA AGAATTTGCA GTCTTACTTG TACGCTGCAT GGGTATTTCA
CCTACAAGTA CCGATAACTC TTCATTTATT GACGTACCTG TTGAACACTG GTCTGCAAGA
TACATAGAAG CTGTAAAAGA ATATTTGCCC GGTTATTCTT TGGGAAACGG TACTTTTAAC
TTCCAGGGGA CAAAACCTAT TTCCAGAGAG GACGTAACAG CTGCCTTGGT AAAAGCTAAA
GGCGGTGTCA ATTCAAAGCA AGAAGATCTT GACATTCTCA AAAATAATTT TAAAGATTAC
TCGTCAATAA CTGACAGCCT TCAGCCTCTT ATAGCTGCCG CCCTGTCAAA TGAATTAATT
TCCGGTTATC CTGACGGAAC TTTTAGGCCT GAGCAATCTT TAACCAGGTC ACAGGCGGCA
GTTTTGTTAT ACAGGGCCTT TTGCAGAACC AATTCAGTTA TAAAAAATAT GATTAGTTTG
GGAGATATTA CAACTATCAA CACATCGGAT CAAATATTTA CACAGTTATC CAAACAATTA
AATAATTCCA CTTCCCGTGC CATTATTGAC GGTGAAACTT ACAATTTTGA AGTATATGTC
AAAAACGCTG CACAAAACAC AGAACAATTA AAAGACCTTA TTTATGTTTT TGTTAAAAGC
AGCGCTCCTT TTAGTTTTGA GGCAGCAAAT AAATATCATC CCGAACAGGT GCAGGAATAC
ATGAATAAAA TAAAAAACAT GGTTAATAAC TATTATCCGG CATACCCTCT GTTAATTATG
ACCGGTTTTA GAAAACAATA CGATGAATAC GAGTGGGTAC AGACAAGAAG TCTGTTTGAG
GATACTGATT ATCAGATTAC TAAAGATGAG AATAGCAAAA GTCACCTGGT AGATAAATTT
TTTTCCGGTG TTTTGTATAT TAACGGTAAA GCAATTGAAA GTTTTTAA
 
Protein sequence
MKKMFILILL TVFVSAYGGK SFAQDSQFVD LSSEHWAHNI INTMVLHGYM LGYPDNSFKP 
EKAMSREEFA VLLVRCMGIS PTSTDNSSFI DVPVEHWSAR YIEAVKEYLP GYSLGNGTFN
FQGTKPISRE DVTAALVKAK GGVNSKQEDL DILKNNFKDY SSITDSLQPL IAAALSNELI
SGYPDGTFRP EQSLTRSQAA VLLYRAFCRT NSVIKNMISL GDITTINTSD QIFTQLSKQL
NNSTSRAIID GETYNFEVYV KNAAQNTEQL KDLIYVFVKS SAPFSFEAAN KYHPEQVQEY
MNKIKNMVNN YYPAYPLLIM TGFRKQYDEY EWVQTRSLFE DTDYQITKDE NSKSHLVDKF
FSGVLYINGK AIESF