Gene Dtox_3607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3607 
Symbol 
ID8430613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3803254 
End bp3804444 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content49% 
IMG OID645035835 
Producthypothetical protein 
Protein accessionYP_003192942 
Protein GI258516720 
COG category[S] Function unknown 
COG ID[COG3825] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAATG ATTTTTTTTA TACCCTCAGG CAGGAGGGGG TGCCGGTCAC TCCTACCGAA 
TGGATGACAC TGCACGAAGG TTTAAAAATG GGTTTAGCCT TTTCCGGGCT GACTGGTTTT
TATTATCTGG GGCGGGCCTG CCTGGTAAAA AGCGAGGCCC ATTATGACCG CTATGATTTG
GCCTTTCAAC GCTGTTTTGG TCAAATTAAT ACTCCGGAAG ATTTTTTGGA AAAGGTCTTG
GCCTGGTTAG AGAGTGAATT GCCGCCTTTG GAAACGGAGG AGAGTTCTCC TTTCAAAGCC
TGGAACCTGG AAGAATTGCG CCTGCTGCTG GAAGACCGGC TGAACCGCCA GGAGGAGAAA
CACGAGGGTG GCTCGCACTG GATTGGTCCC GGGGGACACT CTCGTCTGGG TCACTCCGGC
ATTAATCCTG TCGGGCTGAG AATTGACGGG CAGTCTGTAA ACAACAGCGC GGTAAAGGTT
GCCGGCCAAA GAAAGTACAA GGAACTGCGC ACAGATGAGA CCTTGGAGAC CAGGCATTTT
GAGGTGGCGC TGCGCAAGCT GAGGCAGCTT ACTACCAGAG AAGACGGTCC ACTGGACGAA
CTGGATTTGG ATGGGACTAT AGATGCTACC TGCCAAAACG GTGGTTTCCT GAAACTGGAT
TGGCGCAGGC CGCGCAGGAA TGAACTGAAG GTGGCGCTTT TTATGGATTC AGGCGGATCT
ATGACTCCCT ATGTGCATAT TGTCAAACGG CTTTTTACCG CCGTAAATAA ATCCAGCCAT
TTTAAGGATT TGCAGTTCTA TTATTTTCAC AATTGTATTT ACGAAAGAAT TTATGCTAAC
TCTATGTGTG TGCCCCGTGA TTCTGTGTCT ACCCGCGAGA TACTGAAAAA GCTTGCTTCC
GACTATCGTA TAATTATAGT TGGCGACGCC AGCATGTCTC CGGGTGAACT GATTATGACG
GGTGGGGCCA TTGATTGGGG AGTCAGTAAA AATGAGCCAG GCCTGGCCTG GTTGAAAAGG
TTTTCTAACC GTTTTAGGTA TGCAGCCTGG TTGAATCCGA AACCGGAAAA AAATTGGCAC
AGCACTGACG GGGCGGAGAC AATAGCCCTT ATACGCCGTT ATTTTTCTAT GTTTGAGTTA
ACAGTGGAGG GTTTGGAAAG AGCTGTTAAG CGGCTCAAAG TCAGCCGTTA A
 
Protein sequence
MFNDFFYTLR QEGVPVTPTE WMTLHEGLKM GLAFSGLTGF YYLGRACLVK SEAHYDRYDL 
AFQRCFGQIN TPEDFLEKVL AWLESELPPL ETEESSPFKA WNLEELRLLL EDRLNRQEEK
HEGGSHWIGP GGHSRLGHSG INPVGLRIDG QSVNNSAVKV AGQRKYKELR TDETLETRHF
EVALRKLRQL TTREDGPLDE LDLDGTIDAT CQNGGFLKLD WRRPRRNELK VALFMDSGGS
MTPYVHIVKR LFTAVNKSSH FKDLQFYYFH NCIYERIYAN SMCVPRDSVS TREILKKLAS
DYRIIIVGDA SMSPGELIMT GGAIDWGVSK NEPGLAWLKR FSNRFRYAAW LNPKPEKNWH
STDGAETIAL IRRYFSMFEL TVEGLERAVK RLKVSR