Gene Dtox_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3970 
Symbol 
ID8430985 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4154410 
End bp4155511 
Gene Length1102 bp 
Protein Length366 aa 
Translation table11 
GC content51% 
IMG OID645036188 
Productpeptide chain release factor 2 
Protein accessionYP_003193286 
Protein GI258517064 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1186] Protein chain release factor B 
TIGRFAM ID[TIGR00020] peptide chain release factor 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000632573 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000161468 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTACACAG AGATTAAAAA GACGCTGGAG GAATTATCCG GTCGTGTGGC AGATTTGAGG 
GTTTCTCTTT GACATTGCCG GTAAAGAGTC GGAGATAGAA AAATTAGATA AGCTGATGAT
GGCGCCCGGT TTTTGGGATG ATCAGGCCCA GGCTCAAAAA ATTTCCCGGC AGCAGGCAGT
GCTCAAGGAT TCGGTAAGCC TTTATGGTGA GTTGAGGCAG TCACTGGAGG ATCTGGAGGT
GCTGGCTGAG CTGGCTCAGG AAGAGGACGA CGACATTGCC TTGCGGGAAA CCGCTAAGGA
TCTCAAACAA CTTGAGCTCC GGGTTTCTGA GTTGGAGCTT GATGTATTGC TCAGCGGCGA
GTATGACAGG GGAAACGCCA TTATTTCATT GCATGCCGGG GCCGGCGGCA CTGAGGCTCA
GGATTGGGTG GAAATGCTGC TGCGCATGTT TACCCGCTGG GCGGAAGATC ACCGGTACCG
GGTCATTGTT TTGGATATGC TGTCCGGTGA TGAGGCCGGC ATAAAGAGCG TTACGGTTGA
GATTGTCGGT CCCAATGCTT TCGGTTATTT AAAGTCGGAA AAAGGGGTGC ACCGGCTGGT
GCGCATTTCA CCGTTTGACA CGGCGGGCCG CAGGCACACC TCTTTTGCTT CGGTGGAGGT
TTTGCCGGAG GTGGATGAGG ATGTGGATAT TGAAATAAAA ACTGAGGATT TAAAAATAGA
TACTTTTCGG GCCGGTGGGG CCGGTGGGCA GCATGTGAAC AAAACTGACT CGGCTGTGCG
CATTACTCAT TTGCCCACCG GGATCGTGGT TTCCTGTCAG AATGAGAGAT CGCAGTTATC
CAACCGCAAT TCGGCTATGA AACTATTGAA GGCAAAGCTG GCGGAATTGG AACTGCAGAA
AAAAGAGGCT GAGATGGCTT CTTTGCGGGG CGACCACCAG GAAATTGCCT GGGGCAGCCA
GATTCGTTCT TATGTGTTTC ATCCTTATAG TTTGGTTAAG GATCACCGTA CCGGGGCGGA
AACGGGCAAT ATTGAGGCTG TGATGAATGG GGAACTGGAT AAATTTATGG CGGCTTTTTT
GCTTTGGCAG GTTCGGAAAT AG
 
Protein sequence
MYTEIKKTLE ELSGRVADLR VSLDIAGKES EIEKLDKLMM APGFWDDQAQ AQKISRQQAV 
LKDSVSLYGE LRQSLEDLEV LAELAQEEDD DIALRETAKD LKQLELRVSE LELDVLLSGE
YDRGNAIISL HAGAGGTEAQ DWVEMLLRMF TRWAEDHRYR VIVLDMLSGD EAGIKSVTVE
IVGPNAFGYL KSEKGVHRLV RISPFDTAGR RHTSFASVEV LPEVDEDVDI EIKTEDLKID
TFRAGGAGGQ HVNKTDSAVR ITHLPTGIVV SCQNERSQLS NRNSAMKLLK AKLAELELQK
KEAEMASLRG DHQEIAWGSQ IRSYVFHPYS LVKDHRTGAE TGNIEAVMNG ELDKFMAAFL
LWQVRK