Gene Dtox_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1849 
Symbol 
ID8428828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1963169 
End bp1964569 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content46% 
IMG OID645034185 
Productphage minor structural protein 
Protein accessionYP_003191319 
Protein GI258515097 
COG category[S] Function unknown 
COG ID[COG4926] Phage-related protein 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0322579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.615795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATATT TATTTGATTC AGCAGAAAAA CTGTTAGCCA TCTACTCCCA GGAAAATGCC 
ACTTGCCCAT ATTACGATGC AGTACATACA GAAAAACTTA CCGGGGAGAA TACCTTTATT
TTTACTATTC CGGCAGACCA CCAGGATAGC CGTTATATTA CTGAGGGGAA TCTTGTGGGA
TTTAAAGACC CATATAAAGA CTGGCAGCTT TTCGAGATAA AACGTATTAC AGATATTCAT
GGCGAGGGTT TAACCCGAAC TGCATACTGC GAGCATGTAC TTTATGAACT TATAGATGAT
TTTATCGAAG ATATACGGCC TACCGATTGT ACCGCATTAA TTGCGTTAAT TAAGGCTTTA
GAGGGAACCC GCTGGGAGCC TGGAGTTGTT GATGACCTGG GGGTAAACAG TACCAACTTT
TACTATGAAT CTGCACTTTC CGCTGTTCAA AAGGTTGCCG CCATATGGAA AGGTGAATTG
AGATTTCGAG TAGTTATCTC CAACAATGCT ATTACCAAGC GGTATGTCGA TTTACTGGCC
AGGCGTGGGG CAGTAACAGG CAAGCAGTTT ACCTACGACC GCAACACCTC GCAGATTGAG
CGAGAGGTTG ATTTAACCAG CGTTGTAACT GCTCTCTATG GCAGGGGAAA AGGCGTGGAG
GTGGGTGACG GGTACGGCAG ACGTTTAGAT TTCAGCGGAA TAGGATGGGC CGTAGCCAAC
GGAAATCCGG CTGATAAACC CTTGAGCCAA CGATGGATTG GAGATGCTCA GGCTCTCGCT
CAGTGGGGCA GGGCAGGCCG TCATCGCTTT GGCGTGTTTG AGGACTCTGA GGAAACTGAC
CCGGCGGTAC TATTACAAAA AACCTGGGAT ACCCTACAGG AACGAAAAAT GCCAAGAGTA
ACCTATAGCC TGGATGTTGT TGACCTGGAG AGTTTAAGCG GATATGGTCA CGAAAAGGTA
AGATTAGGTG ATACGGTCAG GGTTATCGAT AGGAAGTTTA ACCTAGAGAT TTTGGTGGAA
GCGAGAATAC TGGAGATTAA CCGAAACCTT TTAAAACCGG AGGATACCGA GATTACCCTG
GGTAACTTTA CCCCAAGTAT AACCGATGAA GCCTTGAAGC AAATGGAAAT TAATCGAGCC
GTTAATGATA AGCAGGGCGT ATGGGATAGG GCCAGCCAGT TTAATGCGGA CGGTACATTA
AGTGCCGGTA AGCTGACGGA TACACTGGTA GGCTTAGACC ATACTTTGCA ACTGGCGAGC
GAAGCTGTGA CCGAGGCTAA AATAGCTGTA GGGGCCATTT CAACTCCTAA ACTCGCTACT
AACGCTGTTA CCGCAGATAA ACTCGCACCC GGTACTATAA ATGAGGCAAA GATGAACTGG
AAAACACATC TTTTGTATTA A
 
Protein sequence
MLYLFDSAEK LLAIYSQENA TCPYYDAVHT EKLTGENTFI FTIPADHQDS RYITEGNLVG 
FKDPYKDWQL FEIKRITDIH GEGLTRTAYC EHVLYELIDD FIEDIRPTDC TALIALIKAL
EGTRWEPGVV DDLGVNSTNF YYESALSAVQ KVAAIWKGEL RFRVVISNNA ITKRYVDLLA
RRGAVTGKQF TYDRNTSQIE REVDLTSVVT ALYGRGKGVE VGDGYGRRLD FSGIGWAVAN
GNPADKPLSQ RWIGDAQALA QWGRAGRHRF GVFEDSEETD PAVLLQKTWD TLQERKMPRV
TYSLDVVDLE SLSGYGHEKV RLGDTVRVID RKFNLEILVE ARILEINRNL LKPEDTEITL
GNFTPSITDE ALKQMEINRA VNDKQGVWDR ASQFNADGTL SAGKLTDTLV GLDHTLQLAS
EAVTEAKIAV GAISTPKLAT NAVTADKLAP GTINEAKMNW KTHLLY