Gene Dtox_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1091 
Symbol 
ID8428030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1116965 
End bp1117960 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content40% 
IMG OID645033426 
Productspore coat protein, CotS family 
Protein accessionYP_003190600 
Protein GI258514378 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR02906] spore coat protein, CotS family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG TCAAATCGTT AAGACAAGTG CTGGCCGAGT ACGGTTTGGA TAATGCGAAG 
TGCAGCCTTT TAAGTGATAA AGGTAAGAAA TCAGTCTGGA AGATTGAGAC TAGTAATGGT
TATGCTGTCT TAAAAAAGAT GCCCGGTTCA CCTTCGAGAA CGTCCTGGTT GGCCCGTGCA
GTTGATCATT TGGGGCGCTC GGGTGTTAAT CTTGCCCCGT TAATTCCATC CTTGAAGGGC
AGTCTCTCTG TTACTGCAGA TCAGTCCGGT TTTATTTTAT ACCGCTGGTT GAGCGGCAGG
CAGCCTGAAT TTAACCGTGA TCTTGATGCT ATACTGGAGT CTATGGCTTG CTTTCACCGG
GGAGGGAAAG GTTTTCAATT AAGTCCTGAA GAGCATATGC GTTCGCATCT GGGCAAATGG
CAGGATGACT ACGCAAAGAA GAGGATAATC TTAACACAGA TAAGGGATGA AAAATGCCAT
ATAATGTTTG ATAAATTTTC CCGACAGGTG TTTAAATACA TTAATCACTT TATTGACAAA
ATTGTCCGAA TGGAAAAGCA GCTTAAAGCA TCCTGTTACA AGGAATGGGT TAACCGACTG
GGCGTGAATA CATGTTTCTG CCACCAGGAT TTTTCTCCGA AAAACCTGCG TTGGCATGAG
GGTAAAGTTT ATATATTCGA TTATGATTCT CTTACTCTGG ATATACCGGC CAGGGATATT
CGTAAGTTAA TTAATAAGCT GATGAAGAAA AAGTCTCTCG ACAAGATACT GCTTAATAAT
ATTTACCAAC TCTACAACAA GTATAATCAA ATAACTGAAA GTGAGTGGCG TGTTGTCTTA
ACTGATTTGC TGTTTCCACA TCTGTTTTAT GGTATTGTTA CCAAGTATTA TTTTAAACGT
GCACAGGATT GGTCAAAAGA AAAATATATC AAAAAGTTAG AGAGTATGAT CAATGTTGAG
CTGGAAAAAG ATATTGTTCT GTCCGGCCTA ATTTGA
 
Protein sequence
MSDVKSLRQV LAEYGLDNAK CSLLSDKGKK SVWKIETSNG YAVLKKMPGS PSRTSWLARA 
VDHLGRSGVN LAPLIPSLKG SLSVTADQSG FILYRWLSGR QPEFNRDLDA ILESMACFHR
GGKGFQLSPE EHMRSHLGKW QDDYAKKRII LTQIRDEKCH IMFDKFSRQV FKYINHFIDK
IVRMEKQLKA SCYKEWVNRL GVNTCFCHQD FSPKNLRWHE GKVYIFDYDS LTLDIPARDI
RKLINKLMKK KSLDKILLNN IYQLYNKYNQ ITESEWRVVL TDLLFPHLFY GIVTKYYFKR
AQDWSKEKYI KKLESMINVE LEKDIVLSGL I