Gene Dtox_1375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1375 
Symbol 
ID8428324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1406134 
End bp1407342 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content47% 
IMG OID645033710 
Productproposed homoserine kinase 
Protein accessionYP_003190874 
Protein GI258514652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3635] Predicted phosphoglycerate mutase, AP superfamily 
TIGRFAM ID[TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form
[TIGR02535] proposed homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0348409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000388433 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAATATT TGATACTTTT AGGTGACGGT ATGGCCGATT ATAAAATACC TGAGTTGGGA 
GATAAAACAC CACTGCAATA TGCTAAAACG CCGCATATGG ATTACCTGGC TGCCAACGGA
GAAATAGGCC AGGTCTGGAC GGTGCCTGAG GGTTTTCCTC CGGGAAGTGA CGTGGCCAAC
CTCTCAGCTA TGGGCTATAA TCCTAAAACT TGCTACTCGG GCCGCTCACC GCTGGAAGCT
TACAGCATGG GTATTATCAT GAGTGAAACC GATGTTTCTT TTCGCACCAA TCTGGTCACT
CTGACCGAAG AGCCGGAAGA ATACGAGCAA AAAACTATTC TCGACCATAG TTCAGATGAA
ATAACTACGG AAGAAGCCAG GGAGTTAATA GGGGAAATAA GAAAACACCT GTCTACCGAA
GAATTGCAGT TTTTTGCCGG TATCAGCTAC AGGCATTTAT TGATATGGAA GAACGGGCAC
CTGGATATAG AACTGACACC GCCTCACGAT ATACCCGGGA GATGTATTGC GGATTACCTG
CCCAAAGGCC CGCACAGCCG GGTACTGTTG GAGATGATGA AAAAGAGCTA TGTTTTGTTG
AATGATCACC CGGTAAACAG GAGCAGGCGC AGGAATAACC TGCGTCCTGC CAATTCCATT
TGGTTTTGGG GTGAAGGCAA GAAGCCCTCA ATGGAAAACT TTTATAAGAA ATACGGGCTT
AAAGGCTCAG TCATTTCAGC TGTTGATTTA ATCATGGGAC TTGGCAAGTG TGCCGGCATG
GATGTGGTTA AAGTGGAAGG GGCGACGGGC GGTACACATA CAAATTTTAG AGGAAAGGCT
CTGGCTGCTC TGGAGGAACT GAAGAAAGGC AAGGATTTTG TTTATATTCA CGTAGAAGCC
CCGGATGAAG CCGGACACAG GGGGGAACTG GATACCAAGA TAAAAACCAT AGAAGAAATA
GACGGGCAAA TGCTGTCTGT TTTGCTGCCG GGGCTGGAGG AATTCGATGA TTACAAAATA
ATGCTGCTTC CGGATCATCC TACCCCGCTG GCTATCAGAA CGCACACCAA AGACCCTGTA
CCCTTTGCTA TTCTGCGCAA GGGTGTGAAG AAACAGAGCA ATGTCACCTA CTGTGAAGAG
GATGCAGCGA AAAGCGGCTT AAAATTTGCT GCCGGGCATG AGTTGATGAA TTACTTTATC
AACGGTTAA
 
Protein sequence
MKYLILLGDG MADYKIPELG DKTPLQYAKT PHMDYLAANG EIGQVWTVPE GFPPGSDVAN 
LSAMGYNPKT CYSGRSPLEA YSMGIIMSET DVSFRTNLVT LTEEPEEYEQ KTILDHSSDE
ITTEEARELI GEIRKHLSTE ELQFFAGISY RHLLIWKNGH LDIELTPPHD IPGRCIADYL
PKGPHSRVLL EMMKKSYVLL NDHPVNRSRR RNNLRPANSI WFWGEGKKPS MENFYKKYGL
KGSVISAVDL IMGLGKCAGM DVVKVEGATG GTHTNFRGKA LAALEELKKG KDFVYIHVEA
PDEAGHRGEL DTKIKTIEEI DGQMLSVLLP GLEEFDDYKI MLLPDHPTPL AIRTHTKDPV
PFAILRKGVK KQSNVTYCEE DAAKSGLKFA AGHELMNYFI NG