Gene Information |
Locus tag | Dtox_1375 |
Symbol | |
ID | 8428324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1406134 |
End bp | 1407342 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645033710 |
Product | proposed homoserine kinase |
Protein accession | YP_003190874 |
Protein GI | 258514652 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3635] Predicted phosphoglycerate mutase, AP superfamily |
TIGRFAM ID | [TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form; [TIGR02535] proposed homoserine kinase |
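
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1406134, End bp 1407342.
gene_len = gene_length(1406134, 1407342)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1209 402
```

Both results agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields of the record.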
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0348409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000000388433 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAATATT TGATACTTTT AGGTGACGGT ATGGCCGATT ATAAAATACC TGAGTTGGGA GATAAAACAC CACTGCAATA TGCTAAAACG CCGCATATGG ATTACCTGGC TGCCAACGGA GAAATAGGCC AGGTCTGGAC GGTGCCTGAG GGTTTTCCTC CGGGAAGTGA CGTGGCCAAC CTCTCAGCTA TGGGCTATAA TCCTAAAACT TGCTACTCGG GCCGCTCACC GCTGGAAGCT TACAGCATGG GTATTATCAT GAGTGAAACC GATGTTTCTT TTCGCACCAA TCTGGTCACT CTGACCGAAG AGCCGGAAGA ATACGAGCAA AAAACTATTC TCGACCATAG TTCAGATGAA ATAACTACGG AAGAAGCCAG GGAGTTAATA GGGGAAATAA GAAAACACCT GTCTACCGAA GAATTGCAGT TTTTTGCCGG TATCAGCTAC AGGCATTTAT TGATATGGAA GAACGGGCAC CTGGATATAG AACTGACACC GCCTCACGAT ATACCCGGGA GATGTATTGC GGATTACCTG CCCAAAGGCC CGCACAGCCG GGTACTGTTG GAGATGATGA AAAAGAGCTA TGTTTTGTTG AATGATCACC CGGTAAACAG GAGCAGGCGC AGGAATAACC TGCGTCCTGC CAATTCCATT TGGTTTTGGG GTGAAGGCAA GAAGCCCTCA ATGGAAAACT TTTATAAGAA ATACGGGCTT AAAGGCTCAG TCATTTCAGC TGTTGATTTA ATCATGGGAC TTGGCAAGTG TGCCGGCATG GATGTGGTTA AAGTGGAAGG GGCGACGGGC GGTACACATA CAAATTTTAG AGGAAAGGCT CTGGCTGCTC TGGAGGAACT GAAGAAAGGC AAGGATTTTG TTTATATTCA CGTAGAAGCC CCGGATGAAG CCGGACACAG GGGGGAACTG GATACCAAGA TAAAAACCAT AGAAGAAATA GACGGGCAAA TGCTGTCTGT TTTGCTGCCG GGGCTGGAGG AATTCGATGA TTACAAAATA ATGCTGCTTC CGGATCATCC TACCCCGCTG GCTATCAGAA CGCACACCAA AGACCCTGTA CCCTTTGCTA TTCTGCGCAA GGGTGTGAAG AAACAGAGCA ATGTCACCTA CTGTGAAGAG GATGCAGCGA AAAGCGGCTT AAAATTTGCT GCCGGGCATG AGTTGATGAA TTACTTTATC AACGGTTAA
|
Protein sequence | MKYLILLGDG MADYKIPELG DKTPLQYAKT PHMDYLAANG EIGQVWTVPE GFPPGSDVAN LSAMGYNPKT CYSGRSPLEA YSMGIIMSET DVSFRTNLVT LTEEPEEYEQ KTILDHSSDE ITTEEARELI GEIRKHLSTE ELQFFAGISY RHLLIWKNGH LDIELTPPHD IPGRCIADYL PKGPHSRVLL EMMKKSYVLL NDHPVNRSRR RNNLRPANSI WFWGEGKKPS MENFYKKYGL KGSVISAVDL IMGLGKCAGM DVVKVEGATG GTHTNFRGKA LAALEELKKG KDFVYIHVEA PDEAGHRGEL DTKIKTIEEI DGQMLSVLLP GLEEFDDYKI MLLPDHPTPL AIRTHTKDPV PFAILRKGVK KQSNVTYCEE DAAKSGLKFA AGHELMNYFI NG
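
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and how a GC percentage could be derived from the result; the helper names `clean_sequence`, `gc_percent`, and `ends_with_stop` are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, used only as a short demonstration.
demo = clean_sequence("ATGAAATATT TGATACTTTT")
print(len(demo), gc_percent(demo))  # 20 15.0

# The last block of the gene sequence ends in TAA.
print(ends_with_stop(clean_sequence("AACGGTTAA")))  # True
```

Applying `gc_percent` to the complete gene sequence would give the value to compare against the GC content field of the record.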