Gene Information |
Locus tag | ECH_0528 |
Symbol | thiL |
ID | 3927823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 530003 |
End bp | 530950 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 637901650 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_507342 |
Protein GI | 88658107 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
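Note: the start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the terminal stop codon is not counted in the protein length. The following is a minimal sketch of that arithmetic under those two assumptions. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(530003, 530950)
print(length, protein_length_aa(length))  # 948 315
```

The strand field does not enter this calculation. It only determines which strand of the replicon the 948 bp are read from.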
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAT TCGAATATAT TAAAAATTAC ATATATAAAT TAGATGATGA TTCATTAATA GGAGATGATG CTGCTACTAT AAATTGTACA CAAAATAAAT TATTAGTAAC AAAAGATATA CTTATAGAAG GTGTACATTT TTTAAAACAG TGCAATCCAA ATATATTAGC TAAAAAAGCT CTTAGATCAA ACTTATCAGA TATTGCAGCA ATGGGAGCTA TACCTTACGG CTACTGCTTA GGATTAGTAC TACCTAATAA CATATCCCAA GATTGGTGGA AAAATTTTAC TGATAGCTTA AGAGAAGAAC ATGAAAAATT CTGTATAAAA CTATTAGGAG GTGATACAAC ATCTCATAAA CAAGATGAAA TTATAGTAAG CATTACTGCA TTTGGAACAA GTAATGGTAA CATACTAAAA AGATCTGGAG CAAAAATTGG AGACTTTATC TATGTTAGCG GCAATATTGG AGATGCTGCA CTTGGATTAC TTGTGTATCA AAAAATAATC AACAAAAACT ATTACAAACT AAAAAATAAA TATGATATAC CTCAACCAAG AATTAATTTA GGAATCAGCA TTAATAAAAT CGCGTCTTCA TGTATAGACA TTTCTGATGG ATTAATACAA GATATTGAAC ATATTTGCAA CTCATCTCAA GTAGGCGCAT CAATATATTT AGATAAAATA CCTTTATCAA ATGAAGCCAA AGAAATAATA AACAATACAC CTCAATATAT AAACTATATT TTATCTGGAG GAGACGATTA TGAATTAGTG TTTACGATAA ATCCCAAATT CTCTCACCTA ATACAAGACA TATCTTGTAA AAATAAAGTA AAAATAAGTA AGATAGGAGA GATAACACTA GGCAATTGCG TCACACTATA TGATAATAAC AGAAATATTA TTACACCAAC AAATAAAGGA TTTAATCATT TTATGTAA
|
Protein sequence | MQEFEYIKNY IYKLDDDSLI GDDAATINCT QNKLLVTKDI LIEGVHFLKQ CNPNILAKKA LRSNLSDIAA MGAIPYGYCL GLVLPNNISQ DWWKNFTDSL REEHEKFCIK LLGGDTTSHK QDEIIVSITA FGTSNGNILK RSGAKIGDFI YVSGNIGDAA LGLLVYQKII NKNYYKLKNK YDIPQPRINL GISINKIASS CIDISDGLIQ DIEHICNSSQ VGASIYLDKI PLSNEAKEII NNTPQYINYI LSGGDDYELV FTINPKFSHL IQDISCKNKV KISKIGEITL GNCVTLYDNN RNIITPTNKG FNHFM
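Note: both sequences above are displayed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. The following is a minimal sketch, not a definitive implementation, of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG) and uses the standard codon assignments, which translation table 11 shares for amino acids. All function names are hypothetical. For brevity the example uses only the first three blocks of the gene sequence.

```python
# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_blocks = "ATGCAAGAAT TCGAATATAT TAAAAATTAC"
print(translate(first_blocks))  # MQEFEYIKNY
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.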