Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0798 |
Symbol | thiC |
ID | 3927286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 813891 |
End bp | 815555 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901916 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_507596 |
Protein GI | 88658541 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAG ACTTTAATAC CCTCTTTCCT TCTTCAACTA AAGAATACAT ATCAGGCACA ATATACAACA ATATAAAAGT TGGAATGCGT AGAATTCATA TTAATGATAA CAGTGAATCC ATTCTAACAT ATGATACTGG AGGACCACAT ACTGATCAAA AAATCAAAAT TGACATTAAT CAGGGTATTG AAAAGATAAG GCTTAATTGG ATTGTAGATA GACAAGACGT AGAGTATCAT AAAAGACAAG AAGTAAACAC AAATTCTGAA TACGCATTTC CACTACAAAG TAATAATATC TTAAAAGCAA ATAGTAATAA ACCTATAACC CAGATGTATT ATGCTCGAAA CAATATTATT ACTCCAGAAA TGGAATATGT AGCAATACGA GAAAACGCAC TAAGACAAAA GATTCTTTCT TATAAACCAA CTGTCATGGC ACCAGAAATT ACTCCAGAAT TCGTACGACA AGAAATAGCA TCTGGGAGAG CAATCATTCC AGCAAATGTA AATCATCCTG AATCAGAACC AATGATAATA GGCAAGAATT TCTTAGTAAA AATCAATGCT AATATTGGTA ATTCAGTAGT TAGTTCAAGC ATTGAAGACG AACTTCAAAA AATGATATAC GCAATTATAT ACGGTGCTGA TACAGTTATG GATTTATCTA CAGGGAACAA TATACATAAT ATTAGAGAAT GGATTATCAG AAACAGCCCA GTACCAATAG GTACAGTACC TATATATCAA GCATTAAACA AAGTAAATGG AGTAGTAGGA GACCTAGATT TCAATATCTT CAAAAAAACA TTAATAGAAC AAGCAGAACA AGGTGTTGAC TATTTTACCA TACATGCAGG AGTATTAAAA AATTACATTG ACTACACAGA TAACAGGCTA ACTGGCATCG TATCAAGAGG TGGAGCTATT ATGGCACACT GGTGTACTAT ACATAATAAA GAGAACTTTC TTTATACAAA TTTTGAAGAA ATATGCGATA TTATGAAACA CTATGACATT ACTTTCTCTC TAGGAGATGG ATTAAGGCCA GGATCCATAG CAGATGCAAA CGATACAGCA CAATTTTTAG AACTCAAAAC ACTAGGAGAA CTTACAGACA TTGCATGGAA GCACGATTGC CAAGTTATGA TAGAAGGACC AGGCCATGTT CCTATGCATT TAATAAAAGA AAATGTAGAA AAACAGGTTC ACTTCTGTAA GGAAGCTCCA TTCTATACTT TAGGACCATT AACTACAGAC ATTGCTCCAG GTTACGACCA CATAACGAGT GCAATCGGCG CAGCAATTAT AGGATGGTAT GGCACTTCTA TGTTGTGTTA TGTTACACCG AAAGAACACT TAGGATTACC TAACATTCAA GACGTAAAAG ATGGAGTCAT TGCATATAAA ATAGCAGCAC ATGCAGCAGA CTTAGCAAAA GGTAATCCAT CTGCTTATAT ACGTGATTAT GCATTAAGTT ATGCAAGGTT CAATTTCAAA TGGTATGATC AATTTAACTT ATCTCTTGAT CCAGAAACGG CAAAATCACT ACACGATGAA TCTCTTCCTT CTGAGAATGC AAAGTCTGCT CACTTCTGTT CAATGTGTGG GCCAAAATTC TGCTCAATGA AATTAACTCA TCAAATAAAA TCTATTGAAG AGTAA
|
Protein sequence | MKIDFNTLFP SSTKEYISGT IYNNIKVGMR RIHINDNSES ILTYDTGGPH TDQKIKIDIN QGIEKIRLNW IVDRQDVEYH KRQEVNTNSE YAFPLQSNNI LKANSNKPIT QMYYARNNII TPEMEYVAIR ENALRQKILS YKPTVMAPEI TPEFVRQEIA SGRAIIPANV NHPESEPMII GKNFLVKINA NIGNSVVSSS IEDELQKMIY AIIYGADTVM DLSTGNNIHN IREWIIRNSP VPIGTVPIYQ ALNKVNGVVG DLDFNIFKKT LIEQAEQGVD YFTIHAGVLK NYIDYTDNRL TGIVSRGGAI MAHWCTIHNK ENFLYTNFEE ICDIMKHYDI TFSLGDGLRP GSIADANDTA QFLELKTLGE LTDIAWKHDC QVMIEGPGHV PMHLIKENVE KQVHFCKEAP FYTLGPLTTD IAPGYDHITS AIGAAIIGWY GTSMLCYVTP KEHLGLPNIQ DVKDGVIAYK IAAHAADLAK GNPSAYIRDY ALSYARFNFK WYDQFNLSLD PETAKSLHDE SLPSENAKSA HFCSMCGPKF CSMKLTHQIK SIEE
|
| |