Gene ECH_0798 details


Gene Information

Locus tag: ECH_0798
Symbol: thiC
ID: 3927286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 813891
End bp: 815555
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 34%
IMG OID: 637901916
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_507596
Protein GI: 88658541
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
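
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(813891, 815555)
print(length, protein_length(length))  # prints "1665 554"
```

Both values agree with the Gene Length and Protein Length fields of this record.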


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAG ACTTTAATAC CCTCTTTCCT TCTTCAACTA AAGAATACAT ATCAGGCACA 
ATATACAACA ATATAAAAGT TGGAATGCGT AGAATTCATA TTAATGATAA CAGTGAATCC
ATTCTAACAT ATGATACTGG AGGACCACAT ACTGATCAAA AAATCAAAAT TGACATTAAT
CAGGGTATTG AAAAGATAAG GCTTAATTGG ATTGTAGATA GACAAGACGT AGAGTATCAT
AAAAGACAAG AAGTAAACAC AAATTCTGAA TACGCATTTC CACTACAAAG TAATAATATC
TTAAAAGCAA ATAGTAATAA ACCTATAACC CAGATGTATT ATGCTCGAAA CAATATTATT
ACTCCAGAAA TGGAATATGT AGCAATACGA GAAAACGCAC TAAGACAAAA GATTCTTTCT
TATAAACCAA CTGTCATGGC ACCAGAAATT ACTCCAGAAT TCGTACGACA AGAAATAGCA
TCTGGGAGAG CAATCATTCC AGCAAATGTA AATCATCCTG AATCAGAACC AATGATAATA
GGCAAGAATT TCTTAGTAAA AATCAATGCT AATATTGGTA ATTCAGTAGT TAGTTCAAGC
ATTGAAGACG AACTTCAAAA AATGATATAC GCAATTATAT ACGGTGCTGA TACAGTTATG
GATTTATCTA CAGGGAACAA TATACATAAT ATTAGAGAAT GGATTATCAG AAACAGCCCA
GTACCAATAG GTACAGTACC TATATATCAA GCATTAAACA AAGTAAATGG AGTAGTAGGA
GACCTAGATT TCAATATCTT CAAAAAAACA TTAATAGAAC AAGCAGAACA AGGTGTTGAC
TATTTTACCA TACATGCAGG AGTATTAAAA AATTACATTG ACTACACAGA TAACAGGCTA
ACTGGCATCG TATCAAGAGG TGGAGCTATT ATGGCACACT GGTGTACTAT ACATAATAAA
GAGAACTTTC TTTATACAAA TTTTGAAGAA ATATGCGATA TTATGAAACA CTATGACATT
ACTTTCTCTC TAGGAGATGG ATTAAGGCCA GGATCCATAG CAGATGCAAA CGATACAGCA
CAATTTTTAG AACTCAAAAC ACTAGGAGAA CTTACAGACA TTGCATGGAA GCACGATTGC
CAAGTTATGA TAGAAGGACC AGGCCATGTT CCTATGCATT TAATAAAAGA AAATGTAGAA
AAACAGGTTC ACTTCTGTAA GGAAGCTCCA TTCTATACTT TAGGACCATT AACTACAGAC
ATTGCTCCAG GTTACGACCA CATAACGAGT GCAATCGGCG CAGCAATTAT AGGATGGTAT
GGCACTTCTA TGTTGTGTTA TGTTACACCG AAAGAACACT TAGGATTACC TAACATTCAA
GACGTAAAAG ATGGAGTCAT TGCATATAAA ATAGCAGCAC ATGCAGCAGA CTTAGCAAAA
GGTAATCCAT CTGCTTATAT ACGTGATTAT GCATTAAGTT ATGCAAGGTT CAATTTCAAA
TGGTATGATC AATTTAACTT ATCTCTTGAT CCAGAAACGG CAAAATCACT ACACGATGAA
TCTCTTCCTT CTGAGAATGC AAAGTCTGCT CACTTCTGTT CAATGTGTGG GCCAAAATTC
TGCTCAATGA AATTAACTCA TCAAATAAAA TCTATTGAAG AGTAA
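
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC fraction, which should land near the GC content reported in this record once the full sequence is supplied. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first line of the gene sequence from this record.
first_line = "ATGAAAATAG ACTTTAATAC CCTCTTTCCT TCTTCAACTA AAGAATACAT ATCAGGCACA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```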
 
Protein sequence
MKIDFNTLFP SSTKEYISGT IYNNIKVGMR RIHINDNSES ILTYDTGGPH TDQKIKIDIN 
QGIEKIRLNW IVDRQDVEYH KRQEVNTNSE YAFPLQSNNI LKANSNKPIT QMYYARNNII
TPEMEYVAIR ENALRQKILS YKPTVMAPEI TPEFVRQEIA SGRAIIPANV NHPESEPMII
GKNFLVKINA NIGNSVVSSS IEDELQKMIY AIIYGADTVM DLSTGNNIHN IREWIIRNSP
VPIGTVPIYQ ALNKVNGVVG DLDFNIFKKT LIEQAEQGVD YFTIHAGVLK NYIDYTDNRL
TGIVSRGGAI MAHWCTIHNK ENFLYTNFEE ICDIMKHYDI TFSLGDGLRP GSIADANDTA
QFLELKTLGE LTDIAWKHDC QVMIEGPGHV PMHLIKENVE KQVHFCKEAP FYTLGPLTTD
IAPGYDHITS AIGAAIIGWY GTSMLCYVTP KEHLGLPNIQ DVKDGVIAYK IAAHAADLAK
GNPSAYIRDY ALSYARFNFK WYDQFNLSLD PETAKSLHDE SLPSENAKSA HFCSMCGPKF
CSMKLTHQIK SIEE
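
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a sketch only, assuming the gene sequence is already given in coding orientation. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard assignment string is used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence from this record.
print(translate("ATGAAAATAGACTTTAATACC"))  # prints "MKIDFNT"
```

The output matches the first seven residues of the protein sequence above. Applied to the full 1665 bp sequence with whitespace removed, the same function would be expected to yield the 554-residue protein.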