Gene ECH_0080 details

Gene Information

Locus tag: ECH_0080
Symbol: polA
ID: 3927608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 70668
End bp: 73502
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 30%
IMG OID: 637901204
Product: DNA polymerase I
Protein accession: YP_506910
Protein GI: 88658019
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
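
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions inferred from the numbers, not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(70668, 73502)
    print(length, protein_length(length))  # 2835 944
```

Under these assumptions the computed values match the Gene Length and Protein Length fields above.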


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.432939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTTT TTACAATTAT AGATGCTTAT GGGTTGCTTT TTAGAGCATA TTATGCTTTA 
CCAAATCTGA GAACATCCTA TGGATTACCT ATTGGTGGGG TTTATGGTTT TATCAATATT
TTTTTAAAGT ATATAGAGAA GCATGTAACA GATTATTTAG TTGTTGTGTT TGATACTGGT
AGTAAGAATT TTCGTCATAA TATATATCCA GAATATAAAG GTAATCGTCC TAAACTTCCT
GATGACTTAA TACCTCAATT TTCATTGTTA AGAGAAGCTG TAAATGCTTT TAATATAGCT
TCTGAAGAAG TTGTAGGGTA CGAAGCTGAT GATGTTATAG CAACTTTAAG TAAGAAGTAT
TGTAAGTTGC AAGGTGTTAA AGTAACGGTT GTAACGTCAG ATAAGGATTT GTTGCAACTT
TTAAAATATA ACATTTGTAT ATTTGACCCA ATAAAAAATA AATATATTGA AGAAGAAGAT
GTACAAAGTA AATTTGGTAT ATCATCAAAT CAGTTGTTGG ATTTTTTGTC TTTAACAGGT
GATGCATCAG ATAATGTCCC TGGGATTCCA GGTATTGGTG TCAAAACAGC AGCAAAATTA
CTTAATGATT TTGGATCGTT AGATAATTTG TTGCTACATG TACATGAAGT TAAAACAAAT
AAGTGTCGTG AGTCTATTAC TCAATATAGT GACCAAGCAA TATTATCCCG TCAGTTAGTA
ACTTTATGTG ATGAGGTAGA TATATGTGGG GATATAGAAA AATATAGTTT TCAGATCTCT
GGAATACAAG AGTTAGTAGA ATTCTTAAAA AAATATGAAC TTCAATCTTT GATGAATAAG
GTTGATAAAT TTTTCAAAGT AGGTAATGCC TCATCAGCTG TTAATCAAAA TACCAGTAGT
AGTGATAAAA CTATGGGTGG TGATAAAGTT ATACATCATG AACCTCAATC TTCGATGAAT
GAGGTTGATA AATTTTTCAA AGTAGGTAAT GCTTCATCAG CTGTTAATCA AAATGCCAGT
AGTAGTGATA AAACTATGGG TGGTGATAAA GTTATACATC ATGAACTTCA ATCTTCGATG
AATAAGGTTG ATAAATTTTT CGAAGTAGGT AATACCTCAT CAACTGTTAA TCAAAATGCC
AGTAGTAGTG ATAAAACTAT GGGTGGTAGT GAAGTTATAC ATTACTCCAC TGAGTCACTT
AAGGTGTTTC TTGAAAACTG TAAAGGTGAA GGAATAATGG CATTTTATAT GGAAATGGCT
GATAACGTTA TTGATAGTGT TTCTTTATCA TATAAGGATG ATATTTTACT CTATATTGAC
AAGGATCATG TGACTGATGC ACTAGAACTT ATTAAGCCAG TACTTGGTTT AAATTATGTG
TTAAAGGTAA TATATGATGT AAAGACATTA TTAAAAGTTA TTCCTGATGT GGAAATTGTA
GCATTTGACG ATATTATGAT AATGTCATAT ATTTTAAGCC CAAGTGTACA TGATCATTCA
CTACAAGAGA TAATTAACTA TAATGTTAAA CAGGATGTTG TTAATGTAAA AACAGCAATA
ACTTTATTAT TATTGCATAA GTTGTTAAAG AAAAATCTGT TTGTAAATCA ACTGTATACT
ATTTATGAGA GAGTTGAGAA GCCGCTAATT CGTGTATTAG ATAGCATGGA AAAGGTAGGC
ATGTTGATTG ATATTGATAT TTTAAAAACA TTATCATCTA CTTTTTCAGA AAAAGTTAGT
GTACTAGAAA ATGAAATATA TAGGCTTGCA GGAACAGAAT TTAATATTGC ATCTTCAAAG
CAGTTGGGAA CTGTTTTATT TGATAAGATG GGTATAAAAA AAAGTAAGAA ATTGAGTTCA
GGTAGTTATA GTACTGATGC TGAAGTTTTA AATGATCTGG TATTTAATGA AATTGAAATA
GCGGATAAAA TATTACAGTG GCGTCATTTT ACAAAATTAA AAAGTACTTA TACTGATGCT
TTAGGAAAAC AGATAAATAG TAATAGCGGT AGAATACATA CTTTCTATTC TATGATATCG
ACTGCTACTG GAAGATTAAG TTCAAGTAAT CCGAATTTGC AGAATATTCC AATTAGAAGT
GAAGAGGGAA ATGCTATTAG GAGAGCATTT GTTGCACGAA AAGGATATAA GTTAGTTTCT
GCTGATTATT CACAAATAGA ATTACGAATA ATGGCACATA TAGCTGGTGT ACAGGCATTT
AAAGATGCAT TTTTTTTAGA TCAGGATATA CATTCGATAA CTGCAGAGCA AATATTTTGC
ACTCAAAGCC TGGATAAAAA TCTAAGGAGA AAAGCAAAAT CAATAAATTT TGGTATTATT
TATGGTATGA GTGCATTTGG TTTGGCTAAA CAGCTTGGTA TTTCTAGATC AGAAGCAAAT
GCATATATTG ATAATTATTT TAAATCTTAT CCGGAAATTC AGTCTTATAT GAACAATATT
AAAATTTATG CAAAAACTTA TGGTTATACA CGGACTATTT TTGGTAGAAA GTGTTTTATA
AGAGATATTA ATAGCAGTAA TGCTGCAGCT AGAAATTTTT CTGAGAGGGC AGCTATTAAT
GCTCCTTTAC AAGGAACATC AGCTGATATA ATAAAGATGT CAATGATTCA TTTGTTTGAT
AAGATTACAC ATGGATCACT TATACTTCAA GTGCATGATG AACTATTGTT TGAGATTTTA
GAAGAGTATG TAGATTATGC GGTGGAGGTC ATAACAAAAG TCATGGAAGG TATTGTTAAA
TTATCAGTAC CATTAAAAGT TGATATTCGT ATAGGAACTA ATTGGGCTGA TTTAGTTCCA
TATAGTAAAA AATGA
 
Protein sequence
MRVFTIIDAY GLLFRAYYAL PNLRTSYGLP IGGVYGFINI FLKYIEKHVT DYLVVVFDTG 
SKNFRHNIYP EYKGNRPKLP DDLIPQFSLL REAVNAFNIA SEEVVGYEAD DVIATLSKKY
CKLQGVKVTV VTSDKDLLQL LKYNICIFDP IKNKYIEEED VQSKFGISSN QLLDFLSLTG
DASDNVPGIP GIGVKTAAKL LNDFGSLDNL LLHVHEVKTN KCRESITQYS DQAILSRQLV
TLCDEVDICG DIEKYSFQIS GIQELVEFLK KYELQSLMNK VDKFFKVGNA SSAVNQNTSS
SDKTMGGDKV IHHEPQSSMN EVDKFFKVGN ASSAVNQNAS SSDKTMGGDK VIHHELQSSM
NKVDKFFEVG NTSSTVNQNA SSSDKTMGGS EVIHYSTESL KVFLENCKGE GIMAFYMEMA
DNVIDSVSLS YKDDILLYID KDHVTDALEL IKPVLGLNYV LKVIYDVKTL LKVIPDVEIV
AFDDIMIMSY ILSPSVHDHS LQEIINYNVK QDVVNVKTAI TLLLLHKLLK KNLFVNQLYT
IYERVEKPLI RVLDSMEKVG MLIDIDILKT LSSTFSEKVS VLENEIYRLA GTEFNIASSK
QLGTVLFDKM GIKKSKKLSS GSYSTDAEVL NDLVFNEIEI ADKILQWRHF TKLKSTYTDA
LGKQINSNSG RIHTFYSMIS TATGRLSSSN PNLQNIPIRS EEGNAIRRAF VARKGYKLVS
ADYSQIELRI MAHIAGVQAF KDAFFLDQDI HSITAEQIFC TQSLDKNLRR KAKSINFGII
YGMSAFGLAK QLGISRSEAN AYIDNYFKSY PEIQSYMNNI KIYAKTYGYT RTIFGRKCFI
RDINSSNAAA RNFSERAAIN APLQGTSADI IKMSMIHLFD KITHGSLILQ VHDELLFEIL
EEYVDYAVEV ITKVMEGIVK LSVPLKVDIR IGTNWADLVP YSKK
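
The sequences above are printed in blocks of 10 characters, so they need the whitespace stripped before any analysis. The following is a minimal sketch of how the blocked gene sequence might be cleaned, its GC fraction computed, and its translation compared against the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out by hand in the usual TCAG ordering; to the best of my knowledge translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which start codons are permitted, so the sketch ignores alternative starts.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence shown in this record.
    first_line = ("ATGCGTGTTT TTACAATTAT AGATGCTTAT "
                  "GGGTTGCTTT TTAGAGCATA TTATGCTTTA")
    print(translate(first_line))  # MRVFTIIDAYGLLFRAYYAL
```

Translating the first line of the gene sequence reproduces the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, though the exact rounding used by the source database is not known.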