Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0080 |
Symbol | polA |
ID | 3927608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 70668 |
End bp | 73502 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637901204 |
Product | DNA polymerase I |
Protein accession | YP_506910 |
Protein GI | 88658019 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.432939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTTT TTACAATTAT AGATGCTTAT GGGTTGCTTT TTAGAGCATA TTATGCTTTA CCAAATCTGA GAACATCCTA TGGATTACCT ATTGGTGGGG TTTATGGTTT TATCAATATT TTTTTAAAGT ATATAGAGAA GCATGTAACA GATTATTTAG TTGTTGTGTT TGATACTGGT AGTAAGAATT TTCGTCATAA TATATATCCA GAATATAAAG GTAATCGTCC TAAACTTCCT GATGACTTAA TACCTCAATT TTCATTGTTA AGAGAAGCTG TAAATGCTTT TAATATAGCT TCTGAAGAAG TTGTAGGGTA CGAAGCTGAT GATGTTATAG CAACTTTAAG TAAGAAGTAT TGTAAGTTGC AAGGTGTTAA AGTAACGGTT GTAACGTCAG ATAAGGATTT GTTGCAACTT TTAAAATATA ACATTTGTAT ATTTGACCCA ATAAAAAATA AATATATTGA AGAAGAAGAT GTACAAAGTA AATTTGGTAT ATCATCAAAT CAGTTGTTGG ATTTTTTGTC TTTAACAGGT GATGCATCAG ATAATGTCCC TGGGATTCCA GGTATTGGTG TCAAAACAGC AGCAAAATTA CTTAATGATT TTGGATCGTT AGATAATTTG TTGCTACATG TACATGAAGT TAAAACAAAT AAGTGTCGTG AGTCTATTAC TCAATATAGT GACCAAGCAA TATTATCCCG TCAGTTAGTA ACTTTATGTG ATGAGGTAGA TATATGTGGG GATATAGAAA AATATAGTTT TCAGATCTCT GGAATACAAG AGTTAGTAGA ATTCTTAAAA AAATATGAAC TTCAATCTTT GATGAATAAG GTTGATAAAT TTTTCAAAGT AGGTAATGCC TCATCAGCTG TTAATCAAAA TACCAGTAGT AGTGATAAAA CTATGGGTGG TGATAAAGTT ATACATCATG AACCTCAATC TTCGATGAAT GAGGTTGATA AATTTTTCAA AGTAGGTAAT GCTTCATCAG CTGTTAATCA AAATGCCAGT AGTAGTGATA AAACTATGGG TGGTGATAAA GTTATACATC ATGAACTTCA ATCTTCGATG AATAAGGTTG ATAAATTTTT CGAAGTAGGT AATACCTCAT CAACTGTTAA TCAAAATGCC AGTAGTAGTG ATAAAACTAT GGGTGGTAGT GAAGTTATAC ATTACTCCAC TGAGTCACTT AAGGTGTTTC TTGAAAACTG TAAAGGTGAA GGAATAATGG CATTTTATAT GGAAATGGCT GATAACGTTA TTGATAGTGT TTCTTTATCA TATAAGGATG ATATTTTACT CTATATTGAC AAGGATCATG TGACTGATGC ACTAGAACTT ATTAAGCCAG TACTTGGTTT AAATTATGTG TTAAAGGTAA TATATGATGT AAAGACATTA TTAAAAGTTA TTCCTGATGT GGAAATTGTA GCATTTGACG ATATTATGAT AATGTCATAT ATTTTAAGCC CAAGTGTACA TGATCATTCA CTACAAGAGA TAATTAACTA TAATGTTAAA CAGGATGTTG TTAATGTAAA AACAGCAATA ACTTTATTAT TATTGCATAA GTTGTTAAAG AAAAATCTGT TTGTAAATCA ACTGTATACT ATTTATGAGA GAGTTGAGAA GCCGCTAATT CGTGTATTAG ATAGCATGGA AAAGGTAGGC ATGTTGATTG ATATTGATAT TTTAAAAACA TTATCATCTA CTTTTTCAGA AAAAGTTAGT GTACTAGAAA ATGAAATATA TAGGCTTGCA GGAACAGAAT TTAATATTGC ATCTTCAAAG CAGTTGGGAA CTGTTTTATT TGATAAGATG GGTATAAAAA AAAGTAAGAA ATTGAGTTCA GGTAGTTATA GTACTGATGC TGAAGTTTTA AATGATCTGG TATTTAATGA AATTGAAATA GCGGATAAAA TATTACAGTG GCGTCATTTT ACAAAATTAA AAAGTACTTA TACTGATGCT TTAGGAAAAC AGATAAATAG TAATAGCGGT AGAATACATA CTTTCTATTC TATGATATCG ACTGCTACTG GAAGATTAAG TTCAAGTAAT CCGAATTTGC AGAATATTCC AATTAGAAGT GAAGAGGGAA ATGCTATTAG GAGAGCATTT GTTGCACGAA AAGGATATAA GTTAGTTTCT GCTGATTATT CACAAATAGA ATTACGAATA ATGGCACATA TAGCTGGTGT ACAGGCATTT AAAGATGCAT TTTTTTTAGA TCAGGATATA CATTCGATAA CTGCAGAGCA AATATTTTGC ACTCAAAGCC TGGATAAAAA TCTAAGGAGA AAAGCAAAAT CAATAAATTT TGGTATTATT TATGGTATGA GTGCATTTGG TTTGGCTAAA CAGCTTGGTA TTTCTAGATC AGAAGCAAAT GCATATATTG ATAATTATTT TAAATCTTAT CCGGAAATTC AGTCTTATAT GAACAATATT AAAATTTATG CAAAAACTTA TGGTTATACA CGGACTATTT TTGGTAGAAA GTGTTTTATA AGAGATATTA ATAGCAGTAA TGCTGCAGCT AGAAATTTTT CTGAGAGGGC AGCTATTAAT GCTCCTTTAC AAGGAACATC AGCTGATATA ATAAAGATGT CAATGATTCA TTTGTTTGAT AAGATTACAC ATGGATCACT TATACTTCAA GTGCATGATG AACTATTGTT TGAGATTTTA GAAGAGTATG TAGATTATGC GGTGGAGGTC ATAACAAAAG TCATGGAAGG TATTGTTAAA TTATCAGTAC CATTAAAAGT TGATATTCGT ATAGGAACTA ATTGGGCTGA TTTAGTTCCA TATAGTAAAA AATGA
|
Protein sequence | MRVFTIIDAY GLLFRAYYAL PNLRTSYGLP IGGVYGFINI FLKYIEKHVT DYLVVVFDTG SKNFRHNIYP EYKGNRPKLP DDLIPQFSLL REAVNAFNIA SEEVVGYEAD DVIATLSKKY CKLQGVKVTV VTSDKDLLQL LKYNICIFDP IKNKYIEEED VQSKFGISSN QLLDFLSLTG DASDNVPGIP GIGVKTAAKL LNDFGSLDNL LLHVHEVKTN KCRESITQYS DQAILSRQLV TLCDEVDICG DIEKYSFQIS GIQELVEFLK KYELQSLMNK VDKFFKVGNA SSAVNQNTSS SDKTMGGDKV IHHEPQSSMN EVDKFFKVGN ASSAVNQNAS SSDKTMGGDK VIHHELQSSM NKVDKFFEVG NTSSTVNQNA SSSDKTMGGS EVIHYSTESL KVFLENCKGE GIMAFYMEMA DNVIDSVSLS YKDDILLYID KDHVTDALEL IKPVLGLNYV LKVIYDVKTL LKVIPDVEIV AFDDIMIMSY ILSPSVHDHS LQEIINYNVK QDVVNVKTAI TLLLLHKLLK KNLFVNQLYT IYERVEKPLI RVLDSMEKVG MLIDIDILKT LSSTFSEKVS VLENEIYRLA GTEFNIASSK QLGTVLFDKM GIKKSKKLSS GSYSTDAEVL NDLVFNEIEI ADKILQWRHF TKLKSTYTDA LGKQINSNSG RIHTFYSMIS TATGRLSSSN PNLQNIPIRS EEGNAIRRAF VARKGYKLVS ADYSQIELRI MAHIAGVQAF KDAFFLDQDI HSITAEQIFC TQSLDKNLRR KAKSINFGII YGMSAFGLAK QLGISRSEAN AYIDNYFKSY PEIQSYMNNI KIYAKTYGYT RTIFGRKCFI RDINSSNAAA RNFSERAAIN APLQGTSADI IKMSMIHLFD KITHGSLILQ VHDELLFEIL EEYVDYAVEV ITKVMEGIVK LSVPLKVDIR IGTNWADLVP YSKK
|
| |