Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS4482 |
Symbol | |
ID | 2851565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 4392993 |
End bp | 4395668 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637507719 |
Product | DNA polymerase I |
Protein accession | YP_030729 |
Protein GI | 49187477 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAA TCATTTCAAA TTTGTATGGG GAGGTTTCTG ATTTGGAAAA AAAGGTCGTA TTAGTAGATG GTAATAATAT CGCGTATCGT GCTTTCTTTG CACTACCGCT TTTAAATAAC GACAAAGGTA TACATACGAA CGCAATTTAC GGTTTTACAA TGATGTTAAT GAGAATATTA GAGGAAGAAA AACCGACGCA TATGTTAGTA GCATTTGATG CGGGGAAAAC GACATTCCGT CATAAAACAT ATAGTGAGTA TAAAGGGGGA CGTCAAAAGA CTCCGCCTGA GTTATCAGAG CAATTCCCGT TTATTCGTGA GATGCTTGAT GCATTCAATG TACCACGTTA TGAATTAGAA AATTATGAAG CGGATGACAT TATGGGAACG CTAGCGAAAG AAGCGAGTGA ACAAGGCGCA AGTGTAAAAG TTATTTCAGG AGATAAAGAC TTACTTCAAC TTGTTTCTGA TAACACGCTT GTATGTATTC CTCGAAAAGG GATTACGGAA GTAGACGAAT ATACGAAAGA AGCTTTATTT GAAAAATACA GCTTATCACC AAAGCAAATT ATCGATATGA AAGGTTTAAT GGGAGACCAA TCAGATAATA TTCCAGGTGT ACCAGGAGTT GGTGAAAAAA CTGCGATTAA ATTGTTAACA CAGTTTGGAA CGGTTGAAGA AGTGTATGAA AATATAGATC AAGTAAGCGG GAAGAAATTA AAAGAAAAGC TGGAAGCAAA TAAAGACCAA GCTCTTATGA GTAAAGATCT TGCAACCATT ATTACAGACG CACCGATTAC TGTGAATGTG GATGATATGG AGTATAAAGG ATATGAAGCA AGTGATGTCA TTCCAATGTT CGAGAATTTA GGATTTACAT CTCTTTTAAA CAAATTAGGT GTTACGCCAG AAGAAACAGC TCCGGCTGAA TTAGATGATA TTACATTTGA TATTGTAGAA GAAGTTACAG AAGAAATGCT TCAGCAAGAT AGTGCGCTTA TCGTTGAAGT ACAAGAAGAT AACTATCATA AAGCAGACAT TCAAGGTTTC GGTATTCAAA ATGAAAATGG ATGTTACTTT ATTCAGACAG ACATTGCACT TAAATCAGAT GCTTTTAAAG AGTGGCTTGC AGATGGAGAA ATGAGAAAGT ATACATTTGA TGCGAAGCGT GCGATCGTTG CGCTGAAATG GAACGGTATA GATATGCAAG GGATTGACTT TGATCTATTA ATCGCTGCTT ACTTACTTGA TCCGGCTGAT ACAGATAAAG ATTTCCGTAC TGTAGCGAAA ATGAAAGAAA CGCATGCTGT GAAATCTGAT GAAGAAGTTT ACGGAAAAGG TGCGAAGCGT GCTGTTCCAG AGTTAGAGAT AGTAGCGGAG CATGTAGCTC GTAAAGTGCA TGTATTATAT GATGTAAAGC AAACATTTGT TGAAGAGTTA GAAAAGAATG AGCAATATGA ACTGTTTACA GAGTTGGAAT TACCACTTGC ACGTGTATTA GCTGATATGG AAGTAAAAGG TGTAAAAGTT GATACAGAAC GTCTTCGTAA TATGGGAGAA GAACTTGCAG GTAGATTAAA GGAAATGGAA CAGGAAATTT ATAAGCTGGC GGGAACAGAA TTTAATATTA ATTCACCGAA GCAGCTTGGT GTGATTCTGT TTGAGAATTT AAACTTACCG GTTATTAAGA AGACGAAAAC AGGTTATTCT ACATCAGCAG ATGTATTAGA CAAGCTGATG GATCACCATG AAATTATTCC AAACATTTTA CATTACCGTC AATTAGGGAA ACTCAATTCA ACTTATATTG AAGGTTTATT AAAAGTTGTA CATGAAGATT CATCTAAAAT TCATACTCGC TTCAATCAAG TATTAACGCA AACAGGTCGA TTAAGTTCAA CGGATCCAAA CTTGCAAAAT ATCCCGATTC GATTAGAAGA AGGAAGAAAG ATTCGTCAGG CATTCGTTCC ATCAGAAGAA GGATGGATTA TGTACGCGGC CGATTATTCA CAAATTGAAC TTCGTGTATT AGCTCATATT GCCAATGATA AAGGGCTAGT TGAAGCGTTC CAACATGACA TGGATATTCA TACAAAAACA GCGATGGATG TATTTGGCGT TGAAAAAGAT GAAGTAACTT CAAATATGAG ACGACAAGCG AAAGCTGTTA ACTTCGGAAT TGTGTATGGT ATTAGTGATT ATGGTCTTTC ACAAAACTTA GGAATTACAA GAAAAGCAGC AGCGGAATTT ATTGAAAAGT ATTTAGAAAG TTTCCCTGGT GTACAAGAAT ATATGGATGA CATCGTAAAA GATGCGAAGC AAAAAGGATA TGTGGCTACA TTATTAAATC GTCGCCGTTA CATTCCGGAA ATTACGAGTC GTAATTTCAA CTTGCGTAGC TTTGCTGAGC GTACAGCTAT GAATACACCA ATTCAAGGTA CTGCAGCAGA TATTATTAAA AAAGCGATGA TTATTATGGC AGATCGTTTA GAAGAAGAAG GATTACAAGC GCGTCTTCTT CTGCAAGTAC ACGATGAATT AATATTTGAG GCACCAAAAG AAGAAGTTGA AAAATTAGAG AAGCTTGTAC CAGAAGTAAT GGAGCATGCA ATTGAACTGG CAGTTCCACT GAAAGTTGAT TATTCTTACG GTCCAACTTG GTACGACGCA AAATAA
|
Protein sequence | MKKIISNLYG EVSDLEKKVV LVDGNNIAYR AFFALPLLNN DKGIHTNAIY GFTMMLMRIL EEEKPTHMLV AFDAGKTTFR HKTYSEYKGG RQKTPPELSE QFPFIREMLD AFNVPRYELE NYEADDIMGT LAKEASEQGA SVKVISGDKD LLQLVSDNTL VCIPRKGITE VDEYTKEALF EKYSLSPKQI IDMKGLMGDQ SDNIPGVPGV GEKTAIKLLT QFGTVEEVYE NIDQVSGKKL KEKLEANKDQ ALMSKDLATI ITDAPITVNV DDMEYKGYEA SDVIPMFENL GFTSLLNKLG VTPEETAPAE LDDITFDIVE EVTEEMLQQD SALIVEVQED NYHKADIQGF GIQNENGCYF IQTDIALKSD AFKEWLADGE MRKYTFDAKR AIVALKWNGI DMQGIDFDLL IAAYLLDPAD TDKDFRTVAK MKETHAVKSD EEVYGKGAKR AVPELEIVAE HVARKVHVLY DVKQTFVEEL EKNEQYELFT ELELPLARVL ADMEVKGVKV DTERLRNMGE ELAGRLKEME QEIYKLAGTE FNINSPKQLG VILFENLNLP VIKKTKTGYS TSADVLDKLM DHHEIIPNIL HYRQLGKLNS TYIEGLLKVV HEDSSKIHTR FNQVLTQTGR LSSTDPNLQN IPIRLEEGRK IRQAFVPSEE GWIMYAADYS QIELRVLAHI ANDKGLVEAF QHDMDIHTKT AMDVFGVEKD EVTSNMRRQA KAVNFGIVYG ISDYGLSQNL GITRKAAAEF IEKYLESFPG VQEYMDDIVK DAKQKGYVAT LLNRRRYIPE ITSRNFNLRS FAERTAMNTP IQGTAADIIK KAMIIMADRL EEEGLQARLL LQVHDELIFE APKEEVEKLE KLVPEVMEHA IELAVPLKVD YSYGPTWYDA K
|
| |