Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK4328 |
Symbol | polA |
ID | 3027141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 4440563 |
End bp | 4443238 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637548544 |
Product | DNA polymerase I |
Protein accession | YP_085907 |
Protein GI | 52140922 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAA TCATTTCAAA TTTGTATGGG GAGGTTTCTG ATTTGGAAAA AAAGGTCGTA TTAGTAGATG GTAATAATAT CGCGTATCGT GCTTTCTTTG CACTACCGCT TTTAAATAAC GACAAGGGTA TACATACGAA CGCAATTTAC GGTTTTACAA TGATGTTAAT GAGAATATTA GAGGAAGAAA AACCGACGCA TATGTTAGTT GCATTTGATG CGGGTAAAAC AACGTTCCGT CATAAAACGT ATAGTGAGTA TAAAGGAGGA CGTCAAAAGA CTCCACCTGA ATTATCAGAG CAATTCCCGT TTATTCGTGA GATGCTCGAT GCATTTAATG TACCGCGTTA TGAATTAGAA AATTATGAAG CGGATGACAT TATGGGAACG CTAGCGAAAG AAGCGAGTGA ACAAGGAGCT AGTGTAAAAG TTATTTCAGG AGATAAAGAT TTACTTCAAC TTGTTTCTGA TAACACGCTT GTATGCATTC CTAGAAAAGG GATTACGGAA GTAGATGAAT ATACGGAAGA AGCTTTGTTT GAGAAATACA GCTTATCACC GAAGCAAATT ATTGATATGA AAGGTTTAAT GGGAGACCAA TCAGATAATA TTCCAGGTGT ACCAGGAGTT GGTGAAAAAA CTGCGATTAA ATTGTTAACA CAGTTTGGAA CGGTTGAAGA AGTGTATGAA AATATAGATC AAGTAAGCGG GAAGAAATTA AAAGAAAAGC TGGAAGCAAA TAAAGATCAA GCTCTTATGA GTAAAGATCT TGCAACCATT ATTACAGACG CACCGATTAC TGTGAATGTG GATGATATGG AGTATAAAGG ATATGAAGCA AGTGATGTCA TTCCAATGTT CGAGAATTTA GGATTTACAT CTCTTTTAAA CAAATTAGGT GTTACGCCAG AAGAAACGGC TCCAGCTGAA TTAGATGATA TTACATTTGA TATTGTAGAA GAAGTTACAG AAGAAATGCT TCAGCAAGAT AGTGCGCTTA TCGTTGAAGT ACAAGAAGAT AACTATCATA AAGCAGACAT TCAAGGTTTC GGTATTCAAA ATGAAAATGG ATGTTACTTT ATTCAGACAG ATATTGCACT TAAATCAGAT GCTTTTAAAG AGTGGCTTGC AGATGGAGAA ATGAGAAAGT ATACATTTGA TGCGAAACGT GCGATCGTTG CGCTGAAATG GAACGGTATA GATATGCAAG GGATTGACTT TGATCTATTA ATCGCTGCTT ACTTACTTGA TCCGGCTGAT ACAGATAAAG ATTTCCGTAC TGTAGCGAAA ATGAAAGAAA CGCATGCTGT GAAATCTGAT GAAGAAGTTT ACGGAAAAGG TGCGAAGCGT GCTGTTCCAG AGTTAGAGAT AGTAGCGGAG CATGTAGCTC GTAAAGTGCA TGTATTATAT GATGTAAAGC AAACATTCGT TGAAGAGTTA GAAAAGAATG AGCAATATGA ACTGTTTACA GAGTTAGAAT TACCACTTGC ACGTGTATTA GCTGATATGG AAGTAAAAGG TGTAAAAGTT GATACAGAAC GTCTTCGTAA TATGGGAGAA GAACTTGCAG GTAGATTAAA GGAAATGGAA CAGGAAATTT ATAAACTGGC GGGAACAGAA TTTAATATTA ATTCACCGAA GCAGCTTGGT GTGATTCTGT TTGAGAATTT AAACTTACCG GTTATTAAGA AGACGAAAAC AGGTTATTCT ACGTCAGCAG ATGTATTAGA CAAGCTGATG GATCACCATG AAATTATTCC AAACATTTTA CATTACCGTC AATTAGGAAA ACTCAATTCA ACTTATATTG AAGGTTTATT AAAGGTTGTA CATGAAGATT CATCTAAAAT TCATACTCGT TTCAATCAAG TATTAACGCA AACGGGCCGA TTAAGTTCAA CGGATCCAAA CTTGCAAAAT ATCCCGATTC GATTAGAAGA AGGAAGAAAG ATTCGTCAGG CATTCGTTCC ATCAGAAGAA GGATGGATTA TGTACGCGGC CGATTATTCA CAAATTGAAC TTCGTGTATT AGCTCATATT GCAAATGATA AAGGGCTAGT TGAAGCGTTC CAACATGACA TGGATATTCA TACAAAAACA GCTATGGATG TATTTGGCGT TGAAAAAGAT GAAGTAACTT CAAATATGAG ACGACAAGCG AAAGCTGTTA ACTTCGGAAT TGTGTATGGT ATTAGTGATT ATGGTCTTTC ACAAAACTTA GGAATTACAA GAAAAGCAGC AGCGGAATTT ATTGAAAAGT ATTTAGAAAG TTTCCCTGGT GTACAAGAAT ACATGGATGA CATCGTAAAA GATGCGAAGC AAAAAGGATA TGTGGCTACA TTATTAAATC GTCGCCGTTA CATTCCGGAA ATTACGAGTC GTAATTTCAA CTTGCGTAGC TTTGCTGAGC GTACAGCTAT GAATACACCA ATTCAAGGTA CTGCAGCAGA TATTATTAAA AAAGCGATGA TTATTATGGC AGATCGTTTA GAAGAAGAAG GATTACAAGC TCGTCTTCTA TTGCAAGTAC ACGATGAATT AATATTTGAG GCACCAAAAG AAGAAATTGA AAAATTAGAG AAGCTTGTAC CAGAAGTAAT GGAGCATGCA ATTGAACTGG CAGTTCCACT GAAAGTTGAT TATTCTTACG GTCCAACTTG GTACGACGCA AAATAA
|
Protein sequence | MKKIISNLYG EVSDLEKKVV LVDGNNIAYR AFFALPLLNN DKGIHTNAIY GFTMMLMRIL EEEKPTHMLV AFDAGKTTFR HKTYSEYKGG RQKTPPELSE QFPFIREMLD AFNVPRYELE NYEADDIMGT LAKEASEQGA SVKVISGDKD LLQLVSDNTL VCIPRKGITE VDEYTEEALF EKYSLSPKQI IDMKGLMGDQ SDNIPGVPGV GEKTAIKLLT QFGTVEEVYE NIDQVSGKKL KEKLEANKDQ ALMSKDLATI ITDAPITVNV DDMEYKGYEA SDVIPMFENL GFTSLLNKLG VTPEETAPAE LDDITFDIVE EVTEEMLQQD SALIVEVQED NYHKADIQGF GIQNENGCYF IQTDIALKSD AFKEWLADGE MRKYTFDAKR AIVALKWNGI DMQGIDFDLL IAAYLLDPAD TDKDFRTVAK MKETHAVKSD EEVYGKGAKR AVPELEIVAE HVARKVHVLY DVKQTFVEEL EKNEQYELFT ELELPLARVL ADMEVKGVKV DTERLRNMGE ELAGRLKEME QEIYKLAGTE FNINSPKQLG VILFENLNLP VIKKTKTGYS TSADVLDKLM DHHEIIPNIL HYRQLGKLNS TYIEGLLKVV HEDSSKIHTR FNQVLTQTGR LSSTDPNLQN IPIRLEEGRK IRQAFVPSEE GWIMYAADYS QIELRVLAHI ANDKGLVEAF QHDMDIHTKT AMDVFGVEKD EVTSNMRRQA KAVNFGIVYG ISDYGLSQNL GITRKAAAEF IEKYLESFPG VQEYMDDIVK DAKQKGYVAT LLNRRRYIPE ITSRNFNLRS FAERTAMNTP IQGTAADIIK KAMIIMADRL EEEGLQARLL LQVHDELIFE APKEEIEKLE KLVPEVMEHA IELAVPLKVD YSYGPTWYDA K
|
| |