Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4337 |
Symbol | polA |
ID | 6270254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4051386 |
End bp | 4054172 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641728145 |
Product | DNA polymerase I |
Protein accession | YP_001882560 |
Protein GI | 187733745 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000947993 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACCGG TGCGATGTAT GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT CGCCCGCCAA TGCCGGACGA TCTGAGAGCA CAAATCGAAC CTCTGCACGC GATGGTTAAA GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTGA TCAGCACTGG CGATAAAGAT ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAATA CCATGACGAA TACCATCCTC GGACCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG GCGCTGATGG GTGACTCCTC CGATAACATC CCTGGCGTAC CGGGCGTCGG TGAAAAAACC GCGCAGGCAT TGCTGCAAGG TCTGGGCGGG CTGGATACGC TGTATGCCGA GCCAGAAAAA ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG ACCTGTGAAC AACTGGAAGT GCAGCAACCG GCAGCGGAAG AGTTGTTGGG GCTGTTCAAA AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCCAAA GGGGCAAAAC CAGCCGCGAA GCCGCAGGAA ACCAGTGTTG CAGACGAAGC GCCAGAAGTG ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA GCGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT GCACTCGAGT TGCTAAAACC GCTGCTGGAA GATGAAAAGG CGCTGAAGGT CGGGCAAAAC CTGAAATACG ATCGCGGTAT TCTGGCGAAT TACGGCATTG AGCTGCGTGG GATTGCGTTT GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC GGGTAAAGGC AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAGG CCGGACGTTA CGCCGCCGAA GATGCTGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGACCTGCA AAAACACAAA GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCGGTGCT TTCACGCATT GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCATTCTGA AGAGCTCACC TTGCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAAGA ATTTAACCTT TCTTCCACCA AGCAGTTGCA AACCATTCTG TTTGAAAAAC AGGGTATTAA ACCGCTGAAG AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC ACCGACAAGC TGCCGCTGAT GATCAACCCG AAAACCGGGC GTGTGCATAC CTCTTATCAC CAGGCAGTAA CTGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG GTGCGTAACG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG ATTGTCTCAG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT TTCGCGTGAC AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGGGCAAC GGCGGCAGAA GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGTCTGG CGCGGCAATT GAACATTCCA CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG ATTGCCGTTG ATGCGTGGTT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTTG ATGCCGTCGC GAAGCAGATT CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT GGCGAAAACT GGGATCAGGC GCACTAA
|
Protein sequence | MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK EVAYLSYQLA TIKTDVELEL TCEQLEVQQP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK GAKPAAKPQE TSVADEAPEV TATVISYDNY VTILDEETLK AWIAKLEKAP VFAFDTETDS LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE DEKALKVGQN LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI HQLMENCTRL DVPLLVEVGS GENWDQAH
|
| |