Gene SbBS512_E4337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4337 
SymbolpolA 
ID6270254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4051386 
End bp4054172 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content52% 
IMG OID641728145 
ProductDNA polymerase I 
Protein accessionYP_001882560 
Protein GI187733745 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000947993 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC 
GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACCGG TGCGATGTAT
GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG
GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT
CGCCCGCCAA TGCCGGACGA TCTGAGAGCA CAAATCGAAC CTCTGCACGC GATGGTTAAA
GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT
CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTGA TCAGCACTGG CGATAAAGAT
ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAATA CCATGACGAA TACCATCCTC
GGACCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG
GCGCTGATGG GTGACTCCTC CGATAACATC CCTGGCGTAC CGGGCGTCGG TGAAAAAACC
GCGCAGGCAT TGCTGCAAGG TCTGGGCGGG CTGGATACGC TGTATGCCGA GCCAGAAAAA
ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA
GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG
ACCTGTGAAC AACTGGAAGT GCAGCAACCG GCAGCGGAAG AGTTGTTGGG GCTGTTCAAA
AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCCAAA
GGGGCAAAAC CAGCCGCGAA GCCGCAGGAA ACCAGTGTTG CAGACGAAGC GCCAGAAGTG
ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA
GCGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC
CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG
GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT
GCACTCGAGT TGCTAAAACC GCTGCTGGAA GATGAAAAGG CGCTGAAGGT CGGGCAAAAC
CTGAAATACG ATCGCGGTAT TCTGGCGAAT TACGGCATTG AGCTGCGTGG GATTGCGTTT
GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC
CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC GGGTAAAGGC
AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAGG CCGGACGTTA CGCCGCCGAA
GATGCTGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGACCTGCA AAAACACAAA
GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCGGTGCT TTCACGCATT
GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCATTCTGA AGAGCTCACC
TTGCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAAGA ATTTAACCTT
TCTTCCACCA AGCAGTTGCA AACCATTCTG TTTGAAAAAC AGGGTATTAA ACCGCTGAAG
AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC
TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC
ACCGACAAGC TGCCGCTGAT GATCAACCCG AAAACCGGGC GTGTGCATAC CTCTTATCAC
CAGGCAGTAA CTGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG
GTGCGTAACG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG
ATTGTCTCAG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT TTCGCGTGAC
AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGGGCAAC GGCGGCAGAA
GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC
AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGTCTGG CGCGGCAATT GAACATTCCA
CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG
TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA
CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA
CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG
ATTGCCGTTG ATGCGTGGTT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA
CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTTG ATGCCGTCGC GAAGCAGATT
CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT
GGCGAAAACT GGGATCAGGC GCACTAA
 
Protein sequence
MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK
EVAYLSYQLA TIKTDVELEL TCEQLEVQQP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAAKPQE TSVADEAPEV TATVISYDNY VTILDEETLK AWIAKLEKAP VFAFDTETDS
LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE DEKALKVGQN
LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI
ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI
HQLMENCTRL DVPLLVEVGS GENWDQAH