Gene B21_03698 details


Gene Information

Locus tag: B21_03698
Symbol: polA
ID: 8114938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3951781
End bp: 3954567
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 52%
IMG OID: 644849859
Product: hypothetical protein
Protein accession: YP_003001432
Protein GI: 251787128
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
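
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names gene_length_bp and protein_length_aa are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3951781, 3954567)
print(length)                     # 2787
print(protein_length_aa(length))  # 928
```

Under these assumptions both values agree with the Gene Length (2787 bp) and Protein Length (928 aa) listed in the record.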


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000500829
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC 
GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACCGG TGCGATGTAT
GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG
GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT
CGCCCGCCAA TGCCGGACGA TCTGCGTGCA CAAATCGAAC CCTTGCACGC GATGGTTAAA
GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT
CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTGA TCAGCACTGG CGATAAAGAT
ATGGCGCAGC TGGTGACGCC AAATATTACA CTTATCAATA CCATGACGAA TACCATCCTC
GGACCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG
GCGCTGATGG GTGACTCCTC TGATAACATT CCTGGCGTAC CGGGCGTCGG TGAAAAAACC
GCGCAGGCAT TGCTGCAAGG TCTTGGCGGA CTGGATACGC TGTATGCCGA GCCAGAAAAA
ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA
GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG
ACCTGTGAAC AACTGGAAGT GCAGCAACCG GCAGCGGAAG AGTTGTTGGG GCTGTTCAAA
AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCCAAA
GGGGCAAAAC CAGCCGCGAA GCCACAGGAA ACCAGTGTTG CAGACGAAGC ACCAGAAGTG
ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA
GCGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC
CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG
GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT
GCACTCGAGT TGCTAAAACC GCTGCTGGAA GATGAAAAGG CGCTGAAGGT CGGGCAAAAC
CTGAAATACG ATCGCGGTAT TCTGGCGAAC TACGGCATTG AACTGCGTGG GATTGCGTTT
GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC
CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC TGGTAAAGGC
AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAAG CCGGACGTTA CGCCGCCGAA
GATGCAGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGATCTGCA AAAACACAAA
GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCGGTGCT TTCACGCATT
GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCATTCTGA AGAGCTCACC
CTTCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAGGA ATTTAACCTT
TCTTCCACCA AGCAGTTACA AACCATTCTC TTTGAAAAAC AGGGCATTAA ACCGCTGAAG
AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC
TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC
ACCGACAAGC TGCCGCTGAT GATCAACCCG AAAACCGGGC GTGTGCATAC CTCTTATCAC
CAGGCAGTAA CTGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG
GTGCGTAACG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG
ATTGTCTCAG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT TTCGCGTGAC
AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGGGCAAC GGCGGCAGAA
GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC
AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGTCTGG CGCGGCAATT GAACATTCCA
CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG
TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA
CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA
CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG
ATTGCCGTTG ATGCGTGGTT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA
CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTTG ATGCCGTCGC GAAGCAGATT
CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT
GGCGAAAACT GGGATCAGGC GCACTAA
 
Protein sequence
MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK
EVAYLSYQLA TIKTDVELEL TCEQLEVQQP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAAKPQE TSVADEAPEV TATVISYDNY VTILDEETLK AWIAKLEKAP VFAFDTETDS
LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE DEKALKVGQN
LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI
ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI
HQLMENCTRL DVPLLVEVGS GENWDQAH
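
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before use. The following is a minimal sketch, assuming the block is pasted as a plain string. The helper names clean_sequence and gc_fraction are hypothetical and do not come from the source database.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs and line breaks.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.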