Gene EcE24377A_4382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4382 
SymbolpolA 
ID5587789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4372179 
End bp4374965 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content52% 
IMG OID640928000 
ProductDNA polymerase I 
Protein accessionYP_001465344 
Protein GI157158243 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.70331e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC 
GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACCGG TGCGATGTAT
GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG
GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT
CGCCCGCCAA TGCCGGACGA TCTGCGAGCA CAAATCGAAC CTCTGCACGC GATGGTTAAA
GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT
CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTGA TCAGCACTGG CGATAAAGAT
ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAATA CCATGACGAA TACCATCCTC
GGACCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG
GCGCTGATGG GTGACTCCTC CGATAACATT CCTGGCGTAC CGGGCGTCGG TGAAAAAACC
GCGCAGGCAT TGCTGCAAGG TCTTGGCGGG CTGGATACGC TGTATGCCGA GCCAGAAAAA
ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA
GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG
ACCTGTGAAC AACTGGAAGT GCAGCAACCG GCAGCGGAAG AGTTGTTGGG GCTGTTCAAA
AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCCAAA
GGGGCAAAAC CAGCCGCGAA GCCGCAGGAA ACCAGTGTTG CAGACGAAGC GCCAGAAGTG
ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA
GCGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC
CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG
GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT
GCACTCGAGT TGCTAAAACC GCTGCTGGAA GATGAAAAGG CGCTGAAGGT CGGGCAAAAC
CTGAAATACG ATCGCGGTAT TCTGGCGAAC TACGGCATTG AACTGCGTGG GATTGCGTTT
GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC
CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC TGGTAAAGGC
AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAAG CCGGACGTTA CGCCGCCGAA
GATGCAGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGATCTGCA AAAACACAAA
GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCAGTGCT TTCACGCATT
GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCATTCTGA AGAGCTCACC
CTTCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAGGA ATTTAACCTT
TCTTCCACCA AGCAGTTACA AACCATTCTC TTTGAAAAAC AGGGTATTAA ACCGCTGAAG
AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC
TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC
ACCGACAAGC TGCCGCTGAT GATCAACCCG AAAACCGGGC GTGTGCATAC CTCTTATCAC
CAGGCAGTAA CTGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG
GTGCGTAACG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG
ATTGTCTCAG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT TTCGCGTGAC
AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGGGCAAC GGCGGCAGAA
GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC
AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGTCTGG CGCGGCAATT GAACATTCCA
CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG
TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA
CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA
CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG
ATTGCCGTTG ATGCGTGGTT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA
CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTTG ATGCCGTCGC GAAGCAGATT
CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT
GGCGAAAACT GGGATCAGGC GCACTAA
 
Protein sequence
MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK
EVAYLSYQLA TIKTDVELEL TCEQLEVQQP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAAKPQE TSVADEAPEV TATVISYDNY VTILDEETLK AWIAKLEKAP VFAFDTETDS
LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE DEKALKVGQN
LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI
ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI
HQLMENCTRL DVPLLVEVGS GENWDQAH