Gene EcSMS35_0062 details


Gene Information

Locus tag: EcSMS35_0062
Symbol: polB
ID: 6144677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 68577
End bp: 70928
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 56%
IMG OID: 641614963
Product: DNA polymerase II
Protein accession: YP_001742179
Protein GI: 170683863
COG category: [L] Replication, recombination and repair
COG ID: [COG0417] DNA polymerase elongation subunit (family B)
TIGRFAM ID:
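
The start, end, gene length, and protein length values in this record agree with each other if the coordinates are read as 1-based and inclusive and the gene length is taken to include the stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that consistency check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(68577, 70928)
print(length, protein_length(length))  # 2352 783
```

The 2352 bp span gives 784 codons; dropping the terminal stop codon leaves the 783 residues reported as the protein length.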


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0283986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.967142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCAGG CAGGTTTTAT CTTAACCCGA CACTGGCGGG ATACCCCGCA AGGAACCGAA 
GTTTCCTTTT GGTTAGCGTC GGACAACGGG CCGTTGCAGG TTACGCTTGC GCCGCAAGAG
TCCGTGGCGT TTATTCCCGC CGATCAGGTT CCCCGCGCCC AGCATATTTT GCAGGGTGAG
CAAGGCTTTC GCCTGACGCC GCTGGCGTTA AAGGATTTTC ATCGCCAGCC GGTGTATGGC
CTTTACTGTC GCGCCCATCG CCAGTTGATG AATTACGAAA AGCGCCTGCG TGAAGGTGGC
GTTACCGTCT ACGAGACCGA TGTGCGCCCG CCAGAACGCT ATCTGATGGA GCGGTTTATC
ACCTCACCGG TGTGGGTCGA GGGTGATATA CGCAATGGCG CTATCGTTAA TGCCCGTCTG
AAACCGCATC CCGACTATCG TCCGCCGCTC AGGTGGGTTT CTATTGATAT TGAAACCACC
CGCCACGGTG AGCTGTACTG CATCGGCCTG GAAGGCTGCG GGCAGCGCAT CGTTTATATG
CTGGGGCCAG AGAATGGCGA CGCCTCCGCG CTCGATTTTG AACTGGAATA CGTCGCCAGT
CGCCCGCAGC TACTGGAAAA ACTCAACGCC TGGTTTGCGA CTCACGATCC TGATGTGATC
ATCGGCTGGA ACGTGGTGCA ATTCGATCTA CGAATGCTGC AAAAACATGC CGAGCGTTAT
CGTATTCCGC TGCGTCTGGG GCGTGATAAC AGTGAGCTGG AGTGGCGCGA GCACGGCTTT
AAAAACGGCG TCTTTTTTGC CCAGGCCAAA GGGCGACTGA TTATCGACGG TATCGAGGCG
CTGAAATCCG CGTTCTGGAA TTTCTCTTCA TTCTCGCTGG AAACCGTCGC TCAGGAGTTA
TTAGGCGAAG GAAAATCTAT CGATAACCCG TGGGATCGAA TGGACGAAAT TGACCGCCGT
TTCGCCGAAG ATAAGCCTGC GCTGGCAACT TATAACCTGA AAGATTGCGA GCTGGTGACG
CAGATCTTCC ACAAAACTGA AATCATGCCG TTTTTGCTCG AACGGGCGAC GGTGAACGGC
CTGCCGGTAG ATCGACACGG CGGTTCGGTG GCAGCGTTTG GTCATCTCTA TTTTCCCCGT
ATGCATCGCG CTGGTTATGT CGCGCCTAAT CTCGGCGAAG TGCCGCCGCA CGCCAGCCCT
GGCGGCTACG TGATGGATTC GCGACCTGGG CTTTATGATT CGGTGCTGGT GCTGGACTAC
AAAAGCCTGT ACCCGTCGAT CATCCGCACC TTTCTGATTG ATCCCGTCGG GCTGGTGGAA
GGCATGGCGC AGCCTGATCC AGAGCACAGC ACCGAAGGTT TTCTCGACGC CTGGTTCTCG
CGGGAGAAAC ATTGCCTGCC GGAGATTGTG ACTAACATCT GGCACGGGCG CGATGAAGCC
AAACGCCAGG GCAATAAGCC ACTGTCGCAG GCGCTGAAGA TCATCATGAA TGCCTTTTAT
GGCGTGCTCG GGACTACCGC CTGTCGCTTC TTCGATCCGC GCCTGGCATC GTCGATCACC
ATGCGTGGTC ATCAGATCAT GCGGCAAACC AAAGCGTTGA TTGAAGCGCA GGGCTACGAC
GTGATCTACG GCGATACCGA CTCAACGTTT GTCTGGTTGA AAGGCGCACA TTCGGAAGAA
GAAGCGGCGA AAATTGGTCG TGCACTGGTG CAGCACGTTA ACGCCTGGTG GGCAGAAACG
CTGCAAAAAC AACGGCTGAC CAGCGCATTA GAACTGGAGT ATGAAACCCA TTTCTGCCGT
TTTCTGATGC CAACCATTCG CGGAGCCGAT ACCGGCAGTA AAAAACGTTA TGCCGGGCTG
ATTCAGGAGG GCGATAAGCA GCGGATGGTG TTTAAAGGGC TGGAAACCGT GCGCACCGAC
TGGACGCCGC TGGCCCAGCA GTTTCAGCAG GAGCTTTACC TGCGTATCTT CCGCAACGAG
CCATATCAGG AGTACGTCCG CGAAACCATC GACAAACTGA TGGCGGGTGA ACTGGATGCG
CGGCTGGTTT ACCGAAAACG CCTTCGCCGT CCGCTGAGCG AATATCAGCG TAATGTTCCG
CCTCATGTAC GCGCCGCTCG CCTTGCCGAT GAAGAAAATC AAAAGCGCGG TCGCCCCTTG
CAATATCAGA ACCGCGGCAC CATTAAGTAC GTATGGACCA CCAACGGCCC GGAGCCGCTG
GACTACCAAC GTTCACCACT GGATTACGAA CACTATCTGA CCCGCCAGCT ACAACCCGTG
GCGGAGGGAA TACTCCCTTT TATTGAGGAT AATTTTGCTA CACTTATGAC CGGGCAACTT
GGGCTATTTT GA
 
Protein sequence
MAQAGFILTR HWRDTPQGTE VSFWLASDNG PLQVTLAPQE SVAFIPADQV PRAQHILQGE 
QGFRLTPLAL KDFHRQPVYG LYCRAHRQLM NYEKRLREGG VTVYETDVRP PERYLMERFI
TSPVWVEGDI RNGAIVNARL KPHPDYRPPL RWVSIDIETT RHGELYCIGL EGCGQRIVYM
LGPENGDASA LDFELEYVAS RPQLLEKLNA WFATHDPDVI IGWNVVQFDL RMLQKHAERY
RIPLRLGRDN SELEWREHGF KNGVFFAQAK GRLIIDGIEA LKSAFWNFSS FSLETVAQEL
LGEGKSIDNP WDRMDEIDRR FAEDKPALAT YNLKDCELVT QIFHKTEIMP FLLERATVNG
LPVDRHGGSV AAFGHLYFPR MHRAGYVAPN LGEVPPHASP GGYVMDSRPG LYDSVLVLDY
KSLYPSIIRT FLIDPVGLVE GMAQPDPEHS TEGFLDAWFS REKHCLPEIV TNIWHGRDEA
KRQGNKPLSQ ALKIIMNAFY GVLGTTACRF FDPRLASSIT MRGHQIMRQT KALIEAQGYD
VIYGDTDSTF VWLKGAHSEE EAAKIGRALV QHVNAWWAET LQKQRLTSAL ELEYETHFCR
FLMPTIRGAD TGSKKRYAGL IQEGDKQRMV FKGLETVRTD WTPLAQQFQQ ELYLRIFRNE
PYQEYVRETI DKLMAGELDA RLVYRKRLRR PLSEYQRNVP PHVRAARLAD EENQKRGRPL
QYQNRGTIKY VWTTNGPEPL DYQRSPLDYE HYLTRQLQPV AEGILPFIED NFATLMTGQL
GLF
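
The protein sequence can be related back to the gene sequence through translation table 11, the bacterial, archaeal and plant plastid code. The sketch below is illustrative rather than the method used to produce this record. It assumes that table 11 shares its amino-acid assignments with the standard code and that an alternative start codon such as GTG is read as methionine at the first position. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is translated as M when initiating.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if seq[:3] not in START_CODONS:
        raise ValueError(f"{seq[:3]!r} is not a table 11 start codon")
    residues = ["M"]  # initiator codon is read as methionine
    for i in range(3, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate_cds("GTGGCGCAGG CAGGTTTTAT CTTAACCCGA"))  # MAQAGFILTR
```

The printed residues match the first block of the protein sequence, which begins with M even though the gene starts with GTG.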