Gene EcolC_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3597 
Symbol 
ID6067600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3933801 
End bp3936152 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content55% 
IMG OID641603015 
ProductDNA polymerase II 
Protein accessionYP_001726538 
Protein GI170021584 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000326268 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGGCGCAGG CAGGTTTTAT CTTAACCCGA CACTGGCGGG ACACCCCGCA AGGGACAGAA 
GTCTCCTTCT GGCTGGCGAC GGACAACGGG CCGTTGCAGG TTACGCTTGC ACCGCAAGAG
TCCGTGGCGT TTATTCCCGC CGATCAGGTT CCCCGCGCTC AGCATATTTT GCAGGGTGAA
CAAGGCTTTC GCCTGACACC GCTGGCGTTA AAGGATTTTC ACCGCCAGCC GGTGTATGGC
CTTTACTGTC GCGCCCATCG CCAATTGATG AATTACGAAA AGCGCCTGCG TGAAGGTGGC
GTTACCGTCT ACGAGGCCGA TGTGCGTCCG CCAGAACGCT ATCTGATGGA GCGGTTTATC
ACCTCACCGG TGTGGGTCGA GGGTGATATG CACAATGGCA CTATCGTTAA TGCCCGTCTG
AAACCGCATC CCGACTATCG TCCGCCGCTC AAGTGGGTTT CTATAGATAT TGAAACCACC
CGCCACGGTG AGCTGTACTG CATCGGCCTG GAAGGCTGCG GGCAGCGCAT CGTTTATATG
CTGGGGCCGG AGAATGGCGA CGCCTCCTCG CTTGATTTCG AACTGGAATA CGTCGCCAGC
CGCCCGCAGT TGCTGGAAAA ACTCAACGCC TGGTTTGCCA ACTACGATCC TGATGTGATC
ATCGGTTGGA ACGTGGTGCA GTTCGATCTG CGAATGCTGC AAAAACATGC CGAGCGTTAC
CGTCTTCCGC TGCGTCTTGG GCGCGATAAT AGCGAGCTGG AGTGGCGCGA GCACGGCTTT
AAAAACGGCG TCTTTTTTGC CCAGGCTAAA GGTCGGCTAA TTATCGACGG TATCGAGGCG
CTGAAATCCG CGTTCTGGAA TTTCTCTTCA TTCTCGCTGG AAACTGTCGC TCAGGAGCTA
TTAGGCGAAG GAAAATCTAT CGATAACCCG TGGGATCGAA TGGACGAAAT TGACCGCCGT
TTCGCCGAAG ATAAACCTGC GCTGGCAACT TATAACCTGA AAGATTGCGA GCTGGTGACG
CAGATCTTCC ACAAAACTGA AATCATGCCA TTTTTACTCG AACGGGCAAC GGTGAACGGC
CTGCCGGTGG ACCGACACGG CGGTTCGGTG GCGGCATTTG GTCATCTCTA TTTTCCGCGA
ATGCATCGCG CTGGTTATGT CGCGCCTAAT CTCGGCGAAG TGCCGCCGCA CGCCAGCCCT
GGCGGCTACG TGATGGATTC ACGGCCAGGG CTTTATGATT CAGTGCTGGT GCTGGACTAT
AAAAGCCTGT ACCCGTCGAT CATCCGCACC TTTCTGATTG ATCCCGTCGG GCTGGTGGAA
GGCATGGCGC AGCCTGATCC AGAGCACAGT ACCGAAGGTT TTCTCGATGC CTGGTTCTCG
CGAGAAAAAC ATTGCCTGCC GGAGATTGTG ACTAACATCT GGCACGGGCG CGATGAAGCC
AAACGCCAGG GTAACAAACC GCTGTCGCAG GCGCTGAAAA TCATCATGAA TGCCTTTTAT
GGCGTGCTCG GCACCACCGC CTGCCGCTTC TTCGATCCGC GGCTGGCATC GTCGATCACC
ATGCGTGGTC ATCAGATCAT GCGGCAAACC AAAGCGTTGA TTGAAGCACA GGGCTACGAC
GTTATCTACG GCGATACCGA CTCAACGTTT GTCTGGCTGA AAGGCGCACA TTCGGAAGAA
GAAGCGGCGA AAATCGGTCG TGCACTGGTG CAGCACGTTA ACGCCTGGTG GGCGGAAACG
CTGCAAAAAC AACGGCTGAC CAGCGCATTA GAACTGGAGT ATGAAACCCA TTTCTGCCGT
TTTCTGATGC CAACCATTCG CGGAGCCGAT ACCGGCAGTA AAAAGCGTTA TGCCGGACTG
ATTCAGGAGG GCGACAAGCA GCGGATGGTG TTTAAAGGGC TGGAAACCGT GCGCACCGAC
TGGACGCCGC TGGCCCAGCA GTTTCAGCAG GAGCTATACC TGCGCATCTT CCGCAACGAG
CCATATCAGG AATATGTACG CGAAACCATC GACAAACTGA TGGCGGGTGA ACTGGATGCG
CGACTGGTTT ACCGTAAACG CCTTCGCCGT CCGCTGAGCG AGTATCAGCG TAATGTGCCG
CCTCATGTAC GCGCCGCTCG CCTTGCCGAT GAAGAAAACC AAAAGCGTGG TCGCCCCTTG
CAATATCAGA ATCGCGGCAC CATTAAGTAC GTATGGACCA CCAACGGCCC GGAGCCGCTG
GACTACCAAC GTTCACCACT GGATTACGAA CACTATCTGA CCCGCCAGCT ACAACCCGTG
GCGGAGGGAA TACTCCCTTT TATTGAGGAT AATTTTGCTA CACTTATGAC CGGGCAACTT
GGGCTATTTT GA
 
Protein sequence
MAQAGFILTR HWRDTPQGTE VSFWLATDNG PLQVTLAPQE SVAFIPADQV PRAQHILQGE 
QGFRLTPLAL KDFHRQPVYG LYCRAHRQLM NYEKRLREGG VTVYEADVRP PERYLMERFI
TSPVWVEGDM HNGTIVNARL KPHPDYRPPL KWVSIDIETT RHGELYCIGL EGCGQRIVYM
LGPENGDASS LDFELEYVAS RPQLLEKLNA WFANYDPDVI IGWNVVQFDL RMLQKHAERY
RLPLRLGRDN SELEWREHGF KNGVFFAQAK GRLIIDGIEA LKSAFWNFSS FSLETVAQEL
LGEGKSIDNP WDRMDEIDRR FAEDKPALAT YNLKDCELVT QIFHKTEIMP FLLERATVNG
LPVDRHGGSV AAFGHLYFPR MHRAGYVAPN LGEVPPHASP GGYVMDSRPG LYDSVLVLDY
KSLYPSIIRT FLIDPVGLVE GMAQPDPEHS TEGFLDAWFS REKHCLPEIV TNIWHGRDEA
KRQGNKPLSQ ALKIIMNAFY GVLGTTACRF FDPRLASSIT MRGHQIMRQT KALIEAQGYD
VIYGDTDSTF VWLKGAHSEE EAAKIGRALV QHVNAWWAET LQKQRLTSAL ELEYETHFCR
FLMPTIRGAD TGSKKRYAGL IQEGDKQRMV FKGLETVRTD WTPLAQQFQQ ELYLRIFRNE
PYQEYVRETI DKLMAGELDA RLVYRKRLRR PLSEYQRNVP PHVRAARLAD EENQKRGRPL
QYQNRGTIKY VWTTNGPEPL DYQRSPLDYE HYLTRQLQPV AEGILPFIED NFATLMTGQL
GLF