Gene ECH74115_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0729 
SymbolholA 
ID6969381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp755207 
End bp756238 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content52% 
IMG OID643384763 
ProductDNA polymerase III subunit delta 
Protein accessionYP_002269276 
Protein GI209397273 
COG category[L] Replication, recombination and repair 
COG ID[COG1466] DNA polymerase III, delta subunit 
TIGRFAM ID[TIGR01128] DNA polymerase III, delta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000331026 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.281474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGGT TGTACCCGGA ACAACTCCGC GCGCAGCTCA ATGAAGGGCT GCGCGCGGCG 
TATCTTTTAC TTGGTAACGA TCCTCTGTTA TTGCAGGAAA GCCAGGACGC TGTTCGTCAG
GTAGCTGCGG CACAAGGATT CGAAGAACAC CACACTTTTT CCATTGATCC CAACACTGAC
TGGAATGCGA TCTTTTCGTT ATGCCAGGCC ATGAGTCTGT TTGCCAGTCG ACAAACGCTA
TTGCTGTTGT TACCAGAAAA CGGACCAAAC GCGGCGATCA ATGAACAACT TCTCACACTC
ACCGGACTTC TGCATGACGA CCTGCTGTTG ATCGTCCGCG GTAATAAATT AAGCAAAGCG
CAAGAAAATG CCGCCTGGTT TACTGCTCTT GCGAATCGCA GCGTGCAGGT GACCTGTCAG
ACACCGGAGC AGGCTCAGCT TCCCCGCTGG GTTGCTGCGC GCGCAAAACA GCTCAACTTA
GAACTGGATG ACGCGGCGAA TCAGGTGCTC TGCTACTGTT ATGAAGGTAA CCTGCTGGCG
CTGGCTCAGG CACTGGAGCG TTTATCGCTG CTCTGGCCAG ACGGCAAACT GACATTACCG
CGCGTTGAAC AGGCGGTGAA TGATGCCGCG CATTTCACTC CTTTTCATTG GGTTGATGCT
TTGTTGATGG GAAAAAGTAA GCGCGCATTG CATATTCTTC AGCAACTGCG TCTGGAAGGC
AGCGAACCGG TTATTTTGTT GCGCACATTA CAACGTGAAC TCTTGTTACT GGTTAACCTG
AAACGCCAGT CTGCCCATAC GCCACTGCGT GCGTTGTTTG ATAAGCATCG GGTATGGCAG
AACCGCCGGG GCATGATGGG CGAGGCGTTA AATCGCTTAA GTCAGCCGCA GTTACGTCAG
GCCGTGCAAC TCCTGACACG AACGGAACTC ACCCTCAAAC AAGATTACAG TCAGTCAGTG
TGGGCAGAGT TGGAAGGGTT ATCTCTTCTG TTGTGCCATA AACCCCTGGC GGACGTATTT
ATCGACGGTT GA
 
Protein sequence
MIRLYPEQLR AQLNEGLRAA YLLLGNDPLL LQESQDAVRQ VAAAQGFEEH HTFSIDPNTD 
WNAIFSLCQA MSLFASRQTL LLLLPENGPN AAINEQLLTL TGLLHDDLLL IVRGNKLSKA
QENAAWFTAL ANRSVQVTCQ TPEQAQLPRW VAARAKQLNL ELDDAANQVL CYCYEGNLLA
LAQALERLSL LWPDGKLTLP RVEQAVNDAA HFTPFHWVDA LLMGKSKRAL HILQQLRLEG
SEPVILLRTL QRELLLLVNL KRQSAHTPLR ALFDKHRVWQ NRRGMMGEAL NRLSQPQLRQ
AVQLLTRTEL TLKQDYSQSV WAELEGLSLL LCHKPLADVF IDG