Gene EcSMS35_0660 details


Gene Information

Locus tag: EcSMS35_0660
Symbol: holA
ID: 6146832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 671178
End bp: 672209
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 52%
IMG OID: 641615550
Product: DNA polymerase III subunit delta
Protein accession: YP_001742756
Protein GI: 170679792
COG category: [L] Replication, recombination and repair
COG ID: [COG1466] DNA polymerase III, delta subunit
TIGRFAM ID: [TIGR01128] DNA polymerase III, delta subunit
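
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 671178
END_BP = 672209
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the fields agree: 672209 - 671178 + 1 = 1032 bp, and 1032 / 3 - 1 = 343 aa.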


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000888397
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGGT TGTACCCGGA ACAACTCCGC GCGCAGCTCA ATGAAGGGCT GCGCGCGGCG 
TATCTTTTAC TTGGTAACGA TCCTCTGTTA TTGCAGGAAA GCCAGGACGC TGTTCGTCAG
GTCGCTGCGG CACAAGGATT CGAAGAACAC CACACTTTTT CCATTGATCC CAACACTGAC
TGGAATGCGA TCTTTTCGTT ATGCCAGGCT ATGAGTCTGT TTGCCAGTCG ACAAACGCTA
TTGCTGTTGT TACCAGAAAA CGGACCGAAT GCGGCGATCA ATGAACAACT TCTCACACTC
ACCGGACTTC TGCATGACGA CCTGCTGTTG ATCGTCCGCG GTAATAAATT AAGCAAAGCG
CAAGAAAATG CCGCCTGGTT TACTGCTCTT GCGAATCGCA GCGTGCAGGT GACCTGTCAG
ACACCGGAGC AGGCTCAGCT TCCCCGCTGG GTTGCTGCGC GTGCAAAACA GCTCAACTTA
GAACTGGATG ACGCTGCAAA TCAGGTGCTC TGCTACTGTT ATGAAGGTAA CCTGCTGGCG
CTGGCCCAGG CACTGGAGCG TTTATCGCTG CTCTGGCCAG ACGGCAAACT GACATTACCG
CGCGTTGAAC AGGCGGTGAA TGATGCCGCG CATTTCACCC CTTTTCATTG GGTTGATGCT
TTGTTGATGG GAAAAAGTAA GCGCGCGTTG CATATTCTTC AGCAACTGCG TCTGGAAGGC
AGCGAACCGG TTATTTTGTT GCGCACATTA CAACGTGAAC TGTTGTTACT GGTTAACCTG
AAACGCCAGT CTGCCCATAT GCCACTGCGT GCATTGTTTG ATAAGCATCG GGTATGGCAG
AACCGCCGGG GCATGATGGG CGAGGCGTTA AATCGCTTAA GCCAGCCGCA GTTACGTCAG
GCCGTGCAAC TCCTGACACG AACGGAACTC ACCCTCAAAC AAGATTACGG TCAGTCAGTG
TGGGCAGAGC TGGAAGGGTT ATCTCTTCTG TTGTGCCATA AACCCCTGGC GGACGTATTT
ATCGACGGTT GA
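
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the figure is the percentage of G and C bases over the whole gene and that the spaces and line breaks in the listing are only display grouping; `gc_percent` is a hypothetical helper, not part of any database tool.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Excerpt only (first line of the gene sequence above). Paste the full
# 1032 bp sequence to compare against the GC content reported for the gene.
excerpt = "ATGATTCGGT TGTACCCGGA ACAACTCCGC GCGCAGCTCA ATGAAGGGCT GCGCGCGGCG"
print(round(gc_percent(excerpt), 1))
```

A short excerpt is not expected to match the gene-wide value, since base composition varies along the sequence.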
 
Protein sequence
MIRLYPEQLR AQLNEGLRAA YLLLGNDPLL LQESQDAVRQ VAAAQGFEEH HTFSIDPNTD 
WNAIFSLCQA MSLFASRQTL LLLLPENGPN AAINEQLLTL TGLLHDDLLL IVRGNKLSKA
QENAAWFTAL ANRSVQVTCQ TPEQAQLPRW VAARAKQLNL ELDDAANQVL CYCYEGNLLA
LAQALERLSL LWPDGKLTLP RVEQAVNDAA HFTPFHWVDA LLMGKSKRAL HILQQLRLEG
SEPVILLRTL QRELLLLVNL KRQSAHMPLR ALFDKHRVWQ NRRGMMGEAL NRLSQPQLRQ
AVQLLTRTEL TLKQDYGQSV WAELEGLSLL LCHKPLADVF IDG
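
The relationship between the two listings can be sketched as a codon-by-codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the start codons it permits, so a standard table is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative and not taken from any particular library.

```python
# Standard genetic code in NCBI order: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop display spaces and newlines
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGATTCGGT TGTACCCGGA ACAACTCCGC"))  # MIRLYPEQLR
print(translate("ATCGACGGTT GA"))                     # IDG
```

Both results match the start and end of the protein sequence shown in this record; the final TGA codon is a stop and contributes no residue.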