Gene EcolC_3005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3005 
SymbolholA 
ID6065950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3285629 
End bp3286660 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content53% 
IMG OID641602422 
ProductDNA polymerase III subunit delta 
Protein accessionYP_001725957 
Protein GI170021003 
COG category[L] Replication, recombination and repair 
COG ID[COG1466] DNA polymerase III, delta subunit 
TIGRFAM ID[TIGR01128] DNA polymerase III, delta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00124747 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGGT TGTACCCGGA ACAACTCCGC GCGCAGCTCA ATGAAGGGCT GCGCGCGGCG 
TATCTTTTAC TTGGTAACGA TCCTCTGTTA TTGCAGGAAA GCCAGGACGC TGTTCGTCAG
GTAGCTGCGG CACAAGGATT CGAAGAACAC CACACTTTTT CCATTGATCC CAACACTGAC
TGGAATGCGA TCTTTTCGTT ATGCCAGGCC ATGAGTCTGT TTGCCAGTCG ACAAACGCTA
TTGCTGTTGT TACCAGAAAA CGGACCAAAT GCGGCGATCA ATGAACACCT CCTCACGCTC
ACCGGATTGC TGCATGACGA TCTGCTGTTG ATCGTCCGCG GTAATAAATT AAGCAAAGCG
CAAGAAAATG CCGCCTGGTT TACTGCTCTT GCGAATCGCA GCGTGCAGGT GACCTGTCAG
ACACCGGAGC AGGCTCAGCT TCCCCGCTGG GTTGCTGCGC GCGCAAAACA GCTCAACTTA
GAACTGGATG ACGCGGCAAA TCAGGTGCTC TGCTACTGTT ATGAAGGTAA CCTGCTGGCG
CTGGCTCAGG CACTGGAGCG TTTATCGCTG CTCTGGCCAG ACGGCAAATT GACATTACCG
CGCGTTGAAC AGGCGGTGAA TGATGCCGCG CATTTCACCC CTTTTCATTG GGTTGATGCT
TTGTTGATGG GAAAAAGTAA GCGCGCGTTG CATATTCTTC AGCAACTGCG TCTGGAAGGC
AGCGAGCCGG TTATTTTGTT GCGCACATTA CAACGTGAAC TGTTGTTACT GGTTAACCTG
AAACGCCAGT CTGCCCATAC GCCACTGCGT GCGTTGTTTG ATAAGCATCG GGTATGGCAG
AACCGCCGGG GCATGATGGG CGAGGCGTTA AATCGCTTAA GTCAGCCGCA GTTACGTCAG
GCCGTGCAAC TCCTGACACG AACGGAACTC ACCCTCAAAC AAGATTACGG TCAGTCAGTG
TGGGCAGAGC TGGAAGGGTT ATCTCTTCTG TTGTGCCATA AACCCCTGGC GGACGTATTT
ATCGACGGTT GA
 
Protein sequence
MIRLYPEQLR AQLNEGLRAA YLLLGNDPLL LQESQDAVRQ VAAAQGFEEH HTFSIDPNTD 
WNAIFSLCQA MSLFASRQTL LLLLPENGPN AAINEHLLTL TGLLHDDLLL IVRGNKLSKA
QENAAWFTAL ANRSVQVTCQ TPEQAQLPRW VAARAKQLNL ELDDAANQVL CYCYEGNLLA
LAQALERLSL LWPDGKLTLP RVEQAVNDAA HFTPFHWVDA LLMGKSKRAL HILQQLRLEG
SEPVILLRTL QRELLLLVNL KRQSAHTPLR ALFDKHRVWQ NRRGMMGEAL NRLSQPQLRQ
AVQLLTRTEL TLKQDYGQSV WAELEGLSLL LCHKPLADVF IDG