Gene EcE24377A_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0666 
SymbolholA 
ID5586759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp693250 
End bp694281 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content53% 
IMG OID640924382 
ProductDNA polymerase III subunit delta 
Protein accessionYP_001461808 
Protein GI157155534 
COG category[L] Replication, recombination and repair 
COG ID[COG1466] DNA polymerase III, delta subunit 
TIGRFAM ID[TIGR01128] DNA polymerase III, delta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000205044 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGGT TGTACCCGGA ACAACTCCGC GCGCAGCTCA ATGAAGGGCT GCGCGCGGCG 
TATCTTTTAC TTGGTAACGA TCCTCTGTTA TTGCAGGAAA GCCAGGACGC TGTTCGTCAG
GTAGCTGCGG CACAAGGATT CGAAGAACAC CACACTTTTT CCATTGATCC CAACACTGAC
TGGAATGCGA TCTTTTCGTT ATGCCAGGCT ATGAGTCTGT TTGCCAGTCG ACAAACGCTA
TTGCTGTTGT TACCAGAAAA CGGACCGAAT GGGGCGATCA ATGAACAACT TCTCACACTC
ACCGGACTTC TGCATGACGA CCTGCTGTTG ATCGTCCGCG GTAATAAATT AAGCAAAGCG
CAAGAAAATG CCGCCTGGTT TACTGCTCTT GCGAATCGCA GCGTGCAGGT GACCTGTCAG
ACACCGGAGC AGGCTCAGCT TCCCCGCTGG GTTGCTGCGC GCGCAAAACA GCTCAACTTA
GAACTGGATG ACGCGGCGAA TCAGGTGCTC TGCTACTGTT ATGAAGGTAA CCTGCTGGCG
CTGGCTCAGG CACTGGAGCG TTTATCGCTG CTCTGGCCAG ACGGCAAATT GACATTACCG
CGCGTTGAAC AGGCGGTGAA TGATGCCGCG CATTTCACCC CTTTTCATTG GGTTGATGCT
TTGTTGATGG GAAAAAGTAA GCGCGCGTTG CATATTCTTC AGCAACTGCG TCTGGAAGGC
AGCGAGCCGG TTATTTTGTT GCGCACATTA CAACGTGAAC TGTTGTTACT GGTGAACCTG
AAACGCCAGT CTGCCCATAC GCCACTGCGT GCGTTGTTTG ATAAGCATCG GGTATGGCAG
AACCGCCGGG GCATGATGGG CGAGGCGTTA AATCGCTTAA GCCAGCCGCA GTTACGTCAG
GCTGTGCAAC TCCTGACACG AACGGAACTC ACCCTCAAAC AAGATTACGG TCAGTCAGTG
TGGGCAGAGC TGGAAGGGTT ATCTCTTCTG TTGTGCCATA AACCCCTGGC GGACGTATTT
ATCGACGGTT GA
 
Protein sequence
MIRLYPEQLR AQLNEGLRAA YLLLGNDPLL LQESQDAVRQ VAAAQGFEEH HTFSIDPNTD 
WNAIFSLCQA MSLFASRQTL LLLLPENGPN GAINEQLLTL TGLLHDDLLL IVRGNKLSKA
QENAAWFTAL ANRSVQVTCQ TPEQAQLPRW VAARAKQLNL ELDDAANQVL CYCYEGNLLA
LAQALERLSL LWPDGKLTLP RVEQAVNDAA HFTPFHWVDA LLMGKSKRAL HILQQLRLEG
SEPVILLRTL QRELLLLVNL KRQSAHTPLR ALFDKHRVWQ NRRGMMGEAL NRLSQPQLRQ
AVQLLTRTEL TLKQDYGQSV WAELEGLSLL LCHKPLADVF IDG