Gene EcHS_A0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0547 
SymboldnaX 
ID5590924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp558157 
End bp560088 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content58% 
IMG OID640919731 
ProductDNA polymerase III subunits gamma and tau 
Protein accessionYP_001457315 
Protein GI157159997 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit
[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.014697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTATC AGGTCTTAGC CCGAAAATGG CGCCCACAAA CCTTTGCTGA CGTCGTCGGC 
CAGGAACATG TGCTGACCGC ACTGGCGAAC GGCTTGTCGT TAGGGCGTAT TCATCATGCT
TATCTTTTTT CCGGCACCCG TGGCGTCGGA AAAACCTCTA TCGCCCGACT GCTGGCGAAG
GGGCTAAACT GCGAAACCGG CATTACCGCG ACGCCGTGCG GCGTGTGCGA TAACTGTCGT
GAAATCGAGC AGGGGCGCTT TGTCGATCTG ATTGAAATCG ACGCCGCCTC GCGCACCAAA
GTTGAAGATA CCCGCGACCT GCTGGATAAC GTCCAGTACG CTCCGGCGCG TGGTCGTTTC
AAAGTTTATC TGATCGACGA AGTGCATATG CTGTCGCGCC ACAGCTTTAA CGCACTGTTA
AAAACCCTTG AAGAGCCGCC GGAGCACGTT AAGTTTCTGC TGGCGACGAC CGATCCACAG
AAATTGCCGG TGACGATTTT GTCACGCTGT CTGCAATTTC ATCTCAAGGC GCTGGATGTC
GAGCAAATTC GCCATCAGCT TGAGCACATC CTCAACGAAG AACATATCGC TCACGAGCCG
CGGGCGTTGC AATTGCTGGC GCGCGCCGCT GAAGGCAGCC TGCGAGATGC CTTAAGTCTG
ACCGACCAGG CGATTGCCAG CGGTGACGGC CAGGTTTCAA CCCAGGCGGT CAGTGCGATG
CTGGGTACGC TTGACGACGA TCAGGCGCTG TCGCTGGTTG AAGCGATGGT CGAGGCCAAC
GGCGAGCGCG TAATGGCGCT GATTAATGAA GCCGCTGCCC GTGGTATCGA GTGGGAAGCG
TTGCTGGTGG AAATGCTCGG TCTGTTGCAT CGTATTGCGA TGGTACAACT TTCGCCTGCT
GCACTTGGCA ACGACATGGC CGCCATCGAG CTGCGGATGC GTGAACTGGC GCGCACCATA
CCGCCGACGG ATATTCAGCT TTACTATCAG ACGCTGTTGA TTGGTCGCAA AGAATTACCG
TATGCGCCGG ACCGCCGCAT GGGCGTTGAG ATGACGCTGC TGCGCGCGCT GGCATTCCAT
CCGCGTATGC CGCTGCCTGA GCCAGAAGTG CCACGCCAGT CCTTTGCACC TGTCGCACCA
ACGGCAGTAA TGACGCCAAC CCAGGTGCCG CCGCAACCGC AATCAGCGCC GCAGCAGGCT
CCGACTGTAC CGCTCCCGGA AACCACCAGC CAGGTGCTGG CGGCGCGCCA GCAGTTGCAG
CGCGTGCAGG GAGCAACCAA AGCAAAAAAG AGTGAACCGG CAGCCGCTAC CCGCGCGCGG
CCGGTGAATA ACGCTGCGCT GGAAAGACTG GCTTCGGTCA CCGATCGCGT TCAGGCGCGT
CCGGTGCCAT CGGCGCTGGA AAAAGCGCCA GCTAAAAAAG AAGCGTATCG CTGGAAGGCG
ACCACTCCGG TGATGCAGCA AAAAGAAGTG GTCGCCACGC CGAAGGCGCT GAAAAAAGCG
CTGGAACATG AAAAAACGCC GGAACTGGCG GCGAAGCTGG CGGCAGAAGC CATTGAGCGC
GACGCGTGGG CGGCACAGGT TAGCCAACTT TCGCTACCAA AACTGGTCGA ACAGGTGGCC
TTAAATGCCT GGAAAGAGGA GAGCGACAAC GCAGTATGTC TGCATTTGCG CACCTCTCAG
CGGCATTTGA ACAACCGCGG TGCACAGCAA AAACTGGCTG AAGCGTTGAG CACGTTAAAA
GGTTCAACGG TTGAACTGAC TATCGTTGAA GATGATAATC CCGCGGTGCG TACGCCGCTG
GAATGGCGTC AGGCGATATA CGAAGAAAAA CTTGCGCAGG CGCGCGAGTC CATTATTGCG
GATAATAATA TTCAGACCCT GCGTCGATTC TTCGATGCGG AGCTGGATGA AGACAGTATC
CGCCCCATTT GA
 
Protein sequence
MSYQVLARKW RPQTFADVVG QEHVLTALAN GLSLGRIHHA YLFSGTRGVG KTSIARLLAK 
GLNCETGITA TPCGVCDNCR EIEQGRFVDL IEIDAASRTK VEDTRDLLDN VQYAPARGRF
KVYLIDEVHM LSRHSFNALL KTLEEPPEHV KFLLATTDPQ KLPVTILSRC LQFHLKALDV
EQIRHQLEHI LNEEHIAHEP RALQLLARAA EGSLRDALSL TDQAIASGDG QVSTQAVSAM
LGTLDDDQAL SLVEAMVEAN GERVMALINE AAARGIEWEA LLVEMLGLLH RIAMVQLSPA
ALGNDMAAIE LRMRELARTI PPTDIQLYYQ TLLIGRKELP YAPDRRMGVE MTLLRALAFH
PRMPLPEPEV PRQSFAPVAP TAVMTPTQVP PQPQSAPQQA PTVPLPETTS QVLAARQQLQ
RVQGATKAKK SEPAAATRAR PVNNAALERL ASVTDRVQAR PVPSALEKAP AKKEAYRWKA
TTPVMQQKEV VATPKALKKA LEHEKTPELA AKLAAEAIER DAWAAQVSQL SLPKLVEQVA
LNAWKEESDN AVCLHLRTSQ RHLNNRGAQQ KLAEALSTLK GSTVELTIVE DDNPAVRTPL
EWRQAIYEEK LAQARESIIA DNNIQTLRRF FDAELDEDSI RPI