Gene EcHS_A0186 details


Gene Information

Locus tag: EcHS_A0186
Symbol: dnaE
ID: 5593439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 203283
End bp: 206765
Gene Length: 3483 bp
Protein Length: 1160 aa
Translation table: 11
GC content: 55%
IMG OID: 640919373
Product: DNA polymerase III subunit alpha
Protein accession: YP_001456967
Protein GI: 157159649
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
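
The coordinate and length fields above are mutually consistent. Assuming the start and end positions are 1-based and inclusive, the gene length is End - Start + 1, and the protein length is the number of codons minus one for the stop codon. A minimal sketch in Python; the function names are hypothetical and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(203283, 206765)
print(length)                  # 3483
print(protein_length(length))  # 1160
```

Both values match the Gene Length and Protein Length fields of this record.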


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0596563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
Num covering fosmid clones: n/a
 

Sequence

Gene sequence
ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC 
CTGGCCAAAA CCGCACCGTT GGTAAAAAAG GCGGCGGCGT TGGGTATGCC AGCACTGGCG
ATCACCGATT TCACCAACCT TTGTGGTCTG GTGAAGTTCT ACGGAGCGGG ACATGGCGCA
GGGATTAAGC CTATCGTCGG GGCAGATTTT AACGTCCAGT GCGACCTGCT GGGTGATGAG
TTAACCCACC TGACGGTACT GGCGGCGAAC AATACCGGCT ATCAGAATCT GACGTTGCTG
ATCTCAAAAG CGTATCAGCG CGGGTACGGT GCCGCCGGGC CGATCATCGA TCGCGACTGG
CTTATCGAAT TAAACGAAGG GTTGATCCTT CTTTCCGGCG GACGCATGGG CGACGTCGGA
CGCAGTCTTT TGCGTGGTAA CAGCGCGCTG GTAGATGAGT GTGTCGCGTT TTATGAAGAA
CACTTCCCGG ATCGCTATTT TCTCGAGCTG ATCCGCACCG GCAGGCCGGA TGAAGAAAGC
TATCTGCACG CGGCGGTGGA ACTGGCGGAA GCGCGCGGTT TGCCCGTCGT GGCGACCAAC
GACGTGCGCT TTATCGACAG CAGCGACTTT GACGCACACG AAATCCGCGT CGCGATCCAC
GACGGCTTTA CCCTCGACGA TCCTAAACGC CCGCGTAACT ATTCGCCGCA GCAATATATG
CGTAGCGAAG AGGAGATGTG TGAGCTGTTT GCCGACATCC CCGAAGCCCT TGCCAACACC
GTTGAGATCG CCAAACGCTG TAACGTAACC GTGCGTCTTG GTGAATACTT CCTGCCGCAG
TTCCCGACCG GGGACATGAG CACCGAAGAT TATCTGGTCA AGCGTGCAAA AGAGGGCCTG
GAAGAGCGTC TGGCCTTTTT ATTCCCTGAC GAGGAAGAAC GTGTTAAGCG CCGCCCGGAA
TATGACGAAC GTCTGGAGAC TGAACTTCAG GTTATCAACC AGATGGGCTT CCCGGGCTAC
TTCCTCATCG TTATGGAATT TATCCAGTGG TCGAAAGATA ACGGCGTACC GGTAGGGCCA
GGCCGTGGCT CCGGTGCGGG TTCACTGGTG GCCTACGCGC TGAAAATCAC CGACCTCGAT
CCGCTGGAAT TTGACCTGCT GTTCGAACGT TTCCTTAACC CGGAACGTGT CTCCATGCCT
GACTTCGACG TTGACTTCTG TATGGAGAAA CGCGATCAGG TTATCGAGCA CGTAGCGGAC
ATGTACGGTC GTGATGCGGT ATCGCAGATC ATCACCTTCG GTACAATGGC GGCGAAAGCG
GTGATCCGCG ACGTAGGCCG CGTGCTGGGG CATCCGTACG GCTTTGTCGA TCGTATCTCG
AAACTGATCC CGCCCGATCC GGGGATGACG CTGGCGAAAG CGTTTGAAGC CGAGCCGCAG
CTGCCGGAAA TCTACGAAGC GGATGAAGAA GTTAAGGCGC TGATCGACAT GGCGCGCAAA
CTGGAAGGGG TCACCCGTAA CGCCGGTAAG CACGCCGGTG GGGTGGTTAT CGCGCCGACC
AAAATTACCG ATTTTGCGCC GCTTTACTGC GATGAAGAGG GCAAACATCC GGTCACCCAG
TTTGATAAAA GCGACGTTGA ATACGCCGGA CTGGTGAAGT TCGACTTCCT TGGTTTGCGT
ACGCTCACCA TCATCAACTG GGCGCTGGAG ATGATCAACA AGCGGCGGGC GAAGAATGGC
GAGCCGCCGC TGGATATCGC CGCGATCCCG CTGGATGACA AGAAAAGCTT CGACATGCTG
CAACGCTCGG AAACCACGGC GGTATTCCAG CTTGAATCGC GCGGCATGAA GGACCTGATC
AAGCGTCTGC AACCTGACTG CTTCGAAGAT ATGATCGCAC TGGTGGCGCT GTTCCGCCCA
GGTCCGTTGC AATCAGGGAT GGTGGATAAC TTTATCGACC GTAAACATGG TCGCGAAGAA
ATCTCCTATC CGGACGTACA GTGGCAGCAT GAAAGCCTGA AACCGGTACT GGAGCCAACC
TACGGCATCA TCCTGTATCA GGAACAGGTC ATGCAGATTG CGCAGGTGCT TTCTGGTTAT
ACCCTCGGTG GCGCGGATAT GCTGCGTCGT GCGATGGGTA AGAAAAAGCC GGAAGAGATG
GCTAAGCAGC GTTCTGTATT TGCTGAAGGT GCAGAAAAGA ACGGAATCAA CGCCGAACTG
GCGATGAAAA TCTTCGACCT GGTGGAGAAA TTCGCGGGTT ACGGATTTAA CAAATCGCAC
TCTGCGGCCT ATGCTTTGGT GTCATATCAA ACGTTATGGC TGAAAGCGCA CTATCCTGCG
GAGTTTATGG CGGCGGTAAT GACCGCCGAT ATGGACAACA CCGAGAAGGT GGTGGGCCTG
GTGGATGAGT GCTGGCGGAT GGGGCTGAAA ATCCTGCCAC CAGATATAAA CTCCGGTCTT
TACCATTTCC ACGTCAACGA CGACGGCGAA ATCGTGTATG GTATTGGCGC CATCAAAGGG
GTAGGTGAAG GTCCGATTGA GGCCATCATC GAAGCCCGTA ATAAAGGCGG CTACTTCCGC
GAACTGTTTG ATCTTTGCGC CCGTACCGAC ACCAAAAAGT TAAACCGGCG GGTGCTGGAA
AAACTGATCA TGTCCGGGGC GTTTGACCGT CTTGGACCAC ACCGCGCGGC ACTGATGAAC
TCGCTGGGCG ATGCGTTAAA AGCGGCAGAT CAACACGCGA AAGCGGAAGC TATCGGTCAG
GCCGATATGT TCGGCGTGCT GGCCGAAGAG CCGGAACAAA TTGAACAATC CTACGCCAGC
TGCCAACCGT GGCCGGAGCA GGTGGTATTA GATGGGGAAC GTGAAACGTT AGGCCTGTAC
CTGACCGGAC ACCCTATCAA CCAGTATTTA AAAGAGATTG AGCGTTATGT CGGAGGCGTA
AGGCTGAAAG ACATGCACCC GACAGAACGT GGTAAAGTCA TCACGGCTGC GGGGCTCGTT
GTTGCCGCGC GGGTTATGGT CACCAAGCGC GGCAATCGTA TCGGTATCTG CACGCTGGAT
GACCGTTCCG GGCGGCTGGA AGTGATGTTG TTTACTGACG CCCTGGATAA ATACCAGCAA
TTGCTGGAAA AAGACCGCAT ACTTATCGTC AGCGGACAGG TCAGCTTTGA TGACTTCAGC
GGTGGGCTTA AAATGACCGC TCGCGAAGTG ATGGATATTG ACGAAGCCCG GGAAAAATAT
GCTCGCGGGC TTGCTATCTC GCTGACGGAC AGGCAAATTG ATGACCAGCT TTTAAACCGA
CTCCGTCAGT CTCTGGAACC CCACCGCTCT GGGACAATTC CAGTACATCT CTACTATCAG
AGGGCGGATG CACGCGCGCG GTTGCGTTTT GGCGCGACGT GGCGTGTCTC TCCGAGCGAT
CGTTTATTAA ACGATCTCCG TGGCCTCATT GGTTCGGAGC AGGTGGAACT GGAGTTTGAC
TAA
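
The gene sequence is displayed in blocks of 10 bases separated by spaces, with 60 bases per line, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene:

```python
def clean_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a short example.
fragment = clean_sequence("ATGTCTGAAC CACGTTTCGT")
print(fragment)               # ATGTCTGAACCACGTTTCGT
print(gc_fraction(fragment))  # 0.45
```

Running `gc_fraction` on the full cleaned sequence gives the value to compare with the reported GC content.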
 
Protein sequence
MSEPRFVHLR VHSDYSMIDG LAKTAPLVKK AAALGMPALA ITDFTNLCGL VKFYGAGHGA 
GIKPIVGADF NVQCDLLGDE LTHLTVLAAN NTGYQNLTLL ISKAYQRGYG AAGPIIDRDW
LIELNEGLIL LSGGRMGDVG RSLLRGNSAL VDECVAFYEE HFPDRYFLEL IRTGRPDEES
YLHAAVELAE ARGLPVVATN DVRFIDSSDF DAHEIRVAIH DGFTLDDPKR PRNYSPQQYM
RSEEEMCELF ADIPEALANT VEIAKRCNVT VRLGEYFLPQ FPTGDMSTED YLVKRAKEGL
EERLAFLFPD EEERVKRRPE YDERLETELQ VINQMGFPGY FLIVMEFIQW SKDNGVPVGP
GRGSGAGSLV AYALKITDLD PLEFDLLFER FLNPERVSMP DFDVDFCMEK RDQVIEHVAD
MYGRDAVSQI ITFGTMAAKA VIRDVGRVLG HPYGFVDRIS KLIPPDPGMT LAKAFEAEPQ
LPEIYEADEE VKALIDMARK LEGVTRNAGK HAGGVVIAPT KITDFAPLYC DEEGKHPVTQ
FDKSDVEYAG LVKFDFLGLR TLTIINWALE MINKRRAKNG EPPLDIAAIP LDDKKSFDML
QRSETTAVFQ LESRGMKDLI KRLQPDCFED MIALVALFRP GPLQSGMVDN FIDRKHGREE
ISYPDVQWQH ESLKPVLEPT YGIILYQEQV MQIAQVLSGY TLGGADMLRR AMGKKKPEEM
AKQRSVFAEG AEKNGINAEL AMKIFDLVEK FAGYGFNKSH SAAYALVSYQ TLWLKAHYPA
EFMAAVMTAD MDNTEKVVGL VDECWRMGLK ILPPDINSGL YHFHVNDDGE IVYGIGAIKG
VGEGPIEAII EARNKGGYFR ELFDLCARTD TKKLNRRVLE KLIMSGAFDR LGPHRAALMN
SLGDALKAAD QHAKAEAIGQ ADMFGVLAEE PEQIEQSYAS CQPWPEQVVL DGERETLGLY
LTGHPINQYL KEIERYVGGV RLKDMHPTER GKVITAAGLV VAARVMVTKR GNRIGICTLD
DRSGRLEVML FTDALDKYQQ LLEKDRILIV SGQVSFDDFS GGLKMTAREV MDIDEAREKY
ARGLAISLTD RQIDDQLLNR LRQSLEPHRS GTIPVHLYYQ RADARARLRF GATWRVSPSD
RLLNDLRGLI GSEQVELEFD
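
The protein sequence is the translation of the gene sequence under translation table 11. A minimal, dependency-free sketch of that translation follows. It assumes the CDS starts with ATG, as this one does; table 11 also permits alternative start codons, which this sketch does not handle. The codon-to-residue assignments for table 11 are the same as in the standard code. All names are hypothetical:

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with their residues; '*' marks a stop.
BASES = "TCAG"
RESIDUES = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), RESIDUES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence shown above.
print(translate("ATGTCTGAAC CACGTTTCGT ACACCTGCGG"))  # MSEPRFVHLR
print(translate("GAGTTTGACTAA"))                      # EFD
```

The two fragments reproduce the first ten and last three residues of the protein sequence.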