Gene EcSMS35_0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0195 
SymboldnaE 
ID6145063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp214308 
End bp217790 
Gene Length3483 bp 
Protein Length1160 aa 
Translation table11 
GC content55% 
IMG OID641615096 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_001742312 
Protein GI170682325 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC 
CTGGCCAAAA CCGCACCGCT GGTAAAAAAG GCGGCGGCGT TGGGTATGCC AGCACTGGCG
ATCACCGATT TCACCAACCT TTGCGGTCTG GTGAAGTTCT ACGGAGCGGG ACATGGCGCA
GGGATTAAGC CCATCGTCGG GGCGGATTTT AACGTCCAGT GCGACCTGCT GGGTGATGAG
TTAACCCACC TGACGGTGCT GGCGGCGAAC AATACCGGCT ATCAGAATCT GACGTTGCTG
ATCTCAAAAG CGTATCAGCG CGGGTACGGT GCCGCCGGGC CGATCATCGA TCGCGACTGG
CTTATCGAAT TAAATGAAGG GTTGATCCTT CTTTCCGGCG GGCGCATGGG CGACGTTGGA
CGCAGTCTTT TGCGTGGTAA CAGCGCGCTG GTAGATGAGT GTGTCGCGTT TTATGAAGAA
CACTTCCCGG ATCGCTATTT TCTCGAGCTG ATCCGCACCG GCAGGCCGGA CGAAGAAAGC
TATCTGCACG CTGCGGTGGA ACTGGCGGAA TCACGCGGTT TGCCCGTTGT GGCGACCAAC
GACGTGCGCT TTATCGACAG CAGCGACTTT GACGCACACG AAATCCGCGT CGCGATCCAC
GACGGCTTTA CCCTCGACGA TCCTAAACGC CCGCGTAACT ATTCGCCGCA GCAATATATG
CGTAGCGAAG AGGAGATGTG CGAGCTTTTT GCCGACATTC CCGAAGCCCT TGCCAACACC
GTTGAGATAG CCAAACGTTG TAACGTAACT GTGCGTCTGG GTGAATACTT CTTGCCGCAA
TTCCCGACCG GGGACATGAG CACCGAAGAT TATCTGGTCA AGCGTGCAAA AGAAGGCCTG
GAAGAGCGTC TGGCCTTTTT ATTCCCTGAC GAGGAAGAAC GTCTGAAGCG CCGCCCGGAA
TATGACGAGC GTCTGGAGAC TGAACTTCAG GTTATCAACC AGATGGGCTT CCCGGGTTAC
TTCCTCATCG TTATGGAATT TATCCAGTGG TCGAAAGATA ACGGCGTACC GGTAGGGCCA
GGCCGTGGCT CCGGTGCGGG TTCACTGGTG GCCTACGCGC TGAAAATCAC CGACCTCGAT
CCGCTGGAAT TTGACCTGCT GTTCGAACGT TTCCTTAACC CGGAACGTGT CTCCATGCCT
GACTTCGACG TTGACTTCTG TATGGAGAAA CGCGACCAGG TTATCGAACA TGTGGCGGAC
ATGTACGGTC GCGATGCGGT ATCGCAGATT ATCACCTTCG GTACGATGGC GGCGAAAGCG
GTGATCCGCG ACGTAGGCCG CGTGCTGGGC CATCCGTACG GCTTTGTCGA TCGTATCTCG
AAACTGATCC CGCCCGATCC GGGGATGACG CTGGCGAAAG CGTTTGAAGC CGAGCCGCAG
CTGCCGGAAA TCTACGAAGC GGATGAAGAA GTTAAGGCGC TGATCGACAT GGCGCGCAAA
CTGGAAGGGG TCACCCGTAA CGCCGGTAAG CACGCCGGTG GGGTGGTTAT CGCACCGACC
AAAATTACCG ATTTTGCGCC GCTTTACTGC GATGAAGAGG GCAAACATCC GGTCACCCAG
TTTGATAAAA GCGACGTTGA ATACGCCGGG CTGGTGAAGT TCGACTTCCT CGGTTTGCGT
ACGCTCACCA TCATCAACTG GGCGCTGGAG ATGATCAACA AGCGGCGGGC GAAGAATGGC
GAGCCGCCGC TGGATATCGC CGCGATCCCG CTGGATGACA AGAAAAGCTT CGACATGCTG
CAACGCTCGG AAACCACGGC GGTATTCCAG CTTGAATCGC GCGGCATGAA GGACCTGATC
AAGCGTCTGC AACCTGACTG CTTCGAAGAT ATGATCGCAC TGGTGGCACT GTTCCGCCCT
GGTCCGTTGC AGTCAGGGAT GGTGGATAAC TTTATCGACC GTAAACATGG TCGCGAAGAG
ATCTCCTATC CGGACGTGCA GTGGCAGCAT GAAAGCCTGA AACCGGTACT GGAGCCAACC
TACGGCATCA TCCTGTATCA GGAACAGGTC ATGCAGATTG CCCAGGTGCT TTCTGGTTAT
ACCCTCGGTG GCGCGGATAT GCTGCGTCGT GCGATGGGTA AGAAAAAGCC GGAAGAGATG
GCTAAGCAGC GTTCTGTATT TGCTGAAGGT GCAGAAAAGA ACGGAATTAA CGCCGAACTG
GCGATGAAAA TCTTCGACCT GGTGGAGAAA TTCGCGGGTT ACGGATTTAA CAAATCGCAC
TCTGCGGCCT ATGCTTTGGT GTCATATCAA ACGTTATGGC TGAAAGCGCA CTATCCGGCG
GAGTTTATGG CGGCGGTAAT GACCGCCGAT ATGGACAACA CCGAGAAGGT GGTGGGCCTG
GTGGATGAGT GCTGGCGGAT GGGGCTGAAA ATCCTGCCAC CAGATATAAA CTCCGGTCTT
TACCATTTCC ACGTCAACGA CGACGGCGAA ATCGTGTATG GTATTGGCGC GATCAAAGGG
GTCGGTGAAG GTCCGATTGA GGCCATCATC GAAGCCCGTA ATAAAGGCGG CTACTTCCGC
GAACTGTTTG ATCTCTGCGC CCGAACCGAC ACCAAAAAGT TAAACCGGCG AGTGCTGGAA
AAATTGATCA TGTCCGGGGC GTTTGACCGT CTTGGGCCAC ACCGCGCGGC GTTGATGAAC
TCGCTGGGCG ATGCGTTAAA AGCGGCTGAT CAACACGCAA AAGCGGAAGC CATCGGTCAG
GCCGATATGT TCGGCGTGCT GGCAGAAGAG CCGGAACAAA TTGAACAATC CTACGCCAGC
TGCCAACCGT GGCCGGAGCA GGTGGTATTA GATGGGGAAC GTGAAACGTT AGGTCTGTAC
CTGACGGGAC ACCCTATCAA CCAGTATTTA AAAGAGATTG AGCGTTATGT CGGAGGCATA
AGGCTGAAAG ACATGCACCC GACAGAACGT GGTAAAGTCA TCACGGCTGC GGGGCTCGTT
GTTGCTGCGC GGGTTATGGT CACCAAGCGC GGCAATCGTA TCGGTATCTG CACGCTGGAT
GACCGTTCCG GGCGGCTGGA AGTGATGTTG TTTACTGACG CCCTGGATAA ATACCAGCAA
TTGCTGGAAA AAGACCGCAT ACTTATCGTC AGCGGACAGG TCAGCTTTGA TGACTTCAGC
GGTGGGCTTA AAATGACCGC TCGCGAAGTG ATGGATATTG ACGAAGCCCG GGAAAAATAT
GCTCGCGGGC TTGCTATCTC GCTGACGGAC AGGCAAATTG ATGACCAGCT TTTAAACCGA
CTCCGTCAGT CTCTGGAACC CCACCGCTCT GGGACAATTC CAGTACATCT CTACTATCAG
AGGGCGGATG CACGCGCGCG GTTGCGTTTT GGCGCGACGT GGCGTGTCTC TCCGAGCGAT
CGTTTATTAA ACGATCTCCG TGGCCTCATT GGTTCGGAGC AGGTGGAACT GGAGTTTGAC
TAA
 
Protein sequence
MSEPRFVHLR VHSDYSMIDG LAKTAPLVKK AAALGMPALA ITDFTNLCGL VKFYGAGHGA 
GIKPIVGADF NVQCDLLGDE LTHLTVLAAN NTGYQNLTLL ISKAYQRGYG AAGPIIDRDW
LIELNEGLIL LSGGRMGDVG RSLLRGNSAL VDECVAFYEE HFPDRYFLEL IRTGRPDEES
YLHAAVELAE SRGLPVVATN DVRFIDSSDF DAHEIRVAIH DGFTLDDPKR PRNYSPQQYM
RSEEEMCELF ADIPEALANT VEIAKRCNVT VRLGEYFLPQ FPTGDMSTED YLVKRAKEGL
EERLAFLFPD EEERLKRRPE YDERLETELQ VINQMGFPGY FLIVMEFIQW SKDNGVPVGP
GRGSGAGSLV AYALKITDLD PLEFDLLFER FLNPERVSMP DFDVDFCMEK RDQVIEHVAD
MYGRDAVSQI ITFGTMAAKA VIRDVGRVLG HPYGFVDRIS KLIPPDPGMT LAKAFEAEPQ
LPEIYEADEE VKALIDMARK LEGVTRNAGK HAGGVVIAPT KITDFAPLYC DEEGKHPVTQ
FDKSDVEYAG LVKFDFLGLR TLTIINWALE MINKRRAKNG EPPLDIAAIP LDDKKSFDML
QRSETTAVFQ LESRGMKDLI KRLQPDCFED MIALVALFRP GPLQSGMVDN FIDRKHGREE
ISYPDVQWQH ESLKPVLEPT YGIILYQEQV MQIAQVLSGY TLGGADMLRR AMGKKKPEEM
AKQRSVFAEG AEKNGINAEL AMKIFDLVEK FAGYGFNKSH SAAYALVSYQ TLWLKAHYPA
EFMAAVMTAD MDNTEKVVGL VDECWRMGLK ILPPDINSGL YHFHVNDDGE IVYGIGAIKG
VGEGPIEAII EARNKGGYFR ELFDLCARTD TKKLNRRVLE KLIMSGAFDR LGPHRAALMN
SLGDALKAAD QHAKAEAIGQ ADMFGVLAEE PEQIEQSYAS CQPWPEQVVL DGERETLGLY
LTGHPINQYL KEIERYVGGI RLKDMHPTER GKVITAAGLV VAARVMVTKR GNRIGICTLD
DRSGRLEVML FTDALDKYQQ LLEKDRILIV SGQVSFDDFS GGLKMTAREV MDIDEAREKY
ARGLAISLTD RQIDDQLLNR LRQSLEPHRS GTIPVHLYYQ RADARARLRF GATWRVSPSD
RLLNDLRGLI GSEQVELEFD