Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0177 |
Symbol | dnaE |
ID | 6271122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 192003 |
End bp | 195485 |
Gene Length | 3483 bp |
Protein Length | 1160 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641724428 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001878986 |
Protein GI | 187733784 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC CTGGCCAAAA CTGCACCGTT GGTAAAAAAG GCGGCGGCGT TGGGTATGCC AGCACTGGCG ATCACCGATT TCACCAACCT TTGTGGTCTG GTGAAGTTCT ACGGAGCGGG ACATGGCGCA GGGATTAAGC CTATCGTCGG GGCAGATTTT AACGTCCAGT GCGACCTGCT GGGTGATGAG TTAACCCACC TGACGGTACT GGCGGCGAAC AATACCGGCT ATCAGAATCT GACGTTGCTG ATCTCAAAAG CGTATCAGCG CGGGTACGGT GCCGCTGGGC CGATCATCGA TCGCGACTGG CTTATCGAAT TAAATGAAGG GTTGATCCTT CTTTCCGGCG GGCGCATGGG CGACGTCGGA CGCAGTCTTT TGCGTGGTAA CAGCGCGCTG GTAGGTGAGT GTGTCGCGTT TTATGAAGAA CACTTCCCGG ATCGCTATTT TCTCGAGCTG ATCCGCACCG GCAGGCTGGA TGAAGAAAGC TATCTGCACG CGGCGGTGGA ACTGGCGGAA GCGCGCGGTT TGCCCGTCGT GGCGACTAAC GATGTGCGCT TTATCGACAG CAGCGACTTT GACGCACACG AAATCCGCGT CGCGATCCAC GACGGCTTTA CCCTCGACGA TCCTAAACGC CCGCGTAACT ATTCGCCGCA GCAATATATG CGTAGCGAAG AGGAGATGTG TGAGCTGTTT GCCGACATCC CCGAAGCCCT TGCCAACACC GTTGAGATCG CCAAACGCTG TAACGTAACT GTGCGTCTTG GTGAATACTT CCTGCCGCAG TTCCCGACCG GGGACATGAG CACCGAAGAT TATCTGGTCA AGCGTGCAAA AGAGGGCCTG GAAGAGCGTC TGGCCTTTTT ATTCCCTGAC GAGGAAGAAC GTGTTAAGCG CCGCCCGGAA TATGACGAAC GTCTGGAGAC TGAACTTCAG GTTATCAACC AGATGGGCTT CCCGGGCTAC TTCCTCATCG TTATGGAATT TATCCAGTGG TCGAAAGATA ACGGCGTACC GGTAGGGCCA GGCCGTGGCT CCGGTGCGGG TTCACTGGTG GCCTACGCGC TGAAAATCAC CGACCTCGAT CCGCTGGAAT TTGACCTGCT GTTCGAACGT TTCCTTAACC CGGAACGTGT CTCCATGCCT GACTTCGACG TTGACTTCTG TATGGAGAAA CGCGATCAGG TTATCGAGCA CGTAGCGGAC ATGTACGGTC GTGATGCGGT ATCGCAGATC ATCACCTTCG GTACAATGGC GGCGAAAGCG GTGATCCGCG ACGTAGGCCG CGTGCTGGGG CATCCGTACG GCTTTGTCGA TCGTATCTCG AAACTGATCC CGCCCGATCC GGGGATGACG CTGGCGAAAG CGTTTGAAGC CGAGCCGCAG CTGCCGGAAA TCTACGAAGC GGATGAAGAA GTTAAGGCGC TGATCGACAT GGCGCGCAAA CTGGAAGGGG TCACCCGTAA CGCCGGTAAG CACGCCGGTG GGGTGGTTAT CGCGCCGACC AAAATTACCG ATTTTGCGCC GCTTTACTGC GATGAAGAGG GCAAACATCC GGTCACCCAG TTTGATAAAA GCGACGTTGA ATACGCCGGA CTGGTGAAGT TCGACTTCCT TGGTTTGCGT ACGCTCACCA TCATCAACTG GGCGCTGGAG ATGATCAACA AGCGGCGGGC GAAGAATGGC GAGCCGCCGC TGGATATCGC CGCGATCCCG CTGGATGACA AGAAAAGCTT CGACATGCTG CAACGCTCGG AAACCACGGC GGTATTCCAG CTTGAATCGC GCGGCATGAA GGACCTGATC AAGCGTCTGC AACCTGACTG CTTCGAAGAT ATGATCGCAC TGGTGGCGCT GTTCCGCCCA GGTCCGTTGC AATCAGGGAT GGTGGATAAC TTTATCGACC GTAAACATGG TCGCGAAGAA ATCTCCTATC CGGACGTACA GTGGCAGCAT GAAAGCCTGA AACCGGTACT GGAGCCAACC TACGGCATCA TCCTGTATCA GGAACAGGTC ATGCAGATTG CGCAGGTGCT TTCTGGTTAT ACCCTCGGTG GCGCGGATAT GCTGCGTCGT GCGATGGGTA AGAAAAAGCC GGAAGAGATG GCTAAGCAGC GTTCTGTATT TGCTGAAGGT GCAGAAAAGA ACGGAATCAA CGCCGAACTG GCGATGAAAA TCTTCGACCT GGTGGAGAAA TTCGCGGGTT ACGGATTTAA CAAATCGCAC TCTGCGGCCT ATGCTTTGGT GTCATATCAA ACGTTATGGC TGAAAGCGCA CTATCCGGCG GAGTTTATGG CGGCGGTAAT GACCGCCGAT ATGGACAACA CCGAGAAGGT GGTGGGCCTG GTGGATGAGT GCTGGCGGAT GGGGCTGAAA ATCCTGCCAC CAGATATAAA CTCCGGTCTT TACCATTTCC ACGTCAACGA CGACGGCGAA ATCGTGTATG GTATTGGCGC CATCAAAGGG GTAGGTGAAG GTCCGATTGA GGCCATCATC GAAGCCCGTA ATAAAGGCGG CTACTTCCGC GAACTGTTTG ATCTCTGCGC CCGTACCGAC ACCAAAAAGT TAAACCGGCG GGTGCTGGAA AAACTGATCA TGTCCGGGGC GTTTGACCGT CTTGGGCCAC ACCGCGCGGC GCTGATGAAC TCGCTGGGTG ATGCGTTAAA AGCGGCAGAT CAACACGCGA AAGCGGAAGC TATCGGTCAG GCCGATATGT TCGGCGTGCT GGCCGAAGAG CCGGAACAAA TTGAACAATC CTACGCCAGC TGCCAACCGT GGCCGGAGCA GGTGGTGTTA GATGGGGAAC GTGAAACGTT AGGTCTGTAC CTTACGGGAC ACCCTATCAA CCAGTATTTA AAAGAGATTG AGCGTTATGT CGGAGGCGTA AGGCTGAAAG ACATGCACCC GACAGAACGT GGTAAAGTCA TCACGGCTGC GGGGCTCGTT GTTGCCGCGC GGGTTATGGT CACCAAGCGC GGCAATCGTA TCGGTATCTG CACGCTGGAT GACCGTTCCG GGCGGCTGGA AGTGATGTTG TTTACTGACG CCCTGGATAA ATACCAGCAA TTGCTGGAAA AAGACCGCAT ACTTATCGTC AGCGGACAGG TCAGCTTTGA TGACTTCAGC GGTGGGCTTA AAATGACCGC TCGCGAAGTG ATGGATATTG ACGAAGCCCG GGAAAAATAT GCTAGCGGGC TTGCTATCTC GCTGACGGAC AGGCAAATTG ATGACCAGCT TTTAAACCGA CTCCGTCAGT CTCTGGAACC CCACCGCTCT GGGACAATTC CAGTACATCT CTACTATCAG AGGGCGGATG CACGCGCGCG GTTGCGTTTT GGCGCGACGT GGCGTGTCTC TCCGAGCGAT CGTTTATTAA ACGATCTCCG TGGCCTCATT GGTTCGGAGC AGGTGGAACT GGAGTTTGAC TAA
|
Protein sequence | MSEPRFVHLR VHSDYSMIDG LAKTAPLVKK AAALGMPALA ITDFTNLCGL VKFYGAGHGA GIKPIVGADF NVQCDLLGDE LTHLTVLAAN NTGYQNLTLL ISKAYQRGYG AAGPIIDRDW LIELNEGLIL LSGGRMGDVG RSLLRGNSAL VGECVAFYEE HFPDRYFLEL IRTGRLDEES YLHAAVELAE ARGLPVVATN DVRFIDSSDF DAHEIRVAIH DGFTLDDPKR PRNYSPQQYM RSEEEMCELF ADIPEALANT VEIAKRCNVT VRLGEYFLPQ FPTGDMSTED YLVKRAKEGL EERLAFLFPD EEERVKRRPE YDERLETELQ VINQMGFPGY FLIVMEFIQW SKDNGVPVGP GRGSGAGSLV AYALKITDLD PLEFDLLFER FLNPERVSMP DFDVDFCMEK RDQVIEHVAD MYGRDAVSQI ITFGTMAAKA VIRDVGRVLG HPYGFVDRIS KLIPPDPGMT LAKAFEAEPQ LPEIYEADEE VKALIDMARK LEGVTRNAGK HAGGVVIAPT KITDFAPLYC DEEGKHPVTQ FDKSDVEYAG LVKFDFLGLR TLTIINWALE MINKRRAKNG EPPLDIAAIP LDDKKSFDML QRSETTAVFQ LESRGMKDLI KRLQPDCFED MIALVALFRP GPLQSGMVDN FIDRKHGREE ISYPDVQWQH ESLKPVLEPT YGIILYQEQV MQIAQVLSGY TLGGADMLRR AMGKKKPEEM AKQRSVFAEG AEKNGINAEL AMKIFDLVEK FAGYGFNKSH SAAYALVSYQ TLWLKAHYPA EFMAAVMTAD MDNTEKVVGL VDECWRMGLK ILPPDINSGL YHFHVNDDGE IVYGIGAIKG VGEGPIEAII EARNKGGYFR ELFDLCARTD TKKLNRRVLE KLIMSGAFDR LGPHRAALMN SLGDALKAAD QHAKAEAIGQ ADMFGVLAEE PEQIEQSYAS CQPWPEQVVL DGERETLGLY LTGHPINQYL KEIERYVGGV RLKDMHPTER GKVITAAGLV VAARVMVTKR GNRIGICTLD DRSGRLEVML FTDALDKYQQ LLEKDRILIV SGQVSFDDFS GGLKMTAREV MDIDEAREKY ASGLAISLTD RQIDDQLLNR LRQSLEPHRS GTIPVHLYYQ RADARARLRF GATWRVSPSD RLLNDLRGLI GSEQVELEFD
|
| |