Gene Information |
Locus tag | ECH74115_0472 |
Symbol | sbcC |
ID | 6968192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 478176 |
End bp | 481319 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384520 |
Product | exonuclease subunit SbcC |
Protein accession | YP_002269034 |
Protein GI | 209396784 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
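
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the gene length includes the stop codon, which is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(478176, 481319)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Under these assumptions the computed values agree with the Gene Length (3144 bp) and Protein Length (1047 aa) fields in the record.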
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.227308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CCATTACCGG CCCGACCGGT GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC TGAATGTCTG GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTATCGCG CGTTCTGGAG CCAGAATCGG GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCGGAACA AGTGCAATCG CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTACTTAC CGCGCAGCAG CAAGAACAAC AATCACTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGCCAGC CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG GCGGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CGCACTGGGA ACGCATCGCA GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA CAGAACACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG GCGATCACGT TGATGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG CAACGCCCAC TGCGTCAGCG CCTGGTCGCG CTGCATGGGC AGATTGTTCC CCAACAAAAA CGTCTGGCGC AGTTAATGGT CACTATCCAG AATGTCACTC TGGAACAGAC GCAACGTAAT GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA CAGGCGGGTC AGCCTTGTCC ACTTTGTGGT TCCACCAGCC ATCCGGCGGT CGAGGCGTAT CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA AAGCTCGGCG AAGAAGGTGC GGCGCTGCGC GGGCAACTGG ATGCATTAAC GAAGCAGCTT CAGCGCGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACAGGACGA TATTCAACCG 
TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGACATGAA TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA CAACGCCAGC AGCAGCTCTT AACGGCATTG GCGGGTTATG CACTGACATT GCCACAGGAA GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG TTGCCGCAAA GTGATGATCT CCCGCACAGC GAAGAAACTG TGGCGCTGGA TAACTGGCGG CAGGTTCATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGG GTTGGCTCTC ACTGTGACGG TGGAGCAGAT TCAGCAAGAG TTAGCGCAAA CTCACCAAAA GTTGCGTGAA AACACCACGA GTCAAGGCGA GATTCGCCAG CAGCTGAAGC AGGATGCAGA TAACCGTCAG CAACAACAAA CCTTACTGCA GCAAATTGCT CAAATGACGC AGCAGGTTGA GGACTGGGGA TATCTGAATT CGCTAATAGG TTCCAAAGAG GGCGATAAAT TCCGCAAGTT TGCCCAGGGG CTGACGCTGG ATAATTTAGT CCATCTCGCT AATCAGCAAC TTACCCGGCT GCACGGGCGC TATCTGTTAC AGCGCAAAGC CAGCGAGGCG CTGGAAGTCG AGGTTGTTGA TACCTGGCAG GCAGATGCGG TACGCGATAC CCGTACCCTT TCCGGCGGCG AAAGTTTCCT CGTTAGTCTG GCGCTGGCGC TGGCGCTTTC GGATCTGGTC AGCCATAAAA CACGTATTGA CTCGCTGTTC CTTGATGAAG GTTTTGGCAC GCTGGATAGC GAAACGCTGG ATACCGCCCT TGATGCGCTG GATGCCCTGA ACGCCAGTGG CAAAACCATC GGTGTGATTA GCCACGTTGA AGCGATGAAA GAGCGTATTC CTGTGCAGAT CAAAGTGAAG AAGATCAACG GCCTGGGCTA CAGCAAACTG GAAAGTACGT TTGCAGTGAA ATAA
|
Protein sequence | MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE IYGQISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLLTAQQ QEQQSLNWLT RLDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA EHSAALAHTR QQIEEVNTRL QNTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLMLTADE VATALAQHAE QRPLRQRLVA LHGQIVPQQK RLAQLMVTIQ NVTLEQTQRN AALNEMRQRY KEKTQQLADV KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK KLGEEGAALR GQLDALTKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPQDDIQP WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQQLLTAL AGYALTLPQE DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDDLPHS EETVALDNWR QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDGLAL TVTVEQIQQE LAQTHQKLRE NTTSQGEIRQ QLKQDADNRQ QQQTLLQQIA QMTQQVEDWG YLNSLIGSKE GDKFRKFAQG LTLDNLVHLA NQQLTRLHGR YLLQRKASEA LEVEVVDTWQ ADAVRDTRTL SGGESFLVSL ALALALSDLV SHKTRIDSLF LDEGFGTLDS ETLDTALDAL DALNASGKTI GVISHVEAMK ERIPVQIKVK KINGLGYSKL ESTFAVK
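
The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be stripped before any length or composition check. The following is a minimal sketch of that clean-up and of a GC-content calculation. The helper names `clean_sequence` and `gc_content` are hypothetical, and the demo uses only the first 30 bases of the gene sequence, so its result is not expected to match the 54% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing (spaces, line breaks) and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence shown above.
fragment = "ATGAAAATTC TCAGCCTGCG CCTGAAAAAC"
print(len(clean_sequence(fragment)))  # number of bases after removing spacing
print(f"{gc_content(fragment):.1%}")  # GC content of this fragment only
```

Pasting the complete gene sequence into `gc_content` in place of `fragment` would give the value to compare against the GC content field.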