Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0426 |
Symbol | sbcC |
ID | 6147325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 434885 |
End bp | 438028 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615322 |
Product | exonuclease subunit SbcC |
Protein accession | YP_001742529 |
Protein GI | 170682477 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.442969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CTATTACCGG CCCAACCGGT GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACCCCGCGC CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACTC GCGATACCGC CGAATGTCTG GCGGAAGTGG AGTTTGAAGT CAAAGGTGAA GCGTATCGCG CGTTCTGGAG CCAGAATCGG GCGCGTAACC AGCCGGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCA GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACGGC GACGTTAACC GGACTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA ATCTACGGGA AAATCTCTGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCAGAACA AGTGCAATCG CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTAATTAC CGCGCAGCAG CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGGCAGC CGCCGTCAAC AGGCCTTGCA ACAGGCATTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG GCGGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CACACTGGGA ACGCATCGCA GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA CAGAGCACAA TGGCGCTTCG GGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA TTACAGCAGC AGCAACAAAG TCTGAATGCC TGGTTACAGG AACACGACCG CTTCCGTCAG TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCAGAGC AAAAACTTAA TGCGCTTGCG GCGATTACGT TGACGTTAAC CGCCGATGAA GTTGCTAGCG CCCTGGCGCA ACATGCCGAG CAACGCCCAC TGCGTCAGCG CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACTC TGGAACAGAC GCAACGTAAC GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA TGCAGCAACT TGCCGATGTC AAAACCATTT GCGAGCAGGA AGCACGCATC AAAACGCTGG AAGCCCAGCG CGCGCAGTTA CAGGCGGGTC AGCCTTGTCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT CAGGCGCTGG AGCCTGGCGT TAATCAGGCC CGGCTATTAA CGCTGGAAAA AGAAGTGAAA AAGCTCGGCG AAGAAGGTGC GGCGCTACGT GGGCAACTGG ATGCATTAAC GAAGCAGCTT CAGCGCGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCCTGCAGC CACAGGACGA TATTCAACCG TGGCTAGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA CAACGCCAGC AGCAACTCTT AACGGCATTG GCGGGTTATG CGCTGACATT GCCACAGGAA GATGAAGAAG AGAGCTGGTT GGCGGCGCGT CAGCAAGAAG CGCAGCGCTG GCAGCAACGC CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAAC TGACGCCGAT TCTGGAAACG TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTACTGGA AAACTGGCGG CAGGTACATG AACAATGTCT CGCACTACAC AGCCAGCAAC AGACGTTACA GCAACAGGAC GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCG AGCGTCTTTG ACGATCAACA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC ACTCAGACAG CAGAAGCGCT GGCACAGCAT CAGCAACACC GACCTGACGG CCTGGATCTC AGTGTGACGG TGGAGCAGAT TCAGCAAGAG TTAGCGCAAA CTCAGCAAAA GTTGCGTGAA AATACCACCA GTCAGGGCGA GATTCGCCAG CAGCTGAAGC AGGATGCAGA CAACCGTCAG CAACAACAAA CCTTAATGCA GCAAATTGCT CAAATGACGC AGCAGGTTGA GGACTGGGGA TATCTGAATT CGCTAATAGG TTCCAAAGAG GGCGATAAAT TCCGCAAGTT TGCCCAGGGG CTGACGCTGG ATAATTTAGT CCATCTCGCT AATCAGCAAC TTACCCGGCT GCACGGGCGC TATCTGTTAC AGCGCAAAGC CAGCGAGGCG CTGGAAGTCG AGGTTGTTGA TACCTGGCAG GCAGATGCGG TACGCGATAC CCGTACCCTT TCCGGCGGCG AAAGTTTCCT CGTCAGTCTG GCGCTGGCGC TGGCGCTTTC GGATCTGGTC AGCCATAAAA CGCGTATTGA CTCGCTGTTC CTTGATGAAG GTTTTGGCAC GCTGGATAGC GAAACGCTGG ATACCGCCCT TGATGCGCTG GATGCCCTGA ATGCCAGTGG CAAAACCATC GGTGTGATTA GCCACGTTGA AGCGATGAAA GAGCGTATTC CGGTGCAGAT CAAAGTGAAA AAGATCAACG GCCTGGGCTA CAGCAAACTG GAAAGTGCGT TTGCAGTGAA ATAA
|
Protein sequence | MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE IYGKISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLITAQQ QEQQSLNWLT RLDELQQEGS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA EHSAALAHTR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNA WLQEHDRFRQ WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VASALAQHAE QRPLRQRLVA LHGQIVPQQK RLAQLQVAIQ NVTLEQTQRN AALNEMRQRY KEKMQQLADV KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQA RLLTLEKEVK KLGEEGAALR GQLDALTKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPQDDIQP WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQQLLTAL AGYALTLPQE DEEESWLAAR QQEAQRWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT QLEQLKQNLE NQRRQAQTLV TQTAEALAQH QQHRPDGLDL SVTVEQIQQE LAQTQQKLRE NTTSQGEIRQ QLKQDADNRQ QQQTLMQQIA QMTQQVEDWG YLNSLIGSKE GDKFRKFAQG LTLDNLVHLA NQQLTRLHGR YLLQRKASEA LEVEVVDTWQ ADAVRDTRTL SGGESFLVSL ALALALSDLV SHKTRIDSLF LDEGFGTLDS ETLDTALDAL DALNASGKTI GVISHVEAMK ERIPVQIKVK KINGLGYSKL ESAFAVK
|
| |