Gene Information |
Locus tag | B21_00349 |
Symbol | sbcC |
ID | 8113519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 377910 |
End bp | 381056 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644846633 |
Product | hypothetical protein |
Protein accession | YP_002998206 |
Protein GI | 251783902 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
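The coordinate and length fields above are mutually consistent, and a short check makes that explicit. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 3147 bp) and that the protein length excludes a single terminal stop codon. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(377910, 381056)
print(length_bp)                  # 3147
print(protein_length(length_bp))  # 1048
```

The strand does not enter the calculation: for a gene on the minus strand the span between Start bp and End bp is the same, only the reading direction differs.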
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CCATTACCGG CCCAACAGGT GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC CGAATGTCTG GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTACCGTG CATTCTGGAG CCAGAATCGG GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACTGAGCTG GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCGGAACA AGTGCAATCG CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTACTTAC CGCGCAGCAG CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGCCAGC CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG GCAGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CGCACTGGGA ACGCATCGCA GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA CAGAGCACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG GCGATCACGT TGACGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG CAACGCCCAC TGCGTCAGCA CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACGC AAGAACAGAC GCAACGTAAC GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA CAGGCGGGTC AGCCTTGCCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA AAGCTCGGTG AAGAAGGTGC GACGCTACGT GGGCAACTGG ACGCCATAAC AAAGCAGCTT CAGCGTGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACTGGACGA TATTCAACCG 
TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA CAACGCCAGC AACTACTTTT AACGACATTG ACGGGTTATG CACTGACATT GCCACAGGAA GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTATTGGA AAACTGGCGG CAGGTACATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGA CGGGTTGGCT CTCACTGTGA CGGTGGAGCA GATTCAGCAA GAGTTAGCGC AAACTCACCA AAAGTTGCGT GAAAACACCA CGAGTCAAGG CGAGATTCGC CAGCAGCTGA AGCAGGATGC AGATAACCGT CAGCAACAAC AAACCTTAAT GCAGCAAATT GCTCAAATGA CGCAGCAGGT TGAGGACTGG GGATATCTGA ATTCGCTAAT AGGTTCCAAA GAGGGCGATA AATTCCGCAA GTTTGCCCAG GGGCTGACGC TGGATAATTT AGTCCATCTC GCTAATCAGC AACTTACCCG GCTGCACGGG CGCTATCTGT TACAGCGCAA AGCCAGCGAG GCGCTGGAAG TCGAGGTTGT TGATACCTGG CAGGCAGATG CGGTACGCGA TACCCGTACC CTTTCCGGCG GCGAAAGTTT CCTCGTTAGT CTGGCGCTGG CGCTGGCGCT TTCGGATCTG GTCAGCCATA AAACACGTAT TGACTCGCTG TTCCTTGATG AAGGTTTTGG CACGCTGGAT AGCGAAACGC TGGATACCGC CCTTGATGCG CTGGATGCCC TGAACGCCAG TGGCAAAACC ATCGGTGTGA TTAGCCACGT AGAAGCGATG AAAGAGCGTA TTCCGGTGCA GATCAAAGTG AAAAAGATCA ACGGCCTGGG CTACAGCAAA CTGGAAAGTA CGTTTGCAGT GAAATAA
|
Protein sequence | MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE IYGQISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLLTAQQ QEQQSLNWLT RLDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA EHSAALAHTR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VATALAQHAE QRPLRQHLVA LHGQIVPQQK RLAQLQVAIQ NVTQEQTQRN AALNEMRQRY KEKTQQLADV KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK KLGEEGATLR GQLDAITKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPLDDIQP WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQLLLTTL TGYALTLPQE DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDDGLA LTVTVEQIQQ ELAQTHQKLR ENTTSQGEIR QQLKQDADNR QQQQTLMQQI AQMTQQVEDW GYLNSLIGSK EGDKFRKFAQ GLTLDNLVHL ANQQLTRLHG RYLLQRKASE ALEVEVVDTW QADAVRDTRT LSGGESFLVS LALALALSDL VSHKTRIDSL FLDEGFGTLD SETLDTALDA LDALNASGKT IGVISHVEAM KERIPVQIKV KKINGLGYSK LESTFAVK
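The start of the gene sequence can be checked against the start of the protein sequence with a few lines of Python. This is a minimal sketch, not the pipeline that produced this record. It uses only the first 30 bases listed above and assumes the gene sequence is shown in the coding (5' to 3') orientation, which fits the fact that it opens with ATG and closes with TAA even though the gene lies on the minus strand. Translation table 11 shares its codon-to-amino-acid assignments with the standard code (the tables differ in the permitted start codons), so the compact table below serves for both. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments are
# the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence as listed in the record.
excerpt = clean("ATGAAAATTC TCAGCCTGCG CCTGAAAAAC")
print(translate(excerpt))  # MKILSLRLKN
```

The result matches the first ten residues of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` should give a value close to the listed GC content of 54%, though that figure has not been recomputed here.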
|
| |