Gene B21_00349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00349 
SymbolsbcC 
ID8113519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp377910 
End bp381056 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content54% 
IMG OID644846633 
Producthypothetical protein 
Protein accessionYP_002998206 
Protein GI251783902 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT 
GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CCATTACCGG CCCAACAGGT
GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT
CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC CGAATGTCTG
GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTACCGTG CATTCTGGAG CCAGAATCGG
GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC
GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC
GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC
TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA
ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACTGAGCTG
GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCGGAACA AGTGCAATCG
CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTACTTAC CGCGCAGCAG
CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGCCAGC
CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG
GCAGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CGCACTGGGA ACGCATCGCA
GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA
CAGAGCACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA
TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG
TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG
CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG
GCGATCACGT TGACGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG
CAACGCCCAC TGCGTCAGCA CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA
CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACGC AAGAACAGAC GCAACGTAAC
GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG
AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA
CAGGCGGGTC AGCCTTGCCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT
CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA
AAGCTCGGTG AAGAAGGTGC GACGCTACGT GGGCAACTGG ACGCCATAAC AAAGCAGCTT
CAGCGTGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA
TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACTGGACGA TATTCAACCG
TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA
TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA
CAACGCCAGC AACTACTTTT AACGACATTG ACGGGTTATG CACTGACATT GCCACAGGAA
GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC
CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG
TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTATTGGA AAACTGGCGG
CAGGTACATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT
GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC
AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG
CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC
ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGA CGGGTTGGCT
CTCACTGTGA CGGTGGAGCA GATTCAGCAA GAGTTAGCGC AAACTCACCA AAAGTTGCGT
GAAAACACCA CGAGTCAAGG CGAGATTCGC CAGCAGCTGA AGCAGGATGC AGATAACCGT
CAGCAACAAC AAACCTTAAT GCAGCAAATT GCTCAAATGA CGCAGCAGGT TGAGGACTGG
GGATATCTGA ATTCGCTAAT AGGTTCCAAA GAGGGCGATA AATTCCGCAA GTTTGCCCAG
GGGCTGACGC TGGATAATTT AGTCCATCTC GCTAATCAGC AACTTACCCG GCTGCACGGG
CGCTATCTGT TACAGCGCAA AGCCAGCGAG GCGCTGGAAG TCGAGGTTGT TGATACCTGG
CAGGCAGATG CGGTACGCGA TACCCGTACC CTTTCCGGCG GCGAAAGTTT CCTCGTTAGT
CTGGCGCTGG CGCTGGCGCT TTCGGATCTG GTCAGCCATA AAACACGTAT TGACTCGCTG
TTCCTTGATG AAGGTTTTGG CACGCTGGAT AGCGAAACGC TGGATACCGC CCTTGATGCG
CTGGATGCCC TGAACGCCAG TGGCAAAACC ATCGGTGTGA TTAGCCACGT AGAAGCGATG
AAAGAGCGTA TTCCGGTGCA GATCAAAGTG AAAAAGATCA ACGGCCTGGG CTACAGCAAA
CTGGAAAGTA CGTTTGCAGT GAAATAA
 
Protein sequence
MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA
DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE
IYGQISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLLTAQQ
QEQQSLNWLT RLDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA
EHSAALAHTR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ
WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VATALAQHAE
QRPLRQHLVA LHGQIVPQQK RLAQLQVAIQ NVTQEQTQRN AALNEMRQRY KEKTQQLADV
KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK
KLGEEGATLR GQLDAITKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPLDDIQP
WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQLLLTTL TGYALTLPQE
DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR
QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT
QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDDGLA LTVTVEQIQQ ELAQTHQKLR
ENTTSQGEIR QQLKQDADNR QQQQTLMQQI AQMTQQVEDW GYLNSLIGSK EGDKFRKFAQ
GLTLDNLVHL ANQQLTRLHG RYLLQRKASE ALEVEVVDTW QADAVRDTRT LSGGESFLVS
LALALALSDL VSHKTRIDSL FLDEGFGTLD SETLDTALDA LDALNASGKT IGVISHVEAM
KERIPVQIKV KKINGLGYSK LESTFAVK