Gene ECH74115_0472 details


Gene Information

Locus tag: ECH74115_0472
Symbol: sbcC
ID: 6968192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 478176
End bp: 481319
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 54%
IMG OID: 643384520
Product: exonuclease subunit SbcC
Protein accession: YP_002269034
Protein GI: 209396784
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00618] exonuclease SbcC
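
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 478176
end_bp = 481319
gene_length_bp = 3144
protein_length_aa = 1047

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 3144 1048 1047
```

Under those assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.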


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.227308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT 
GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CCATTACCGG CCCGACCGGT
GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT
CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC TGAATGTCTG
GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTATCGCG CGTTCTGGAG CCAGAATCGG
GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC
GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC
GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC
TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA
ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG
GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCGGAACA AGTGCAATCG
CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTACTTAC CGCGCAGCAG
CAAGAACAAC AATCACTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGCCAGC
CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG
GCGGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CGCACTGGGA ACGCATCGCA
GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA
CAGAACACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA
TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG
TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG
CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG
GCGATCACGT TGATGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG
CAACGCCCAC TGCGTCAGCG CCTGGTCGCG CTGCATGGGC AGATTGTTCC CCAACAAAAA
CGTCTGGCGC AGTTAATGGT CACTATCCAG AATGTCACTC TGGAACAGAC GCAACGTAAT
GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG
AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA
CAGGCGGGTC AGCCTTGTCC ACTTTGTGGT TCCACCAGCC ATCCGGCGGT CGAGGCGTAT
CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA
AAGCTCGGCG AAGAAGGTGC GGCGCTGCGC GGGCAACTGG ATGCATTAAC GAAGCAGCTT
CAGCGCGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA
TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACAGGACGA TATTCAACCG
TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGACATGAA
TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA
CAACGCCAGC AGCAGCTCTT AACGGCATTG GCGGGTTATG CACTGACATT GCCACAGGAA
GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC
CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG
TTGCCGCAAA GTGATGATCT CCCGCACAGC GAAGAAACTG TGGCGCTGGA TAACTGGCGG
CAGGTTCATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT
GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC
AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG
CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC
ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGG GTTGGCTCTC
ACTGTGACGG TGGAGCAGAT TCAGCAAGAG TTAGCGCAAA CTCACCAAAA GTTGCGTGAA
AACACCACGA GTCAAGGCGA GATTCGCCAG CAGCTGAAGC AGGATGCAGA TAACCGTCAG
CAACAACAAA CCTTACTGCA GCAAATTGCT CAAATGACGC AGCAGGTTGA GGACTGGGGA
TATCTGAATT CGCTAATAGG TTCCAAAGAG GGCGATAAAT TCCGCAAGTT TGCCCAGGGG
CTGACGCTGG ATAATTTAGT CCATCTCGCT AATCAGCAAC TTACCCGGCT GCACGGGCGC
TATCTGTTAC AGCGCAAAGC CAGCGAGGCG CTGGAAGTCG AGGTTGTTGA TACCTGGCAG
GCAGATGCGG TACGCGATAC CCGTACCCTT TCCGGCGGCG AAAGTTTCCT CGTTAGTCTG
GCGCTGGCGC TGGCGCTTTC GGATCTGGTC AGCCATAAAA CACGTATTGA CTCGCTGTTC
CTTGATGAAG GTTTTGGCAC GCTGGATAGC GAAACGCTGG ATACCGCCCT TGATGCGCTG
GATGCCCTGA ACGCCAGTGG CAAAACCATC GGTGTGATTA GCCACGTTGA AGCGATGAAA
GAGCGTATTC CTGTGCAGAT CAAAGTGAAG AAGATCAACG GCCTGGGCTA CAGCAAACTG
GAAAGTACGT TTGCAGTGAA ATAA
 
Protein sequence
MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA
DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE
IYGQISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLLTAQQ
QEQQSLNWLT RLDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA
EHSAALAHTR QQIEEVNTRL QNTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ
WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLMLTADE VATALAQHAE
QRPLRQRLVA LHGQIVPQQK RLAQLMVTIQ NVTLEQTQRN AALNEMRQRY KEKTQQLADV
KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK
KLGEEGAALR GQLDALTKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPQDDIQP
WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQQLLTAL AGYALTLPQE
DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDDLPHS EETVALDNWR
QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT
QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDGLAL TVTVEQIQQE LAQTHQKLRE
NTTSQGEIRQ QLKQDADNRQ QQQTLLQQIA QMTQQVEDWG YLNSLIGSKE GDKFRKFAQG
LTLDNLVHLA NQQLTRLHGR YLLQRKASEA LEVEVVDTWQ ADAVRDTRTL SGGESFLVSL
ALALALSDLV SHKTRIDSLF LDEGFGTLDS ETLDTALDAL DALNASGKTI GVISHVEAMK
ERIPVQIKVK KINGLGYSKL ESTFAVK
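
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few lines of the DNA. This is a minimal sketch, not the tool that produced the record: `translate` and `CODON_TABLE` are hypothetical names, and it assumes the amino acid assignments of translation table 11 are the same as the standard genetic code (table 11 differs in which codons may act as start codons). The two input strings are the first and last lines of the gene sequence, copied with their 10-base grouping.

```python
from itertools import product

# Codon table built in TCAG order; amino acids listed in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(grouped_dna: str) -> str:
    """Translate DNA written in space-separated blocks, stopping at a stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

first_line = "ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT"
last_line = "GAAAGTACGT TTGCAGTGAA ATAA"

print(translate(first_line))  # MKILSLRLKNLNSLKGEWKI
print(translate(last_line))   # ESTFAVK
```

The last line can be translated on its own because the 3120 bases that precede it are a multiple of three, so it begins in frame.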