Gene EcSMS35_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0426 
SymbolsbcC 
ID6147325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp434885 
End bp438028 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content54% 
IMG OID641615322 
Productexonuclease subunit SbcC 
Protein accessionYP_001742529 
Protein GI170682477 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.442969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT 
GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CTATTACCGG CCCAACCGGT
GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACCCCGCGC
CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACTC GCGATACCGC CGAATGTCTG
GCGGAAGTGG AGTTTGAAGT CAAAGGTGAA GCGTATCGCG CGTTCTGGAG CCAGAATCGG
GCGCGTAACC AGCCGGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCA
GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACGGC GACGTTAACC
GGACTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC
TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA
ATCTACGGGA AAATCTCTGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG
GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC GCGTTGCTCA CGCCAGAACA AGTGCAATCG
CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTAATTAC CGCGCAGCAG
CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCTGGACG AATTGCAGCA AGAAGGCAGC
CGCCGTCAAC AGGCCTTGCA ACAGGCATTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG
GCGGCGCTCA GTCTGGCACA ACCGGCACGA AATCTTCGTC CACACTGGGA ACGCATCGCA
GAACACAGCG CGGCGCTGGC GCATACTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA
CAGAGCACAA TGGCGCTTCG GGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA
TTACAGCAGC AGCAACAAAG TCTGAATGCC TGGTTACAGG AACACGACCG CTTCCGTCAG
TGGAACAACG AACTGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG
CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCAGAGC AAAAACTTAA TGCGCTTGCG
GCGATTACGT TGACGTTAAC CGCCGATGAA GTTGCTAGCG CCCTGGCGCA ACATGCCGAG
CAACGCCCAC TGCGTCAGCG CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA
CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACTC TGGAACAGAC GCAACGTAAC
GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA TGCAGCAACT TGCCGATGTC
AAAACCATTT GCGAGCAGGA AGCACGCATC AAAACGCTGG AAGCCCAGCG CGCGCAGTTA
CAGGCGGGTC AGCCTTGTCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT
CAGGCGCTGG AGCCTGGCGT TAATCAGGCC CGGCTATTAA CGCTGGAAAA AGAAGTGAAA
AAGCTCGGCG AAGAAGGTGC GGCGCTACGT GGGCAACTGG ATGCATTAAC GAAGCAGCTT
CAGCGCGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA
TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCCTGCAGC CACAGGACGA TATTCAACCG
TGGCTAGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA
TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA
CAACGCCAGC AGCAACTCTT AACGGCATTG GCGGGTTATG CGCTGACATT GCCACAGGAA
GATGAAGAAG AGAGCTGGTT GGCGGCGCGT CAGCAAGAAG CGCAGCGCTG GCAGCAACGC
CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAAC TGACGCCGAT TCTGGAAACG
TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTACTGGA AAACTGGCGG
CAGGTACATG AACAATGTCT CGCACTACAC AGCCAGCAAC AGACGTTACA GCAACAGGAC
GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCG
AGCGTCTTTG ACGATCAACA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG
CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACGCTGGTC
ACTCAGACAG CAGAAGCGCT GGCACAGCAT CAGCAACACC GACCTGACGG CCTGGATCTC
AGTGTGACGG TGGAGCAGAT TCAGCAAGAG TTAGCGCAAA CTCAGCAAAA GTTGCGTGAA
AATACCACCA GTCAGGGCGA GATTCGCCAG CAGCTGAAGC AGGATGCAGA CAACCGTCAG
CAACAACAAA CCTTAATGCA GCAAATTGCT CAAATGACGC AGCAGGTTGA GGACTGGGGA
TATCTGAATT CGCTAATAGG TTCCAAAGAG GGCGATAAAT TCCGCAAGTT TGCCCAGGGG
CTGACGCTGG ATAATTTAGT CCATCTCGCT AATCAGCAAC TTACCCGGCT GCACGGGCGC
TATCTGTTAC AGCGCAAAGC CAGCGAGGCG CTGGAAGTCG AGGTTGTTGA TACCTGGCAG
GCAGATGCGG TACGCGATAC CCGTACCCTT TCCGGCGGCG AAAGTTTCCT CGTCAGTCTG
GCGCTGGCGC TGGCGCTTTC GGATCTGGTC AGCCATAAAA CGCGTATTGA CTCGCTGTTC
CTTGATGAAG GTTTTGGCAC GCTGGATAGC GAAACGCTGG ATACCGCCCT TGATGCGCTG
GATGCCCTGA ATGCCAGTGG CAAAACCATC GGTGTGATTA GCCACGTTGA AGCGATGAAA
GAGCGTATTC CGGTGCAGAT CAAAGTGAAA AAGATCAACG GCCTGGGCTA CAGCAAACTG
GAAAGTGCGT TTGCAGTGAA ATAA
 
Protein sequence
MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA
DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE
IYGKISAMVF EQHKSARTEL EKLQAQASGV ALLTPEQVQS LTASLQVLTD EEKQLITAQQ
QEQQSLNWLT RLDELQQEGS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA
EHSAALAHTR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNA WLQEHDRFRQ
WNNELAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VASALAQHAE
QRPLRQRLVA LHGQIVPQQK RLAQLQVAIQ NVTLEQTQRN AALNEMRQRY KEKMQQLADV
KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQA RLLTLEKEVK
KLGEEGAALR GQLDALTKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPQDDIQP
WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQQLLTAL AGYALTLPQE
DEEESWLAAR QQEAQRWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR
QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT
QLEQLKQNLE NQRRQAQTLV TQTAEALAQH QQHRPDGLDL SVTVEQIQQE LAQTQQKLRE
NTTSQGEIRQ QLKQDADNRQ QQQTLMQQIA QMTQQVEDWG YLNSLIGSKE GDKFRKFAQG
LTLDNLVHLA NQQLTRLHGR YLLQRKASEA LEVEVVDTWQ ADAVRDTRTL SGGESFLVSL
ALALALSDLV SHKTRIDSLF LDEGFGTLDS ETLDTALDAL DALNASGKTI GVISHVEAMK
ERIPVQIKVK KINGLGYSKL ESAFAVK