Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1049 |
Symbol | sbcB |
ID | 6147212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1068476 |
End bp | 1069903 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615936 |
Product | exonuclease I |
Protein accession | YP_001743128 |
Protein GI | 170682338 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2925] Exonuclease I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.106186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGA ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACTGA TAACGAATTC AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CGGATGACTA TTTACCCCAG CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG GGCTACAACA ATGTGCGTTT CGACGATGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CGAAACTGGT AAAAACGCGT CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG ATTGATGTTC CGCAGATGAA ACCCCTGGTT CACGTTTCCG GAATGTTTGG GGCATGGCGT GGCAATACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT ATGGTGGATT TGGCAGGAGA CATTTCACCA TTACTGGAGC TGGATAGCGA CACATTGCGC GAGCGTTTAT ATACCGCAAA AGCCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG GTGCATATCA ATAAATGTCC GGTACTGGCC CAGGCGAATA CGCTACGCCC GGAAGACGCT GAGCGTTTAG GGATTGCTCG TCAACACTGC CTGGATAACC TGAAAGTGTT GCGCGAAAAC CCGCAAGTAC GCGACAAAGT TGTGGCAATT TTTGCAGAAG CTGAACCGTT TACACCATCA GATAATGTAG ATGCACAGCT TTATAATGGC TTTTTTAGCG ACGCCGATCG TGCAGCAATG AAAATCGTGC TGGAGACTGA ACCGCGTAAT TTACCGGCTC TGGATATCAC CTTTGTTGAT AAGCGTATCG AGAAACTCCT GTTCAACTAC CGCGCCCGCA ACTTCCCCGG CACGTTAGAT GACAACGAGC AGCAACGCTG GCTTGAGCAC CGCCGCCAGG TCTTCACACC CGAGTTTTTA CAAGGCTATG CAGAAGAGAT TCAGATGCTG GCGCAGCTGT ATGCCGACGA TAAAGAAAAA GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TTGTCTAA
|
Protein sequence | MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDNEF NVIGEPEVFY CKPADDYLPQ PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNVFYRNF YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKADL GDNAAVPVKL VHINKCPVLA QANTLRPEDA ERLGIARQHC LDNLKVLREN PQVRDKVVAI FAEAEPFTPS DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD DNEQQRWLEH RRQVFTPEFL QGYAEEIQML AQLYADDKEK VALLKALWQY AEEIV
|
| |