Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1629 |
Symbol | sbcB |
ID | 6066380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1810748 |
End bp | 1812175 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641601044 |
Product | exonuclease I |
Protein accession | YP_001724614 |
Protein GI | 170019660 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2925] Exonuclease I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.186939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CTGATGACTA TTTACCCCAG CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG GGCTACAACA ATGTGCGTTT CGACGACGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CAAAGCTGGT AAAAACGCGT CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG ATTGATGTTC CGCAGATGAA ACCCCTGGTG CACGTTTCCG GAATGTTTGG GGCATGGCGC GGCAACACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC GAGCGTTTAT ATACCGCAAA AACCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG GTGCACATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT CCGCAAGTGC GCGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA GATAACGTGG ATGCACAGCT TTATAACGGC TTTTTCAGTG ACGCCGATCG TGCAGCAATG AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTCGAT AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCGCGCA ACTTCCCGGG GACGCTGGAT TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TATTCACACC CGAGTTTTTA CAAGGCTATG CAGAAGAGAT TCAGATGCTG GCGCAGCAGT ATGCCGACGA TAAAGAAAAA GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGGAAGAGA TTGTCTAG
|
Protein sequence | MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNVFYRNF YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKTDL GDNAAVPVKL VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD YAEQQRWLEH RRQVFTPEFL QGYAEEIQML AQQYADDKEK VALLKALWQY AEEIV
|
| |