Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1221 |
Symbol | sbcB |
ID | 6268606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1127192 |
End bp | 1128619 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641725351 |
Product | exonuclease I |
Protein accession | YP_001879865 |
Protein GI | 187733588 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2925] Exonuclease I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.171451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGC ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACCGA TAGCGAATTC AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CTGATGACTA TTTACCCCAG CCAGGAGCCG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG GGCTACAACA ATGTGCGTTT CGACGACGAA GTCACACGCA ACATTTTTTA TCGTAATTTC TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CGAAACTGGT AAAAACGCGT CAGCCACGTC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG ATTGATGTTC CGCAGATGAA ACCCCTGGTT CACGTTTCCG GAATGTTTGG GGCATGGCGC GGCAATACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCAG AAAATCGCAA TGCCGTAATT ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC GAGCGTTTAT ATACTGCAAA AGCCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG GTGCATATCA ATAAATGTCC GGTGCTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TGAAAATTCT GCGTGAAAAT CCGCAAGTGC GCGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA GATAACGTGG ATGCACAGCT TTATAACGGC TTTTTCAGTG ACGCAGATCG TGCAGCAATG AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTTGAT AAACGGATTG AAAAGCTGTT GTTCAATTAT CGGGCACGCA ACTTCCCGGG GACGCTGGAT TATGCCGAGC AGCAACGCTG GCTGGAGCAC CGCCGCCAGG TATTCACGCC AGAGTTTTTG CAGGGTTATG CTGATGAATT GCAGATGCTG GTACAACAAT ATGCCGATGA CAAAGAGAAA GTGGCGCTGT TAAAAGCACT TTGGCAGTAC GCGGAAGAGA TCGTCTAA
|
Protein sequence | MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDSEF NVIGEPEVFY CKPADDYLPQ PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNIFYRNF YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKADL GDNAAVPVKL VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD YAEQQRWLEH RRQVFTPEFL QGYADELQML VQQYADDKEK VALLKALWQY AEEIV
|
| |