Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01900 |
Symbol | sbcB |
ID | 8113412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1978313 |
End bp | 1979740 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848115 |
Product | hypothetical protein |
Protein accession | YP_002999688 |
Protein GI | 251785384 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2925] Exonuclease I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG ACGGTAAGCA ACAATCTACC TTTTTGTTTC ACGATTACGA AACCTTTGGA ACGCACCCCG CGTTAGATCG CCCTGCACAG TTCGCAGCCA TTCGCACTGA TAACGAATTC AATGTCATCG GCGAACCCGA AGTCTTTTAC TGCAAGCCCG CGGATGACTA TTTACCCCAG CCTGGAGCAG TATTAATTAC CGGTATTACC CCGCAGGAAG CACGGGCGAA AGGAGAAAAC GAAGCCGCGT TTGCCGCCCG TATTCACTCG CTTTTTACCG TACCGAAGAC CTGTATTCTG GGCTACAACA ATGTGCGTTT CGACGATGAA GTCACACGCA ACGTTTTTTA TCGTAATTTC TACGATCCTT ACGCCTGGAG CTGGCAGCAT GATAACTCGC GCTGGGATTT ACTGGATGTT ATGCGTGCCT GTTATGCCCT GCGCCCGGAA GGAATAAACT GGCCTGAAAA TGATGACGGT CTACCGAGCT TTCGCCTTGA GCATTTAACC AAAGCGAATG GTATTGAACA TAGCAACGCC CACGATGCGA TGGCTGATGT GTACGCCACT ATTGCGATGG CGAAACTGGT AAAAACGCGT CAGCCACGCC TGTTTGATTA TCTCTTTACC CATCGTAATA AACACAAACT GATGGCGTTG ATTGATGTTC CGCAGATGAA ACCCCTGGTT CACGTTTCCG GAATGTTTGG GGCATGGCGT GGCAATACCA GCTGGGTGGC ACCGCTGGCG TGGCATCCTG AAAATCGCAA TGCCGTAATT ATGGTGGATT TGGCAGGAGA CATTTCGCCA TTACTGGAAC TGGATAGCGA CACATTGCGC GAGCGTTTAT ATACCGCAAA AACCGATCTT GGCGATAACG CCGCCGTTCC GGTTAAGCTG GTGCATATCA ATAAATGTCC GGTACTGGCC CAGGCGAATA CGCTACGCCC GGAAGATGCC GACCGACTGG GAATTAATCG TCAGCATTGC CTCGATAACC TAAAAATTCT GCGTGAAAAT CCGCAAGTGC GTGAAAAAGT GGTGGCGATA TTCGCGGAAG CCGAACCGTT TACGCCTTCA GATAACGTGG ATGCACAGCT TTATAATGGC TTTTTCAGTG ACGCCGATCG TGCAGCAATG AAAATTGTGC TGGAAACCGA GCCGCGTAAT TTACCGGCAC TGGATATCAC TTTTGTTGAT AAACGAATTG AAAAACTGTT GTTCAATTAT CGAGCGCGCA ACTTCCCGGG GACGCTGGAT TATGCCGAGC AGCAACGCTG GCTTGAGCAC CGCCGCCAGG TCTTCACGCC AGAGTTTTTG CAGGGCTACG CTGATGAATT GCAGATGCTG GCACAACAGT ATGCCGATGA CAAAGAGAAA GTGGCGCTGT TAAAAGCGCT TTGGCAGTAC GCGCAAGAGA TTGTCTAA
|
Protein sequence | MMNDGKQQST FLFHDYETFG THPALDRPAQ FAAIRTDNEF NVIGEPEVFY CKPADDYLPQ PGAVLITGIT PQEARAKGEN EAAFAARIHS LFTVPKTCIL GYNNVRFDDE VTRNVFYRNF YDPYAWSWQH DNSRWDLLDV MRACYALRPE GINWPENDDG LPSFRLEHLT KANGIEHSNA HDAMADVYAT IAMAKLVKTR QPRLFDYLFT HRNKHKLMAL IDVPQMKPLV HVSGMFGAWR GNTSWVAPLA WHPENRNAVI MVDLAGDISP LLELDSDTLR ERLYTAKTDL GDNAAVPVKL VHINKCPVLA QANTLRPEDA DRLGINRQHC LDNLKILREN PQVREKVVAI FAEAEPFTPS DNVDAQLYNG FFSDADRAAM KIVLETEPRN LPALDITFVD KRIEKLLFNY RARNFPGTLD YAEQQRWLEH RRQVFTPEFL QGYADELQML AQQYADDKEK VALLKALWQY AQEIV
|
| |