Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0952 |
Symbol | |
ID | 6272372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 881101 |
End bp | 882423 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641725106 |
Product | hypothetical protein |
Protein accession | YP_001879633 |
Protein GI | 187732045 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0000381632 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAACAGA TAGCCCGCTC TGTCGCCCTG GCGTTTAATA ATTTACCGCG ACCACACCGC GTTATGTTGG GGTCGCTCAC CGTTCTTACT CTGGCCGTCG CTGTCTGGCG GCCCTATGTT TATCACCGCG ATGCCACGCC AATTGTCAAA ACCATTGAGC TGGAACAGAA CGAAATTCGT TCGCTCTTAC CTGAAGCCAG TGAGCCGATT GATCAAGCTG CACAAGAAGA TGAAGCCATT CCCCAGGACG AACTGGATGA CAAAATCGCC GGTGAAGCGG GTGTGCATGA ATATGTTGTT TCCACTGGCG ATACGCTAAG CAGCATTCTC AATCAGTATG GTATTGATAT GGGTGATATC ACCCAACTGG CTGCGGCCGA CAAAGAATTG CGTAACCTGA AAATCGGTCA ACAACTCTCC TGGACATTAA CCGCGGACGG CGAACTGCAG CGCCTCACCT GGGAAGTGTC TCGTCGTGAA ACCCGAACCT ATGACCGTAC TGCCGCTAAC GGTTTTAAAA TGACCAGCGA AATGCAGCAA GGAGAGTGGG TTAACAATCT GCTGAAAGGT ACCGTCGGAG GAAGCTTTGT TGCCAGCGCC AGAAACGCCG GTTTAACCAG CGCCGAAGTG AGCGCAGTGA TTAAAGCCAT GCAGTGGCAA ATGGATTTCC GCAAACTGAA AAAAGGCGAT GAATTTGCGG TGTTAATGTC TCGAGAGATG CTTGATGGTA AACGTGAGCA AAGCCAGCTG CTGGGCGTAC GTTTGCGTTC AGAAGGTAAA GATTATTACG CAATCCGCGC TGAGGATGGC AAATTCTACG ACCGTAACGG TACTGGTCTG GCGAAAGGAT TCTTGCGATT CCCGACGGCG AAACAGTTCC GTATCTCGTC TAACTTTAAC CCGCGTCGTA CTAATCCGGT GACCGGTCGC GTTGCACCAC ACAGAGGTGT TGATTTCGCC ATGCCGCAGG GTACGCCAGT GCTTTCAGTG GGTGACGGTG AAGTGGTGGT TGCCAAACGC AGTGGCGCAG CAGGTTATTA TGTGGCTATT CGTCATGGTC GCAGCTACAC CACGCGTTAT ATGCACTTGC GCAAGATTCT GGTGAAACCG GGACAGAAGG TGAAACGTGG CGACCGTATC GCGCTTTCCG GTAATACCGG ACGTTCAACC GGGCCGCATC TGCACTATGA AGTATGGATA AACCAGCAGG CCGTAAACCC GCTGACGGCA AAACTGCCGC GTACCGAAGG GCTGACCGGC TCCGATCGTC GCGAATTCCT GGCGCAGGCC AAAGAGATTG TGCCGCAGCT ACGGTTTGAT TAA
|
Protein sequence | MQQIARSVAL AFNNLPRPHR VMLGSLTVLT LAVAVWRPYV YHRDATPIVK TIELEQNEIR SLLPEASEPI DQAAQEDEAI PQDELDDKIA GEAGVHEYVV STGDTLSSIL NQYGIDMGDI TQLAAADKEL RNLKIGQQLS WTLTADGELQ RLTWEVSRRE TRTYDRTAAN GFKMTSEMQQ GEWVNNLLKG TVGGSFVASA RNAGLTSAEV SAVIKAMQWQ MDFRKLKKGD EFAVLMSREM LDGKREQSQL LGVRLRSEGK DYYAIRAEDG KFYDRNGTGL AKGFLRFPTA KQFRISSNFN PRRTNPVTGR VAPHRGVDFA MPQGTPVLSV GDGEVVVAKR SGAAGYYVAI RHGRSYTTRY MHLRKILVKP GQKVKRGDRI ALSGNTGRST GPHLHYEVWI NQQAVNPLTA KLPRTEGLTG SDRREFLAQA KEIVPQLRFD
|
| |