Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1492 |
Symbol | |
ID | 6269233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1361615 |
End bp | 1363510 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641725592 |
Product | hypothetical protein |
Protein accession | YP_001880098 |
Protein GI | 187734007 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGCAG GGCTGTTTGT ATCATCTCTA AGTTATGCAG AAAACCCGGA GATCCCTTCT TATGAAGAAG GGATCTCGCT CTTTGATGTT GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG CGAAATCAGC AGATTAAGCA CGGATTTTAT CGTGATTTAC CACGACTCTG GATGCAGCCT GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACTCGTGA TGGTATTCCT GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGAGTA TTGTCGTGGG CGATAAACAA CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC CTGCGTGAGG GAGATTCAGA TCTGTTAATC TGGAACGTGA CTGGTAACCA CTGGCCGTTT GAAATCTATA AGACCCGATT TTCACTCAAG TTCCCTGATA TCGCGGGTAA TCCATTTAGC GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCCTTGAG GACGGAAGAA TTGAATCCCG CGATCCGTTT TATCGAGAAG ATTTCACGGT ACTCTACCGC TGGCCTCACG CGTTACTTAG CAATGCCCCG GCTCCACAAA CGACGAATAT TTTCAGCCAT CTTCTTTTAC CCTCCACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTATT CCTGGTTTGT GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGC CGGTAGATGT GATTGAAACC GATGTCATTC CGCCAGATTA CACGCCCGGC ATGTTACGTC TCGATGCGAA GCTGGTTTAC GACGATAAAG GTTTCTGTGC CGATATCGTA AATCTGATCG TGAAGGGAAA AATTCATCTG GAAGATCAGT ATGATAAGAA CCAGCAAATC CTGATTTGTG TTAATGAAGG CGCGACCAGA AATAATGAGG TATTACTGCC CGCAGAGCAG TTATTACTGG AAGCGTTATT TCGTAAAGGC GATAAGGTCG TTCTTACGGG GAGACGCAAC AGAGTCTTAC GCAGGGCATT TTTACGGATG CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TTGTTTTACC GGCCTGATAC GTTTTTGCAA TGGGGTGGAC TGGCAATATT GGCGATCATT CTCTACGGTA ACCTGAGTCC CGTAGGTTGG GCAGGAATGA GTCTGGTAGG CGATATGTTT ATTATGATCT GCTGGATTAT TCCTTTTTTA TTTTGTTCCC TTGAGCTTTT GTTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGCA ATCATCACTT TGTTTTTACC ACTGATTTGT TCAGGCGTGG CCTTCTATTC TCTCTATATC AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTGC CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG CAGCAACGTT ATGCCCGCGG TGAAGCTATC GTTAACTATC TTGCGCGTAA AGAGGCAGCA ACACACAGTG GACGTCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT TGGGCTATAT CGGCAAATTT GGGGAGGGAA TGGGCGTTAC GCATTGCCCC TTCGCTTTCT TCGGCGATTC GCGCTCCAGA GATTGCCCGT AACGGCGTTT TATTCTCATT ACAGACGCAC CTAAGTTGCG GGGCCAATAC CAGTTTGTTG GGGCGAAGTT ATTCCGGTGG TGGTGCTGGC GGCGGCGCGG GTGGCGGAGG CGGTGGTGGC TGGTAA
|
Protein sequence | MAGKFRCILL LIAGLFVSSL SYAENPEIPS YEEGISLFDV EATLQPDGVL DIKENIHFQA RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMSIVVGDKQ RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTRFSLK FPDIAGNPFS EIDLFTGEEG DTYRNGRILE DGRIESRDPF YREDFTVLYR WPHALLSNAP APQTTNIFSH LLLPSTSSLL IWFPCLFLVC GWLYLWKRRP QFTPVDVIET DVIPPDYTPG MLRLDAKLVY DDKGFCADIV NLIVKGKIHL EDQYDKNQQI LICVNEGATR NNEVLLPAEQ LLLEALFRKG DKVVLTGRRN RVLRRAFLRM QKFYLPRKKS LFYRPDTFLQ WGGLAILAII LYGNLSPVGW AGMSLVGDMF IMICWIIPFL FCSLELLFAR DDDKPCVNRA IITLFLPLIC SGVAFYSLYI NVGDVFFYWY MPAGYFSAVC LTGYLTGMGY IFLPKFTQTG QQRYARGEAI VNYLARKEAA THSGRRRKGE TRKLDYALLG WAISANLGRE WALRIAPSLS SAIRAPEIAR NGVLFSLQTH LSCGANTSLL GRSYSGGGAG GGAGGGGGGG W
|
| |