Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2977 |
Symbol | |
ID | 6272356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2783459 |
End bp | 2784661 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726919 |
Product | IS4 ORF |
Protein accession | YP_001881384 |
Protein GI | 187732534 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3385] FOG: Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0325653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCCGG ATTTTTTTAT GCACATTGGA CAGGCTCTTG ATCTGGTATC CCGTTACGAT TCTCTGCGTA ACCCACTGAC TTCTCTGGGG GATTACCTCG ACCCCGAACT CATCTCTCGT TGCCTTGCCG AATCAGGTAC TGTAACGCTA CGCAAGCGCC GTCTTCCCCT CGAAATGATG GTCTGGTGTA TTGTTGGCAT GGCGCTTGAG CGTAAAGAAC CTCTTCACCA GATTGTGAAT CGCCTGGACA TCATGCTGCC GGGCAATCGC CCCTTCGTTG CCCCCAGTGC CGTTATTCAG GCCCGTCAGC GCCTGGGAAG TGAGGCTGTC CACCGCGTGT TCACGAAAAC AGCGCAGCTC TGGCATAACG CCACGCCGCA TCCGCACTGG TGCGGCCTGA CCCTGCTGGC CATCGATGGT GTGTTCTGGC GCACACCGGA TACACCAGAG AACGATGCAG CCTTCCCCCG CCAGACACAT GCCGGGAACC CGGCGCTCCA CCCGCAGGTC AAAATGGTCT GCCAGATGGA ACTGACCAGC CATCTGCTGA CGGCTGCAGC CTTCGGCACG ATGAAGAACA GCGAAAATGA GCTTGCTGAG CAACTTATAG AACAAACCGG CGATAACACT CTGACGTTAA TGGATAAAGG TTATTACTCA CTGGGACTGT TAAATGCCTG GAGCCTGGCG GGAGAACACC GCCACTGGAT GATACCTCTC AGAAAGGGAG CGCAATATGA AGAGCTCAGA AAACTGGGTA AAGGCGATCA TCTGGTGAAG CTGAAAACCA GCCCGCAGGC ACGAAAAAAG TGGCCGGGAC TGGGAAATGA AGTGACAGCC CGCCTGCTGA CCGTGACGCG CAAAGGAAAA GTCTGCCATC TGCTGACGTC GATGACGAAC GCCATGCGCT TCCCCGGAGG AGAAATGGCG GATCTGTACA GTCATCGCTG GGAAATCGAA CTGGGATACA GGGAGATAAA ACAGACGATG CAACTGAGCA GGCTGACGCT GAGAAGTAAA AAGCCGGAGC TTGTGGAGCA AGAGCTGTGG GGTGTCTTAC TGGCTTATAA TCTGGTGAGA TATCAGATGA TTAAAATGGC GGAACATCTG AAAGGTTACT GGCCAAATCA ACTGAGTTTC TCAGAATCAT GCGGAATGGT GATGAGAATG CTGATGACAT TGCAGGGCGC TGAGGTAGCC TGA
|
Protein sequence | MFPDFFMHIG QALDLVSRYD SLRNPLTSLG DYLDPELISR CLAESGTVTL RKRRLPLEMM VWCIVGMALE RKEPLHQIVN RLDIMLPGNR PFVAPSAVIQ ARQRLGSEAV HRVFTKTAQL WHNATPHPHW CGLTLLAIDG VFWRTPDTPE NDAAFPRQTH AGNPALHPQV KMVCQMELTS HLLTAAAFGT MKNSENELAE QLIEQTGDNT LTLMDKGYYS LGLLNAWSLA GEHRHWMIPL RKGAQYEELR KLGKGDHLVK LKTSPQARKK WPGLGNEVTA RLLTVTRKGK VCHLLTSMTN AMRFPGGEMA DLYSHRWEIE LGYREIKQTM QLSRLTLRSK KPELVEQELW GVLLAYNLVR YQMIKMAEHL KGYWPNQLSF SESCGMVMRM LMTLQGAEVA
|
| |