Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1062 |
Symbol | |
ID | 6268694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 973008 |
End bp | 974216 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725201 |
Product | IS605 family transposase orfB |
Protein accession | YP_001879720 |
Protein GI | 187732029 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000000017458 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACGAC TACAAGCATT TAAATTCCAG TTAAGACCCG GTGGTCAACA GGAGCGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC ACGTCAGAAT GAGAATCATG AGGTCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ATGCCACTGA AACGCAATGG CTTAAAGATG CCCCGTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCAGAATCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GTATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAACAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGGTC GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGC CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGAAG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAACAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC TGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGATCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GCGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGCGTGCTG TGGTCATACA GCGAAAGAAA ACCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAATGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCCCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQERE MRRFAGACRF VFNRALARQN ENHEVGNKYI PYGKMASWLV EWKNATETQW LKDAPSQPLQ QSLKDLERAY KNFFQNRAAF PRFKKRGQND VFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMV GLDAGVAKLA TLSDGTAFEP VNSFQKNQKK LARLQRQLSR KVKFSNNWQK QKRKIQRLHS CIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TISQPGRNVR AKSGLNRSIL DQGWYEMRRQ LEYKQLWRGG QVLAVPPAYT SQRCACCGHT AKENRLSQSK FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRPL KQEPTEMIQA TA
|
| |