Gene Information |
Locus tag | SbBS512_E1691 |
Symbol | |
ID | 6271405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1539077 |
End bp | 1540705 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725772 |
Product | transposase (IS4 family) |
Protein accession | YP_001880270 |
Protein GI | 187730252 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3385] FOG: Transposase and inactivated derivatives |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0674671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGTTTCCGG ATTTTTTTAT GCACATTGGA CAGGCTCTTG ATCTGGTATC CCGTTACGAT TCTCTGCGTA ACCCACTGAC TTCTCTGGGG GATTACCTCG ACCCCGAACT CATCTCTCGT TGCCTTGCCG AATCAGGTAC TGTAACGCTA CGCAAGCGCC GCCTTCCCCT CGAAATGATG GTCTGGTGTA TTGTTGGCAT GGCGCTTGAG CGTAAAGAAC CTCTTCACCA GATTGTGAAT CGCCTGGACA TCATGCTGCC GGGCAATCGC CCCTTCGTTG CCCCCAGTGC CGTTATTCAG GCCCGCCAGC GCCTGGGAAG TGAGGCTGTC CGCCGCGTGT TCACGAAAAC AGCGCAGCTC TGGCATAACG CCACGCCGCA TCCGCACTGG TGCGGCCTGA CCCTGCTGGC CATCGATGGT GTGTTCTGGC GCACACCGGA TACACCAGAG AACGATGCAG CCTTCCCCCG CCAGACACAT GCCGGGAACC CGGCGCTCCA CCCGCAGGTC AAAATGGTCT GCCAGATGGA ACTGACCAGC CATCTGCTGA CGGCTGCAGC CTTCGGCACG ATGAAGAACA GCGAAAATGA GCTTGCTGAG CAACTTATAG AACAAACCGG CGATAACACT CTGACGTTAA TGGATAAAGG TTATTACTCA CTGGGACTGT TAAATGCCTG GAGCCTGGCG GGAGAACACC GCCACTGGAT GATACCTCTC AGAAAGGGAG CGCAATATGA AGAGCTCAGA AAACTGGGTA AAGGCGATCA TCTGGTGAAG CTGAAAACCA GCCCGCAGGC ACGAAAAAAG TGGCCGGGAC TGGGAAATGA AGTGACAGCC CGCCTGCTGA CCGTGACGCG CAAAGGAAAA GTCTGCCATC TGCTGACGTC GATGACGGAC GCCATGCGCT TCCCCGGAGG AGAAATGGCG GATCTGTACA GTCATCGCTG GGAAATTGAA CTGGGATACA GGGAGATAAA ACAGACGATG CAACTGAGCA GGCTGACGCT GAGAAGTAAA AAGCCGGAGC TTGTGGAGCA AGAGCTGTGG GGTGTCTTAC TGGCTTATAA TCTGGTGAGA TATCAGATGA TTAAAATGGC GGAAAGTGGC GCAGTAGACT GTGACGTTTT TTTTGACGAC AGGGACCAGG CAGTCCCCTA CACAGCCACC GCTGATGATG TCGCTCCGAC GGGTCAGCAA ATCTGGCAGG AACTGCAAAG CGGCAAATGG GGTGAGATAG CCCCATTCAC TGTGACACCA GAAATGCTGG AAGCGGCCAG AGAGGCCAGA CGTCAGGAAA TTGAAGCATG GCGCGCAGAA CAGGAGGCGA AGCCGTTCAC GTTTGAATGG AACGGTCGTA TCTGGAATGC TGGTCCCGAC TCACTGGGCC GCCTGTCCCC GGTAGTCATG CTGGCAAAAT CTGTCACAGC ACAAACACAT ATGGCGTGGA GCGATGCCGA TAATCAGCAG GTGAAACTGT CGATGCCGGA ACTGGAAGAA CTGGCGGCAG CAATGGTGCA GGCGCAGGTC GATCGCAACG ATGAGATTTA TCGCCGTCAG CGTGAAATGA AAGAGGAGCT GAGCGGTCTG GATGATTTGG CTTCAATTCG GGCGTTTGAC GTTGAGTAA
Protein sequence | MFPDFFMHIG QALDLVSRYD SLRNPLTSLG DYLDPELISR CLAESGTVTL RKRRLPLEMM VWCIVGMALE RKEPLHQIVN RLDIMLPGNR PFVAPSAVIQ ARQRLGSEAV RRVFTKTAQL WHNATPHPHW CGLTLLAIDG VFWRTPDTPE NDAAFPRQTH AGNPALHPQV KMVCQMELTS HLLTAAAFGT MKNSENELAE QLIEQTGDNT LTLMDKGYYS LGLLNAWSLA GEHRHWMIPL RKGAQYEELR KLGKGDHLVK LKTSPQARKK WPGLGNEVTA RLLTVTRKGK VCHLLTSMTD AMRFPGGEMA DLYSHRWEIE LGYREIKQTM QLSRLTLRSK KPELVEQELW GVLLAYNLVR YQMIKMAESG AVDCDVFFDD RDQAVPYTAT ADDVAPTGQQ IWQELQSGKW GEIAPFTVTP EMLEAAREAR RQEIEAWRAE QEAKPFTFEW NGRIWNAGPD SLGRLSPVVM LAKSVTAQTH MAWSDADNQQ VKLSMPELEE LAAAMVQAQV DRNDEIYRRQ REMKEELSGL DDLASIRAFD VE
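
The coordinate and length fields in this record are mutually consistent, and the check can be sketched in a few lines. The following is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API. Note also that the gene sequence begins with TTG while the protein begins with M: TTG is one of the alternative start codons in translation table 11 and is read as methionine at initiation.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (SbBS512_E1691).
length_bp = gene_length(1539077, 1540705)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1629 542
```

Both values match the reported Gene Length (1629 bp) and Protein Length (542 aa).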