Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2324 |
Symbol | |
ID | 6269163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2113802 |
End bp | 2115148 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726329 |
Product | IS4 transposase |
Protein accession | YP_001880812 |
Protein GI | 187733806 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3385] FOG: Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000000109672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCCGG ATTTTTTTAT GCACATTGGA CAGGCTCTTG ATCTGGTATC CCGTTACGAT TCTCTGCGTA ACCCACTGAC TTCTCTGGGG GATTACCTCG ACCCCGAACT CATCTCTCGT TGCCTTGCCG AATCAGGTAC TGTAACGCTA CGCAAGCGCC GTCTTCCCCT CGAAATGATG GTCTGGTGTA TTGTTGGCAT GGCGCTTGAG CGTAAAGAAC CTCTTCACCA GATTGTGAAT CGCCTGGACA TCATGCTGCC GGGCAATCGC CCCTTCGTTG CCCCCAGTGC CGTTATTCAG GCCCGCCAGC GCCTGGGAAG TGAGGCTGTC CGCCGCGTGT TCACGAAAAC AGCGCAGCTC TGGCATAACG CCACGCCGCA TCCGCACTGG TGCGGCCTGA CCCTGCTGGC CATCGATGGT GTGTTCTGGC GCACACTGGA TACACCAGAG AACGATGCAG CCTTCCCCCG GCAGACACAT GCCGGGAACC CGGCGCTCTA CCCGCAGGTC AAAATGGTCT GCCAGATGGA ACTGACCAGC CATCTGCTGA CGGCTGCAGC CTTCGGCACG ATGAAGAACA GCGAAAATGA GCTTGCTGGG CAACTTATAG AACAAACCGG CGATAACACT CTGACGTTAA TGGATAAAGG TTATTACTCA CTGGGACTGT TAAATGCCTG GAGCCTGGCG GGAGAACACC GCCACTGGAT GATACCTCTC AGAAAGGGAG CGCAATATGA AGAGCTCAGA AAGCTGGGTA AAGGCGATCA TCTGGTGAAG CTGAAAACCA GCCCGCAGGC ACGAAAAAAG TGGCCGGGAC TGGGAAATGA AGTGACAGCC CGCCTGCTGA CCGTGACGCG CAAAGGAAAA GTCTGCCATC TGCTGACGTC GATGACGGAC GCCATGCGCT TCCCCGGAGG AGAAATGGCG GATCTGTACA GTCATCGCTG GGAAATCGAA CTGGGATACA GGGAGATAAA ACAGACGATG CAACTGAGCA GGCTGACGCT GAGAAGTAAA AAGCCGGAGC TTGTGGAGCA AGAGCTGTGG GGTGTCTTAC TGGCTTATAA TCTGGTGAGA TATCAGATGA TTAAAATGGC GGAACATCTG AAAGGTTACT GGCCGAATCA ACTGAGTTTC TCAGAATCAT GCGGAATGGT GATGAGAATG CTGATGACAT TGCAGGGCGC TTCACCGGGA CGTATACCGG AGCTGATGCG CGATCTTGCA AGTATGGGAC AACTTGTGAA ATTACCGACG AGAAGGGGAA GGGCCTTCCC GAGAGTGGTA AAGGAGAGGC CCTGGAAATA CCCCACAGCC CCGAAAAAGA GCCAGTCAGT TGCTTAA
|
Protein sequence | MFPDFFMHIG QALDLVSRYD SLRNPLTSLG DYLDPELISR CLAESGTVTL RKRRLPLEMM VWCIVGMALE RKEPLHQIVN RLDIMLPGNR PFVAPSAVIQ ARQRLGSEAV RRVFTKTAQL WHNATPHPHW CGLTLLAIDG VFWRTLDTPE NDAAFPRQTH AGNPALYPQV KMVCQMELTS HLLTAAAFGT MKNSENELAG QLIEQTGDNT LTLMDKGYYS LGLLNAWSLA GEHRHWMIPL RKGAQYEELR KLGKGDHLVK LKTSPQARKK WPGLGNEVTA RLLTVTRKGK VCHLLTSMTD AMRFPGGEMA DLYSHRWEIE LGYREIKQTM QLSRLTLRSK KPELVEQELW GVLLAYNLVR YQMIKMAEHL KGYWPNQLSF SESCGMVMRM LMTLQGASPG RIPELMRDLA SMGQLVKLPT RRGRAFPRVV KERPWKYPTA PKKSQSVA
|
| |