Gene Information |
Locus tag | SbBS512_E3503 |
Symbol | |
ID | 6268896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3255454 |
End bp | 3256800 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641727383 |
Product | IS4 transposase |
Protein accession | YP_001881830 |
Protein GI | 187730724 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3385] FOG: Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
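The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end coordinates are 1-based and inclusive, and that the annotated gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3255454
end_bp = 3256800

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1347, matching "Gene Length"
print(protein_length)  # 448, matching "Protein Length"
```

The strand is "-", so the gene is read on the reverse strand of NC_010658; that affects orientation but not the length arithmetic.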
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0875962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCCGG ATTTTTTTAT GCACATTGGA CAGGCTCTTG ATCTGGTATC CCGTTACGAT TCTCTGCGTA ACCCACTGAC TTCTCTGGGG GATTACCTCG ACCCCGAACT CATCTCTCGT TGCCTTGCCG AATCAGGTAC TGTAACGCTA CGCAAGCGCC GTCTTCCCCT CGAAATGATG GTCTGGTGTA TTGTTGGCAT GGCGCTTGAG CGTAAAGAAC CTCTTCACCA GATTGTGAAT CGCCTGGACA TCATGCTGCC GGGCAATCGC CCCTTCGTTG CCCCCAGTGC CGTTATTCAG GCCCGCCAGC GCCTGGGAAG TGAGGCTGTC CGCCGCGTGT TCACGAAAAC AGCGCAGCTC TGGCATAACG CCACGCCGCA TCCGCACTGG TGCGGCCTGA CCCTGCTGGC CATCGATGGT GTGTTCTGGC GCACACCGGA TACACCAGAG AACGATGCAG CCTTCCCCCG CCAGACACAT GCCGGGAACC CGGCGCTCCA CCCGCAGGTC AAAATGGTCT GCCAGATGGA ACTGACCAGC CATCTGCTGA CGGCTGCAGC CTTCGGCACG ATGAAGAACA GCGAAAATGA GCTTGCTGAG CAACTTATAG AACAAACCGG CGATAACACT CTGACGTTAA TGGATAAAGG TTATTACTCA CTGGGACTGT TAAATGCCTG GAGCCTGGCG GGAGAACACC GCCACTGGAT GATACCTCTC AGAAAGGGAG CGCAATATGA AGAGCTCAGA AAACTGGGTA AAGGCGATCA TCTGGTGAAG CTGAAAACCA GCCCGCAGGC ACGAAAAAAG TGGCCGGGAC TGGGAAATGA AGTGACAGCC CGCCTGCTGA CCGTGACGCG CAAAGGAAAA GTCTGCCATC TACTGACGTC GATGACGGAC GCCATGCGCT TCCCCGGAGG AGAAATGGCG GATCTGTACA GTCATCGCTG GGAAATTGAA CTGGGATACA GGGAGATAAA ACAGACGATG CAACTGAGCA GGCTGACGCT GAGAAGTAAA AAGCCGGAGC TTGTGGAGCA AGAGCTGTGG GGTGTCTTAC TGGCTTATAA TCTGGTGAGA TATCAGATGA TTAAAATGGC GGAACATCTG AAAGGTTACT GGCCGAATCA ACTGAGTTTC TCAGAATCAT GCGGAATGGT GATGAGAATG CTGATGACAT TGCAGGGCGC TTCACCGGGA CGTATACCGG AGCTGATGCG CGATCTTGCA AGTATGGGAC AACTTGTGAA ATTACCGACG AGAAGGGGAA GGGCCTTCCC GAGAGTGGTA AAGGAGAGGC CCTGGAAATA CCCCACAGCC CCGAAAAAGA GCCAGTCAGT TGCTTAA
|
Protein sequence | MFPDFFMHIG QALDLVSRYD SLRNPLTSLG DYLDPELISR CLAESGTVTL RKRRLPLEMM VWCIVGMALE RKEPLHQIVN RLDIMLPGNR PFVAPSAVIQ ARQRLGSEAV RRVFTKTAQL WHNATPHPHW CGLTLLAIDG VFWRTPDTPE NDAAFPRQTH AGNPALHPQV KMVCQMELTS HLLTAAAFGT MKNSENELAE QLIEQTGDNT LTLMDKGYYS LGLLNAWSLA GEHRHWMIPL RKGAQYEELR KLGKGDHLVK LKTSPQARKK WPGLGNEVTA RLLTVTRKGK VCHLLTSMTD AMRFPGGEMA DLYSHRWEIE LGYREIKQTM QLSRLTLRSK KPELVEQELW GVLLAYNLVR YQMIKMAEHL KGYWPNQLSF SESCGMVMRM LMTLQGASPG RIPELMRDLA SMGQLVKLPT RRGRAFPRVV KERPWKYPTA PKKSQSVA
|
| |
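The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to normalize such a field and to check length, GC content, and the terminal stop codon. The helper names (`clean`, `gc_percent`, `ends_with_stop`) are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the first three blocks of the gene sequence are used as sample input; pass the full field to check the record itself.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove all whitespace (the ten-character block separators) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Excerpt only: the first three blocks of the "Gene sequence" field.
first_blocks = "TTGTTTCCGG ATTTTTTTAT GCACATTGGA"
print(len(clean(first_blocks)))  # 30
```

The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is an accepted alternative start codon and is translated as methionine when it initiates a protein.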