Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2240 |
Symbol | |
ID | 6272217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2037060 |
End bp | 2038079 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641726262 |
Product | transposase, IS1111 family |
Protein accession | YP_001880746 |
Protein GI | 187732173 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000000963569 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA CACCGGTTGG CGTTGATATC GCAAAACATG TCATTCAGAT TCACTTCATC AATGAGCACA CAGGTGAAGT GGTTGATAAA CAGTTGCGTA GACAGGATTT TCTGACGTTC TTCGGCAACC GTGAGCCATG CCTGATTGGT ATGGAGGCCT GTGGAGGTTC TCAGCACTGG GCACGGGAAC TGACAAAACT TGGTCATAAA GTTCGGTTGT TGCAGGCCCG CTTCGTTAAG GCATTCGTCA TGGGCAATAA GAATGATGTG ATGGATGCCC GGGCTATCTG GATGGCGGTT CAGCAGCCGG GTAAAGAAAT CGCCGTAAAA ACAGAAGAAC AGCAGTCGGT ACTGGTTCTG CACCGTACCC GCATGCAACT GGTGAAGTTC CGGACCGCAC AAATTAATGC CCTGCACGGG ACGTTACTGG AGTTTGGTGA AACCATCCAC AAAGGCCGGG CAGCGATGGA GCGGGAGTTC CCCGAAGCAC TGGAACGGAT GAAAGAGAGA CTGCCACCGT ATCTCATTAT GGTTCTGGAA AACCAGTACA ACCGACTGAA TGAGCTGGAC TCACTGATAG AGGATATTGA AAAACAGCTT ACCAGCGTGG CGAGGCAGAA TGAAACCTGT AAGCGGTTGC TGGATATTCC TGGCGTTGGA CCACTTATTG CGACGGCAGC GGTGGCCACC ATGGGGGAAG CATCAGCGTT TAAATCGGGG CGAGAGTTCG CCGCATATGT TGGTCTGGTT CCAAAACAAA CAGGCTCCGG AGGGAAAGTA CGTCTGCTGG GGATAAGCAA ACGTGGTGAC ACTTATCTCA GGACATTATT TATCCACGGT GCAAGAGCGG TGGCATTAGT AGCTAAAGAG CCTGGCCCGT GGATAACCGA ACTGAAAAAA CGTCGTCCAG CCAGTGTGGC AATCGTCGCC ATGGCAAACA AGCTGGCACG AACAGTATGG GCGATAACCG CCCATGACCG TAAGTATGAC AGGAACCACG TCAGTATCAG ACCATATTAA
|
Protein sequence | MKYTPVGVDI AKHVIQIHFI NEHTGEVVDK QLRRQDFLTF FGNREPCLIG MEACGGSQHW ARELTKLGHK VRLLQARFVK AFVMGNKNDV MDARAIWMAV QQPGKEIAVK TEEQQSVLVL HRTRMQLVKF RTAQINALHG TLLEFGETIH KGRAAMEREF PEALERMKER LPPYLIMVLE NQYNRLNELD SLIEDIEKQL TSVARQNETC KRLLDIPGVG PLIATAAVAT MGEASAFKSG REFAAYVGLV PKQTGSGGKV RLLGISKRGD TYLRTLFIHG ARAVALVAKE PGPWITELKK RRPASVAIVA MANKLARTVW AITAHDRKYD RNHVSIRPY
|
| |