Gene Information |
Locus tag | SbBS512_E1606 |
Symbol | |
ID | 6272829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1460747 |
End bp | 1462063 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641725696 |
Product | IS66 family element, transposase |
Protein accession | YP_001880196 |
Protein GI | 187731306 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
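The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon. The record does not state either convention. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(1460747, 1462063)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the listed Gene Length (1317 bp) and Protein Length (438 aa).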
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000000125107 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCT CACTTCCTGA CGATATCAAT GCACTGAAAC GTCTCCTTGC CGAACAGGAG GCGCTGAACC GTGCCCTGCT GGAAAAGCTG AACGAGCGTG AACGCGAAAT AGACCATCTG CAGGCACAGC TGGATAAGCT GCGCCGGATG AACTTCGGCA GCCGCTCCGA AAAAGTCTCC CGTCGTATCG CACAGATGGA AGCTGACCTG AAGGCACTTC AGAAAGAAAG TGATACCCTT ACCGGTCGGG TTGACGACCC GGCCGTGCAG CGCCCGCTGC GTCAAACCCG CACCCGCAAA CCGTTCCCCG AATCACTCCC CCGCGATGAA AAACGGCTGC TGCCGGCAGC GTCATGCTGC CCGGAATGTG GAGGCTCACT GAGCTATCTG GGTGAGGATG CCGCCGAACA GCTGGAGTTG ATGCGCAGCG TCTTCCGGGT TATCCGGACT GTACGTGAAA AGCATGCCTG TACTCAGTGC GATGCCATCG TGCAGGCCCC CGCGCCTTCA CGGCCCATCG AGCGGGGTAT CGCAGGACCG GGGCTGCTGG CCCGCGTGCT GATCTCAAAG TATGCAGAGC ACACCCCGCT GTACCGCCAG TCTGAAATGT ACGGCCGCCA GGGCGTGGAG CTGAGTCGTT CACTGCTGTC GGGCTGGGTG GATGCATGCT GCCGGCTACT GTCACCGCTG GAAGAAGCGC TTCAGGACTA TGTGCTGACT GACGGTAAGC TCCATGCTGA TGACACGCCT GTCCCGGTGC TGTTGCCAGG CAATAAGAAA ACGAAGACCG GGCGGTTATG GACCTACGTT CGTGACGACC GTAACGCCGG GTCAACGCTG GCGCCGGCGG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACCCAT CTTGCGGGGT TCAGTGGTGT ACTGCAGGCG GATGCATACG CCGGGTTCAA CGAGCTGTAC CGGGATGGCC GGATAACGGA AGCCGCCTGT TGGGCTCACG CCCGCCGTAA AATCCACGAT GTGCACGTTC GCACCCCGTC AGCCCTGACG GAGGAAGCGC TGAAACGGAT CGGCGAACTG TACGCCATCG AGGCAGAGAT AAGGGGAATG ACGGCGGAGC AGCGCCTTGC CGAACGTCAG TTGAAAACGA AACCGCTGCT GAAATCCCTG GAAAGCTGGC TGCGTGAAAA GATGAAAACC CTGTCGCGAC ACTCAGAACT GGCGAAAGCG TTCGCATACG CCCTGAAGTG GTCAACAAAA ACTGGCCACC GAGTTAGAGT TTTTCCAGTA TCGATTTTCC GATTCGTTTG GGGGTAA
|
Protein sequence | MSGSLPDDIN ALKRLLAEQE ALNRALLEKL NEREREIDHL QAQLDKLRRM NFGSRSEKVS RRIAQMEADL KALQKESDTL TGRVDDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAASCC PECGGSLSYL GEDAAEQLEL MRSVFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP GLLARVLISK YAEHTPLYRQ SEMYGRQGVE LSRSLLSGWV DACCRLLSPL EEALQDYVLT DGKLHADDTP VPVLLPGNKK TKTGRLWTYV RDDRNAGSTL APAVWFAYSP DRKGIHPQTH LAGFSGVLQA DAYAGFNELY RDGRITEAAC WAHARRKIHD VHVRTPSALT EEALKRIGEL YAIEAEIRGM TAEQRLAERQ LKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALKWSTK TGHRVRVFPV SIFRFVWG
|
| |
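The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the Strand field is "-"). It also relies on translation table 11 assigning the same amino acid to each codon as the standard code, with the two tables differing in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")  # remove a trailing stop codon, if present


# First three 10-base blocks of the gene sequence above.
first_block = translate("ATGAGCGGCT CACTTCCTGA CGATATCAAT")
print(first_block)  # MSGSLPDDIN
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` would give the whole protein under the same assumptions.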