Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0847 |
Symbol | |
ID | 6269000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 794884 |
End bp | 796476 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725018 |
Product | IS66 family element, transposase |
Protein accession | YP_001879545 |
Protein GI | 187734264 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAA AATACCTCAT TCGCATTGCA GAACTGGAAT GCCAGCTCCG TCAGAAAGAC CAGCAACTGA GTCTGGTTGA AGAGACGGAG GCCTTCCTGC GCTCTGCACT GGCCCGCGCC GAAGAAAAGA TCGAAGAAGA TGAACGGGAA ATAGAACATC TGCGGGCTCA GATAGAAAAA CTGCGCCGGA TGCTGTTCGG TACCCGTTCT GAAAAACTGC GTCGTGAAGT TGAACAGGCT GAGGCCCTGC TGAAACAACG CGAACAGGAC AGTGATCGTT ACAGTGGGCG GGAAGACGAT CCGCAGGTTC CCCGCCAGTT GCGACAGTCT CGTCATCGTC GCCCGTTACC GGAGCATCTG CCCCGCGAAA TAAATCGCCT GGAGCCAGAA GAAAGCTGTT GCCCGGAGTG TGGCGGTGAG CTGGATTATC TGGGGGAAGT CAGCGCAGAA CAACTGGAAC TGGTGAGCAG CGCTCTGAAA GTGATCCGCA CAGAACGGGT AAAAAAAGCC TGTACAAAAT GTGACTGCAT CGTTGAAGCA CCGGCACCAT CCCGTCCGAT AGAGCGTGGT ATCGCGGGCC CGGGGTTACT TGCCCGCATG TTAACGGGAA AATACTGCGA ACACCTGCCA CTGTATCGTC AGAGTGAAAT TTTTGCCCGT CAGGGTGTCG AACTGAGCCG TGCATTACTC TCCAACTGGG TTGACGCGTG CTGCCAGTTA ATGACGCCGC TGAATGATGC TCTGTACCGT TATGTGATGA ACAGCCGCAA AGTTCACACT GATGACACAC CAGTAAAAGT GCTGGCACCG GGCAGGAAGA AGGCGAAAAC AGGATATATC TGGACGTATG TCCGGGATGA CAGGAATGCC GGTTCGCCAG AGCCTCCGGC GGTCTGGTTC GCCTACTCAC CGGACCATCA GGGTAAACAT CCGGAGCAGC ACCTTAGTCC CTTCCGGGGT ATCCTGCAGG CAGATGCGTT TAATGGTTAC GATCGGCTGT TCAGTGCCGA ACGAGAAGGC GGCGCGTTGA CGGAAGCAGG ATGCTGGGCT CATGCGCGGC GCAAAGTCCA CGATGTATAT ATCAGTACCA AAAGCGCGAC AGCGGAAGAA GCCCTGAAAC TAATCGGTGA GCTGTACGCC ATCGAGCACG AAATACGCGG GTTGCCGGTG TCTGAACGCC TGGCGGTCAG GCAAATGCAG AGTAAACCGC TACTGACTTC CCTGTATAAG CTGATGCAGG AGAAAGAACA CACGTTATCG AAAAAATGCC GTCTGAGAGA TGCGTTCCGG TATATCAGGA AGCACTGGGT TGCGTTGTGC AACTTCAGTG ATGATGGTCT GGCTGAGGCG GATAATAATG CCGCGGAAAG AGCGCTTCGT GCAGTCTGTC TCGGAAAGAA AAACTTTATG TTCTTCGGCA GCGATCACGG TGGAGAGCGT GGTGCGCTAC TGTACGGGCT GATCGGCACC TGCCGACTGA ACGGTATCGA TCCGGAAGCG TATCTGCGCT ATATCCTGAG CGTACTGCCG GAATGGCCTT CCAACCGTGT TGACGAACTC CTGCCATGGA ACGTAGCACT CACCAATAAA TAA
|
Protein sequence | MNQKYLIRIA ELECQLRQKD QQLSLVEETE AFLRSALARA EEKIEEDERE IEHLRAQIEK LRRMLFGTRS EKLRREVEQA EALLKQREQD SDRYSGREDD PQVPRQLRQS RHRRPLPEHL PREINRLEPE ESCCPECGGE LDYLGEVSAE QLELVSSALK VIRTERVKKA CTKCDCIVEA PAPSRPIERG IAGPGLLARM LTGKYCEHLP LYRQSEIFAR QGVELSRALL SNWVDACCQL MTPLNDALYR YVMNSRKVHT DDTPVKVLAP GRKKAKTGYI WTYVRDDRNA GSPEPPAVWF AYSPDHQGKH PEQHLSPFRG ILQADAFNGY DRLFSAEREG GALTEAGCWA HARRKVHDVY ISTKSATAEE ALKLIGELYA IEHEIRGLPV SERLAVRQMQ SKPLLTSLYK LMQEKEHTLS KKCRLRDAFR YIRKHWVALC NFSDDGLAEA DNNAAERALR AVCLGKKNFM FFGSDHGGER GALLYGLIGT CRLNGIDPEA YLRYILSVLP EWPSNRVDEL LPWNVALTNK
|
| |