Gene SbBS512_E1086 details


Gene Information

Locus tag: SbBS512_E1086
Symbol:
ID: 6271624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 987183
End bp: 988748
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 55%
IMG OID: 641725224
Product: transposase family
Protein accession: YP_001879742
Protein GI: 187731723
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
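
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon (not counted)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(987183, 988748)
print(length_bp)                  # 1566
print(protein_length(length_bp))  # 521
```

Both results match the Gene Length and Protein Length fields reported in the record.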


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATCT CCGCTCTCAA CACCACGAAT GACATCGAAA AACTGCGTGC TATGGCACTT 
GCCATGGTAC AAGAAGTCAT GTCGGAGAAT GCCGAAAAAG AGCGGGAATT ACTGGAGAAA
AGCCGGCGCA TCCAGCTTCT GGAAGAAATG CTGAAACTGG TTCGTCAACA GCGCTTCGGA
AAAAAATGTG AAACGCTGGC TGGTATGCAA CGCTCCCTAT TCGAAGAGGA TGTTGATGCC
GATATCGCCG CGCTTACCGC ACATCTGGAT AAACTGCTCC CGCAATCCCC TGAAGAAGAT
GAAAAAGCGT CCCGTTCACG CCCGATACGC AAACCCTTAC CGGTTCATCT TCCACGGGTG
GAAAAAATTA TCCAGCCGGA CACTGACCAT TGCCCTGAAT GTGACGAGCC GCTGCACTAT
ATCCGCGATG CGGTGAGTGA AAAGCTGGAG TATATTCCCG CTCACTTTGT GGTGAACCGT
TATGTCCGTC CGCAATACAG TTGTCCCTGT TGCCAGAAGG TGTTCAGCGG TGAAATGCCG
GCACATATCC TCCCGAAAAG TGCCGTTGAG CCATCAGTCA TCGCACAGGT GATCATCAAT
AAATACGGTG ACCACCTGCC TCTGTATCGC CAGCAACAGG TCTTTGCCCG TTCAGATGTC
GGGCTGCCCG TCAGTTCGAT GGCTGACATG GTTGGCGCGG CGGGTGCCGC ATTATCTCCC
CTGGCGGCGT TACTCCATCG CGAGTTGATA AACCGTCCGG TGGTGCATGC AGATGAGACT
ACCCTGAAGA TCCTGAACAC GAAGAAAGGC GGTAAATCCT GCTCCGGTTA TCTGTGGGCA
TACGTCAGTG GAGAAAGGAC GGGACCGTCA GTTGTGTGCT TCGACTGCCG GACCGGACGT
AGCCATGAGT ATCCTGAAAA CTGGCTTCAG GGCTGGGGCG GGACGCTGGT TGTCGACGGA
CATAAAGCTT ACCGGACTCT GGCAAACAAA GTGCCGGAGA TCACGCTGGC CGGATGCTGG
GCCCATGCCC GCAGGGGCTT CGCCGACCTG TATAAAATCA GTAAAGATCC ACGGGCTGCC
ATAGCCGTGA AGAAAATCGC GGGGTTGTAC CGTCTTGAGA AGAAGATCAG TAGCCGCCCC
GTGGAAAAAA TCCGCCAGTG GCGACAGCGT TATGCCCGTC CGATACTGGA AGAACTGTGG
TCATGGCTTG AAGAGCAGGA ACCGCAATGT TCTCCGGGAA AGGCATTACA CAAAGCCATT
GCCTATGCGC TGTCTCATCG CGTGGAACTG AGCCGCTTCC TGGAAGATGG TGCGGTGCCG
CTGGATAATA ATGTGTGTGA ACGGGCCATC AAAAACGTGG TTCTGGGCAG AAAATCGTGG
CTGTTCGCCG GTTCGCAGAT GGCGGGAGAA CGCGCCGCGC AAATAATGAG CTTGCTGGAA
ACCGCGAAAC GCAACGGTCT GGAGCCGCAT GCCTGGTTGA CAGACGTCCT GATGCGTCTG
CCGGAGTGGC CGGAGGAGCG ACTGGCAGAG TTGCTGCCTC TTGAGGGATT TACCTTCTCC
GGGTGA
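
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a minimal sketch, the following shows one way to strip the layout and compute a GC percentage like the one in the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used here for brevity, so the printed value describes that fragment rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, as displayed.
first_line = "ATGGATATCT CCGCTCTCAA CACCACGAAT GACATCGAAA AACTGCGTGC TATGGCACTT"
fragment = clean_sequence(first_line)
print(len(fragment))  # 60
print(f"{gc_content(fragment):.1f}% GC in the first 60 bases")
```

Passing the full sequence block through `clean_sequence` and then `gc_content` would give the figure for the whole gene.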
 
Protein sequence
MDISALNTTN DIEKLRAMAL AMVQEVMSEN AEKERELLEK SRRIQLLEEM LKLVRQQRFG 
KKCETLAGMQ RSLFEEDVDA DIAALTAHLD KLLPQSPEED EKASRSRPIR KPLPVHLPRV
EKIIQPDTDH CPECDEPLHY IRDAVSEKLE YIPAHFVVNR YVRPQYSCPC CQKVFSGEMP
AHILPKSAVE PSVIAQVIIN KYGDHLPLYR QQQVFARSDV GLPVSSMADM VGAAGAALSP
LAALLHRELI NRPVVHADET TLKILNTKKG GKSCSGYLWA YVSGERTGPS VVCFDCRTGR
SHEYPENWLQ GWGGTLVVDG HKAYRTLANK VPEITLAGCW AHARRGFADL YKISKDPRAA
IAVKKIAGLY RLEKKISSRP VEKIRQWRQR YARPILEELW SWLEEQEPQC SPGKALHKAI
AYALSHRVEL SRFLEDGAVP LDNNVCERAI KNVVLGRKSW LFAGSQMAGE RAAQIMSLLE
TAKRNGLEPH AWLTDVLMRL PEWPEERLAE LLPLEGFTFS G
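
The protein sequence above can be reproduced from the gene sequence by translation. The sketch below is a hedged illustration, not the pipeline the database used. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function name `translate` is a hypothetical helper, and only the first 60 bases and the final line of the gene are translated.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record.
start = "ATGGATATCT CCGCTCTCAA CACCACGAAT GACATCGAAA AACTGCGTGC TATGGCACTT"
print(translate(start))     # MDISALNTTNDIEKLRAMAL
# Final line of the gene sequence: one glycine codon followed by the TGA stop.
print(translate("GGGTGA"))  # G
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final residue.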