Gene SbBS512_E1606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1606 
Symbol 
ID6272829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1460747 
End bp1462063 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content59% 
IMG OID641725696 
ProductIS66 family element, transposase 
Protein accessionYP_001880196 
Protein GI187731306 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000125107 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCT CACTTCCTGA CGATATCAAT GCACTGAAAC GTCTCCTTGC CGAACAGGAG 
GCGCTGAACC GTGCCCTGCT GGAAAAGCTG AACGAGCGTG AACGCGAAAT AGACCATCTG
CAGGCACAGC TGGATAAGCT GCGCCGGATG AACTTCGGCA GCCGCTCCGA AAAAGTCTCC
CGTCGTATCG CACAGATGGA AGCTGACCTG AAGGCACTTC AGAAAGAAAG TGATACCCTT
ACCGGTCGGG TTGACGACCC GGCCGTGCAG CGCCCGCTGC GTCAAACCCG CACCCGCAAA
CCGTTCCCCG AATCACTCCC CCGCGATGAA AAACGGCTGC TGCCGGCAGC GTCATGCTGC
CCGGAATGTG GAGGCTCACT GAGCTATCTG GGTGAGGATG CCGCCGAACA GCTGGAGTTG
ATGCGCAGCG TCTTCCGGGT TATCCGGACT GTACGTGAAA AGCATGCCTG TACTCAGTGC
GATGCCATCG TGCAGGCCCC CGCGCCTTCA CGGCCCATCG AGCGGGGTAT CGCAGGACCG
GGGCTGCTGG CCCGCGTGCT GATCTCAAAG TATGCAGAGC ACACCCCGCT GTACCGCCAG
TCTGAAATGT ACGGCCGCCA GGGCGTGGAG CTGAGTCGTT CACTGCTGTC GGGCTGGGTG
GATGCATGCT GCCGGCTACT GTCACCGCTG GAAGAAGCGC TTCAGGACTA TGTGCTGACT
GACGGTAAGC TCCATGCTGA TGACACGCCT GTCCCGGTGC TGTTGCCAGG CAATAAGAAA
ACGAAGACCG GGCGGTTATG GACCTACGTT CGTGACGACC GTAACGCCGG GTCAACGCTG
GCGCCGGCGG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACCCAT
CTTGCGGGGT TCAGTGGTGT ACTGCAGGCG GATGCATACG CCGGGTTCAA CGAGCTGTAC
CGGGATGGCC GGATAACGGA AGCCGCCTGT TGGGCTCACG CCCGCCGTAA AATCCACGAT
GTGCACGTTC GCACCCCGTC AGCCCTGACG GAGGAAGCGC TGAAACGGAT CGGCGAACTG
TACGCCATCG AGGCAGAGAT AAGGGGAATG ACGGCGGAGC AGCGCCTTGC CGAACGTCAG
TTGAAAACGA AACCGCTGCT GAAATCCCTG GAAAGCTGGC TGCGTGAAAA GATGAAAACC
CTGTCGCGAC ACTCAGAACT GGCGAAAGCG TTCGCATACG CCCTGAAGTG GTCAACAAAA
ACTGGCCACC GAGTTAGAGT TTTTCCAGTA TCGATTTTCC GATTCGTTTG GGGGTAA
 
Protein sequence
MSGSLPDDIN ALKRLLAEQE ALNRALLEKL NEREREIDHL QAQLDKLRRM NFGSRSEKVS 
RRIAQMEADL KALQKESDTL TGRVDDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAASCC
PECGGSLSYL GEDAAEQLEL MRSVFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP
GLLARVLISK YAEHTPLYRQ SEMYGRQGVE LSRSLLSGWV DACCRLLSPL EEALQDYVLT
DGKLHADDTP VPVLLPGNKK TKTGRLWTYV RDDRNAGSTL APAVWFAYSP DRKGIHPQTH
LAGFSGVLQA DAYAGFNELY RDGRITEAAC WAHARRKIHD VHVRTPSALT EEALKRIGEL
YAIEAEIRGM TAEQRLAERQ LKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALKWSTK
TGHRVRVFPV SIFRFVWG