Gene SbBS512_E3899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3899 
Symbol 
ID6270021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3626317 
End bp3627909 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content54% 
IMG OID641727754 
ProductIS66 family element, transposase 
Protein accessionYP_001882188 
Protein GI187730419 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAA AATACCTCAT TCGCATTGCA GAACTGGAAT GCCAGCTCCG TCAGAAAGAC 
CAGCAACTGA GTCTGGTTGA AGAGACGGAG GCCTTCCTGC GCTCTGCACT GGCCCGCGCC
GAAGAAAAGA TCGAAGAAGA TGAACGGGAA ATAGAACATC TGCGGGCTCA GATAGAAAAA
CTGCGCCGGA TGCTGTTCGG TACCCGTTCT GAAAAACTGC GTCGTGAAGT TGAACAGGCT
GAGGCCCTGC TGAAACAACG CGAACAGGAC AGTGATCGTT ACAGTGGGCG GGAAGACGAT
CCGCAGGTTC CCCGCCAGTT GCGACAGTCT CGTCATCGTC GCCCGTTACC GGAGCATCTG
CCCCGCGAAA TAAATCGCCT GGAGCCAGAA GAAAGCTGTT GCCCGGAGTG TGGCGGTGAG
CTGGATTATC TGGGGGAAGT CAGCGCAGAA CAACTGGAAC TGGTGAGCAG CGCTCTGAAA
GTGATCCGCA CAGAACGGGT AAAAAAAGCC TGTACAAAAT GTGACTGCAT CGTTGAAGCA
CCGGCACCAT CCCGTCCGAT AGAGCGTGGT ATCGCGGGCC CGGGGTTACT TGCCCGCGTG
TTAACGGGAA AATACTGCGA ACACCTGCCA CTGTATCGTC AGAGTGAAAT TTTTGCCCGT
CAGGGTGTCG AACTGAGCCG TGCATTACTC TCCAACTGGG TTGACGCGTG CTGCCAGTTA
ATGACGCCGC TGAATGATGC TCTGTACCGT TATGTGATGA ACAGCCGCAA AGTTCACACT
GATGACACAC CAGTAAAAGT GCTGGCACCG GGCAGGAAGA AGGCGAAAAC AGGATATATC
TGGACGTATG TCCGGGATGA CAGGAATGCC GGTTCGCCAG AGCCTCCGGC GGTCTGGTTC
GCCTACTCAC CGGACCATCA GGGTAAACAT CCGGAGCAGC ACCTTAGTCC CTTCCGGGGT
ATCCTGCAGG CAGATGCGTT TAATGGTTAC GATCGGCTGT TCAGTGCCGA ACGAGAAGGC
GGCGCGTTGA CGGAAGCAGG ATGCTGGGCT CATGCGCGGC GCAAAGTCCA CGATGTATAT
ATCAGTACCA AAAGCGCGAC AGCGGAAGAA GCCCTGAAAC TAATCGGTGA GCTGTACGCC
ATCGAGCACG AAATACGCGG GTTGCCGGTG TCTGAACGCC TGGCGGTCAG GCAAATGCAG
AGTAAACCGC TACTGACTTC CCTGTATAAG CTTATGCAGG AGAAAGAACA CACGTTATCG
AAAAAATGCC GTCTGAGAGA TGCGTTCCGG TATATCAGGA AGCACTGGGT TGCGTTGTGC
AACTTCAGTG ATGATGGTCT GGCTGAGGCG GATAATAATG CCGCGGAAAG AGCGCTTCGT
GCAGTCTGTC TCGGAAAGAA AAACTTTATG TTCTTCGGCA GCGATCACGG TGGAGAGCGT
GGTGCGCTAC TGTACGGGCT GATCGGCACC TGCCGACTGA ACGGTATCGA TCCGGAAGCG
TATCTGCGCT ATATCCTGAG CGTACTGCCG GAATGGCCTT CCAACCGTGT TGACGAACTC
CTGCCATGGA ACGTAGCACT CACCAATAAA TAA
 
Protein sequence
MNQKYLIRIA ELECQLRQKD QQLSLVEETE AFLRSALARA EEKIEEDERE IEHLRAQIEK 
LRRMLFGTRS EKLRREVEQA EALLKQREQD SDRYSGREDD PQVPRQLRQS RHRRPLPEHL
PREINRLEPE ESCCPECGGE LDYLGEVSAE QLELVSSALK VIRTERVKKA CTKCDCIVEA
PAPSRPIERG IAGPGLLARV LTGKYCEHLP LYRQSEIFAR QGVELSRALL SNWVDACCQL
MTPLNDALYR YVMNSRKVHT DDTPVKVLAP GRKKAKTGYI WTYVRDDRNA GSPEPPAVWF
AYSPDHQGKH PEQHLSPFRG ILQADAFNGY DRLFSAEREG GALTEAGCWA HARRKVHDVY
ISTKSATAEE ALKLIGELYA IEHEIRGLPV SERLAVRQMQ SKPLLTSLYK LMQEKEHTLS
KKCRLRDAFR YIRKHWVALC NFSDDGLAEA DNNAAERALR AVCLGKKNFM FFGSDHGGER
GALLYGLIGT CRLNGIDPEA YLRYILSVLP EWPSNRVDEL LPWNVALTNK