Gene SbBS512_E1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1691 
Symbol 
ID6271405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1539077 
End bp1540705 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content55% 
IMG OID641725772 
Producttransposase (IS4 family) 
Protein accessionYP_001880270 
Protein GI187730252 
COG category[L] Replication, recombination and repair 
COG ID[COG3385] FOG: Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0674671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTCCGG ATTTTTTTAT GCACATTGGA CAGGCTCTTG ATCTGGTATC CCGTTACGAT 
TCTCTGCGTA ACCCACTGAC TTCTCTGGGG GATTACCTCG ACCCCGAACT CATCTCTCGT
TGCCTTGCCG AATCAGGTAC TGTAACGCTA CGCAAGCGCC GCCTTCCCCT CGAAATGATG
GTCTGGTGTA TTGTTGGCAT GGCGCTTGAG CGTAAAGAAC CTCTTCACCA GATTGTGAAT
CGCCTGGACA TCATGCTGCC GGGCAATCGC CCCTTCGTTG CCCCCAGTGC CGTTATTCAG
GCCCGCCAGC GCCTGGGAAG TGAGGCTGTC CGCCGCGTGT TCACGAAAAC AGCGCAGCTC
TGGCATAACG CCACGCCGCA TCCGCACTGG TGCGGCCTGA CCCTGCTGGC CATCGATGGT
GTGTTCTGGC GCACACCGGA TACACCAGAG AACGATGCAG CCTTCCCCCG CCAGACACAT
GCCGGGAACC CGGCGCTCCA CCCGCAGGTC AAAATGGTCT GCCAGATGGA ACTGACCAGC
CATCTGCTGA CGGCTGCAGC CTTCGGCACG ATGAAGAACA GCGAAAATGA GCTTGCTGAG
CAACTTATAG AACAAACCGG CGATAACACT CTGACGTTAA TGGATAAAGG TTATTACTCA
CTGGGACTGT TAAATGCCTG GAGCCTGGCG GGAGAACACC GCCACTGGAT GATACCTCTC
AGAAAGGGAG CGCAATATGA AGAGCTCAGA AAACTGGGTA AAGGCGATCA TCTGGTGAAG
CTGAAAACCA GCCCGCAGGC ACGAAAAAAG TGGCCGGGAC TGGGAAATGA AGTGACAGCC
CGCCTGCTGA CCGTGACGCG CAAAGGAAAA GTCTGCCATC TGCTGACGTC GATGACGGAC
GCCATGCGCT TCCCCGGAGG AGAAATGGCG GATCTGTACA GTCATCGCTG GGAAATTGAA
CTGGGATACA GGGAGATAAA ACAGACGATG CAACTGAGCA GGCTGACGCT GAGAAGTAAA
AAGCCGGAGC TTGTGGAGCA AGAGCTGTGG GGTGTCTTAC TGGCTTATAA TCTGGTGAGA
TATCAGATGA TTAAAATGGC GGAAAGTGGC GCAGTAGACT GTGACGTTTT TTTTGACGAC
AGGGACCAGG CAGTCCCCTA CACAGCCACC GCTGATGATG TCGCTCCGAC GGGTCAGCAA
ATCTGGCAGG AACTGCAAAG CGGCAAATGG GGTGAGATAG CCCCATTCAC TGTGACACCA
GAAATGCTGG AAGCGGCCAG AGAGGCCAGA CGTCAGGAAA TTGAAGCATG GCGCGCAGAA
CAGGAGGCGA AGCCGTTCAC GTTTGAATGG AACGGTCGTA TCTGGAATGC TGGTCCCGAC
TCACTGGGCC GCCTGTCCCC GGTAGTCATG CTGGCAAAAT CTGTCACAGC ACAAACACAT
ATGGCGTGGA GCGATGCCGA TAATCAGCAG GTGAAACTGT CGATGCCGGA ACTGGAAGAA
CTGGCGGCAG CAATGGTGCA GGCGCAGGTC GATCGCAACG ATGAGATTTA TCGCCGTCAG
CGTGAAATGA AAGAGGAGCT GAGCGGTCTG GATGATTTGG CTTCAATTCG GGCGTTTGAC
GTTGAGTAA
 
Protein sequence
MFPDFFMHIG QALDLVSRYD SLRNPLTSLG DYLDPELISR CLAESGTVTL RKRRLPLEMM 
VWCIVGMALE RKEPLHQIVN RLDIMLPGNR PFVAPSAVIQ ARQRLGSEAV RRVFTKTAQL
WHNATPHPHW CGLTLLAIDG VFWRTPDTPE NDAAFPRQTH AGNPALHPQV KMVCQMELTS
HLLTAAAFGT MKNSENELAE QLIEQTGDNT LTLMDKGYYS LGLLNAWSLA GEHRHWMIPL
RKGAQYEELR KLGKGDHLVK LKTSPQARKK WPGLGNEVTA RLLTVTRKGK VCHLLTSMTD
AMRFPGGEMA DLYSHRWEIE LGYREIKQTM QLSRLTLRSK KPELVEQELW GVLLAYNLVR
YQMIKMAESG AVDCDVFFDD RDQAVPYTAT ADDVAPTGQQ IWQELQSGKW GEIAPFTVTP
EMLEAAREAR RQEIEAWRAE QEAKPFTFEW NGRIWNAGPD SLGRLSPVVM LAKSVTAQTH
MAWSDADNQQ VKLSMPELEE LAAAMVQAQV DRNDEIYRRQ REMKEELSGL DDLASIRAFD
VE