Gene SbBS512_E4229 details

Gene Information

Locus tag: SbBS512_E4229
Symbol:
ID: 6270559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3954189
End bp: 3955388
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 57%
IMG OID: 641728049
Product: IS1294, transposase
Protein accession: YP_001882470
Protein GI: 187730874
COG category:
COG ID:
TIGRFAM ID:
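
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the coding sequence ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3954189, 3955388)
print(length)                  # 1200
print(protein_length(length))  # 399
```

Both values agree with the Gene Length and Protein Length fields of this record.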


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGTCCG CTTTTACTCC CCGCCCGCTG AAACGTCTGT TCACGGCCAA CCAGTGCTGG 
ACATCCTTCC TGGATGCGGG CGGTCTGCGC GATATCGGGG TTGAGGCTGT CACAAAAATG
CTGGCCTGCG GCACGCGGAT ACTGGGAGTG AAGGAATACA TCTGCGATAA ACCTGAGTGC
CCCCACGTCA GATACGTCAC TAACTCATGC GGCAGCCGTG CCTGCCCGTC CTGCGGAAAA
AAGGCCACAG ACCTGTGGAT AGCGACACAG CTGAATCGTC TTCCTGACTG CGACTGGGTA
CACCTGGTCT TCACCCTGCC GGACACGCTG TGGCCGGTGT TCGAAAGCAA CCGGTGGCTG
CTGAATGACG TGTGCCGTCT GGCGGTGGAG AATCTGCTGT ATGCCGCCCG GAAACGGGGG
CAGGAACCCG GTATCTTCTG CGCCATCCAC ACGTATGGCC GTCGTCTCAA CTGGCATCCG
CATGTACATG TGTCTGTAAC CTGTGGAGGT CTGAATAAGC ATGGTCAGTG GAAAAAGCTG
AGCTTCCTGA AAGACGCGAT GCGTTCACGG TGGATGTGGA ATATGCGGCA GCTGCTTCTG
AAAGCGTGGT CAGAGGGGCT GGCGATGCCG GAGTCGTTGT CACATATCAC GACGGAATCA
CAGTGGAGAA GCCTGGTGCT GAAAGCCGGA GGAAAATACT GGCATGTGTA CATGTCGAAA
AAAACGGCCG GCGGGAGGAA TACGGCGCGC TACCTGGGTC GTTATCTGAA GAAGCCGCCG
ATAGCGGCCT CCCGGCTGGC ACATTACAAC GTAGGGGCGA GCCTGAACTT CCGTTACCTG
GACCACAAAA CGGGAGAAAC GGCGACGGAA ACGCTGACAC AGCGTGAGCT GGTCGCGAGG
CTGAAACAGC ACATCCCGGA GAAGTTTTTT AAGATGGTGA GGTACTTCGG GTTCCTTGCC
AACCGGGTGT GTGGAGAGAA GCTGCCGCAG GTGTACCGTG CTCTGGGGAT GGATAAACAG
GAACCAGTGG CGAAAGTGTG CTATGCACAA ATGGTGAAAC AGTTCCTGAG TCGTGACCCG
TTCGAATGCG TGCTGTGTGG CGGCCGGATG GTATACCGCC GGGCCATCGC GGGACTGAAT
GTGGAGGGGC TGAAGAAAAA CGCGCGGGAT ATCAGTCTGC TGAGGTATAT GCCGGCCTGA
 
Protein sequence
MLSAFTPRPL KRLFTANQCW TSFLDAGGLR DIGVEAVTKM LACGTRILGV KEYICDKPEC 
PHVRYVTNSC GSRACPSCGK KATDLWIATQ LNRLPDCDWV HLVFTLPDTL WPVFESNRWL
LNDVCRLAVE NLLYAARKRG QEPGIFCAIH TYGRRLNWHP HVHVSVTCGG LNKHGQWKKL
SFLKDAMRSR WMWNMRQLLL KAWSEGLAMP ESLSHITTES QWRSLVLKAG GKYWHVYMSK
KTAGGRNTAR YLGRYLKKPP IAASRLAHYN VGASLNFRYL DHKTGETATE TLTQRELVAR
LKQHIPEKFF KMVRYFGFLA NRVCGEKLPQ VYRALGMDKQ EPVAKVCYAQ MVKQFLSRDP
FECVLCGGRM VYRRAIAGLN VEGLKKNARD ISLLRYMPA
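
The sequences above are printed in space-separated blocks of ten characters across several lines. As a hedged sketch of how such a listing could be turned back into a single string and checked against the length and GC content fields, the helpers below strip the whitespace and compute the G+C fraction. The function names and the toy input are illustrative assumptions, not part of the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked, multi-line sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same blocked layout (not the gene sequence itself).
example = "ATGC GGCC\nTTAA"
seq = clean_sequence(example)
print(len(seq))          # 12
print(gc_fraction(seq))  # 0.5
```

Applied to the gene sequence listing, `len(clean_sequence(...))` would be expected to match the Gene Length field, and `gc_fraction` could be compared with the rounded GC content value.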