Gene SbBS512_E1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1059 
Symbol 
ID6272936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp970755 
End bp971960 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content55% 
IMG OID641725198 
Productputative inner membrane protein 
Protein accessionYP_001879717 
Protein GI187731594 
COG category[R] General function prediction only 
COG ID[COG2391] Predicted transporter component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.000178842 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATGGC AGCAATTCAA ACACGCCTGG TTGATTAAAT TCTGGGCACC CATCCCTGCG 
GTCATCGCGG CGGGTATTCT CTCCACTTAC TATTTTGGCA TTACTGGCAC CTTTTGGGCT
GTCACGGGTG AATTTACCCG TTGGGGCGGC CAGCTCCTGC AGCTGTTTGG CGTCCATGCT
GAAGAGTGGG GTTATTTTAA AATTATCCAT CTGGAAGGAT CGCCATTAAC CCGCATCGAC
GGGATGATGA TCCTCGGCAT GTTTGGCGGC TGCTTTGCCG CAGCGCTGTG GGCCAACAAT
GTCAAACTGC GCATGCCACG CAGCCGTATC CGCATTATGC AGGCCATTAT TGGCGGTATT
ATCGCTGGTT TTGGCGCGCG TCTGGCAATG GGCTGTAACC TGGCGGCGTT CTTTACCGGT
ATTCCTCAGT TCTCGCTGCA TGCCTGGTTC TTTGCCATCG CCACTGCCAT TGGTTCATGG
TTTGGCGCAC GCTTTACCCT GCTGCCCATC TTCCGTATTC CCGTGAAAAT GCAGAAAGTT
TCTGCCGCCT CACCGCTGAC GCAAAAACCG GATCAGGCGC GGCGTCGTTT TCGTCTCGGG
ATGCTGGTCT TTTTCGGTAT GCTGGGCTGG GCGCTGCTCA CGGCGATGAA CCAACCAAAA
CTGGGGCTGG CAATGCTGTT TGGCGTCGGC TTTGGTTTAC TGATTGAACG TGCGCAAATC
TGCTTTACTT CGGCGTTCCG CGATATGTGG ATCACCGGAC GTACCCATAT GGCGAAAGCA
ATCATTATCG GTATGGCGGT GAGTGCCATC GGGATCTTCA GTTACGTACA GTTAGGCGTT
GAACCCAAAA TCATGTGGGC GGGACCAAAC GCGGTAATTG GTGGTTTACT GTTTGGTTTT
GGCATCGTGC TGGCAGGCGG CTGCGAAACC GGCTGGATGT ACCGCGCGGT AGAAGGCCAG
GTGCACTACT GGTGGGTCGG TCTGGGCAAT GTGATCGGCT CGACGATTCT GGCGTATTAC
TGGGATGATT TCGCTCCGGG GCTGGCCACC GACTGGGACA AAATCAACCT GCTGAAAACC
TTTGGCCCGA TGGGTGGCCT GCTGGTGACA TATTTGCTGT TGTTTACTGC GCTGATGTTG
ATTATCGGCT GGGAAAAACG CTTCTTCCGC CGTGCGGCAC CGCAGACTGC TAAGGAGATC
GCATGA
 
Protein sequence
MSWQQFKHAW LIKFWAPIPA VIAAGILSTY YFGITGTFWA VTGEFTRWGG QLLQLFGVHA 
EEWGYFKIIH LEGSPLTRID GMMILGMFGG CFAAALWANN VKLRMPRSRI RIMQAIIGGI
IAGFGARLAM GCNLAAFFTG IPQFSLHAWF FAIATAIGSW FGARFTLLPI FRIPVKMQKV
SAASPLTQKP DQARRRFRLG MLVFFGMLGW ALLTAMNQPK LGLAMLFGVG FGLLIERAQI
CFTSAFRDMW ITGRTHMAKA IIIGMAVSAI GIFSYVQLGV EPKIMWAGPN AVIGGLLFGF
GIVLAGGCET GWMYRAVEGQ VHYWWVGLGN VIGSTILAYY WDDFAPGLAT DWDKINLLKT
FGPMGGLLVT YLLLFTALML IIGWEKRFFR RAAPQTAKEI A