Gene SbBS512_E4108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4108 
Symbol 
ID6272551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3834740 
End bp3836179 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content49% 
IMG OID641727938 
Productputative transporter 
Protein accessionYP_001882370 
Protein GI187732687 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTACA AAATGCGTTA TCGGTCTCAG CACCCCTATA GCATCAAGGA AAAGCAGATG 
AAGAGTGAAG TGTTGTCCGT TAAAGAGAAA ATTGGTTATG GCATGGGAGA CGCCGCCAGC
CACATTATTT TCGATAACGT AATGTTATAT ATGATGTTCT TTTATACCGA TATTTTTGGC
ATTCCTGCCG GTTTTGTCGG AACCATGTTT TTGGTCGCTC GTGCACTGGA TGCGATTTCC
GATCCTTGCA TGGGGTTGTT GGCCGATCGA ACGCGCTCTC GCTGGGGTAA ATTTCGTCCG
TGGGTACTGT TTGGCGCACT GCCATTCGGG ATCGTCTGTG TACTGGCCTA TAGCACGCCA
GATCTCAGTA TGAACGGCAA AATGATCTAT GCAGCAATTA CTTACACCCT ACTTACCTTA
CTTTATACCG TCGTCAATAT CCCTTACTGC GCATTGGGTG GTGTAATCAC CAATGACCCG
ACTCAGCGTA TCTCGCTGCA ATCCTGGCGT TTTGTGCTGG CGACCGCGGG AGGCATGCTT
TCTACTGTTC TGATGATGCC ACTGGTTAAT TTAATTGGCG GTGATAATAA ACCACTCGGT
TTCCAGGGCG GTATCGCGGT CCTTTCCGTG GTGGCATTCA TGATGCTGGC ATTTTGTTTC
TTCACCACTA AAGAACGCGT TGAAGCACCA CCTACAACAA CGTCTATGCG GGAAGATTTA
CGTGATATCT GGCAAAACGA CCAGTGGCGG ATTGTCGGTT TACTAACCAT TTTCAATATC
CTGGCGGTGT GCGTACTCGG TGGGGCGATG ATGTATTACG TCACATGGAT TTTGGGCACG
CCGGAAGTGT TTGTCGCTTT TCTCACCACT TATTGCGTGG GTAACCTGAT TGGTTCCGCA
CTGGCAAAAC CTCTGACCGA CTGGAAATGT AAAGTCACTA TCTTCTGGTG GACGAACGCC
CTGCTGGCAG TGATTAGCCT CGCGATGTTC TTTGTTCCCA TGCAGGCCAG CATCACTATG
TTTGTCTTCA TCTTCGTGAT TGGTGTGTTG CATCAACTGG TGACACCTAT CCAGTGGGTA
ATGATGTCCG ATACCGTCGA CTACGGCGAG TGGTGCAATG GTAAACGCCT GACCGGGATC
AGTTTTGCTG GCACGCTGTT TGTGCTCAAA CTGGGGTTGG CCTTCGGCGG CGCTCTTATC
GGCTGGATGC TGGCTTATGG CGGATATGAT GCGGCAGAAA AAGCGCAGAA CAGCGCCACG
ATTAGCATCA TTATTGCGCT ATTCACGATT GTTCCGGCGA TCTGTTATTT GCTGAGCGCG
ATTATCGCTA AACGCTACTA CTCACTCACG ACGCACAATC TGAAAACCGT TATGGAACAG
CTGGCTCAGG GTAAACGCCG TTGCCAGCAA CAATTCACCT CTCAAGAAGT GCAGAACTAA
 
Protein sequence
MVYKMRYRSQ HPYSIKEKQM KSEVLSVKEK IGYGMGDAAS HIIFDNVMLY MMFFYTDIFG 
IPAGFVGTMF LVARALDAIS DPCMGLLADR TRSRWGKFRP WVLFGALPFG IVCVLAYSTP
DLSMNGKMIY AAITYTLLTL LYTVVNIPYC ALGGVITNDP TQRISLQSWR FVLATAGGML
STVLMMPLVN LIGGDNKPLG FQGGIAVLSV VAFMMLAFCF FTTKERVEAP PTTTSMREDL
RDIWQNDQWR IVGLLTIFNI LAVCVLGGAM MYYVTWILGT PEVFVAFLTT YCVGNLIGSA
LAKPLTDWKC KVTIFWWTNA LLAVISLAMF FVPMQASITM FVFIFVIGVL HQLVTPIQWV
MMSDTVDYGE WCNGKRLTGI SFAGTLFVLK LGLAFGGALI GWMLAYGGYD AAEKAQNSAT
ISIIIALFTI VPAICYLLSA IIAKRYYSLT THNLKTVMEQ LAQGKRRCQQ QFTSQEVQN