Gene SbBS512_E0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0801 
Symbol 
ID6272407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp751109 
End bp752359 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID641724977 
Productnucleoside transporter, NupC family 
Protein accessionYP_001879504 
Protein GI187734164 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGC ACGGTTGGAG CCGCACTGCT GCTGCAAATC
GCTATTGGTG GCATCATGCT CTACTTCCCA CCGGGAAAGT GGGCAGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTTGGGC CGAAAATGGA TGTCCTGTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTTCCGG CGATTATTTT CGTTACTGCG CTCATCAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCTCTTAA CATCAGCAAA
ATCGAATCTT TTGTCGCAGT CACAACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG
ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTAG CGGCATCGCT GATGGCGATC CCAGGTGGTA TTTTGTTTGC ACGTATTCTT
AGCCCGGCCA CCGAGCCTTC GCAGGTCACA TTTGAAAATC TGTCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT
GGTGTAGCGA CGGTGGTAAT GGCGTTTGTC GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GTGGCTGGTT TGGTTTCGCC AATGCCTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA TCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GCGCATTTTC GGCTATTTCG
CCAAAACGCG CGCCGGAAAT CGCCCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGTGCGAC TATTGCCGGG TTCTTTATTG GTTTAGCTTG A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA