Gene SbBS512_E0805 details


Gene Information

Locus tag: SbBS512_E0805
Symbol:
ID: 6271421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 754016
End bp: 755263
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 51%
IMG OID: 641724981
Product: nucleoside transporter, NupC family
Protein accession: YP_001879508
Protein GI: 187731113
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
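
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1
```

With the values in this record, gene_length_bp(754016, 755263) gives 1248 and protein_length_aa(1248) gives 415, matching the Gene Length and Protein Length fields.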


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GGATTATGCT TTGGTTACCG GCAGGGCGTT GGATCGCTGA AAAAGTCGCT
TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG
GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG
ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC
GTCAAACCCT TTATCGATCA TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC
GGTGTAGCGA CAGTTGTTAT GGCAGTCGCC ATCATTGCGT TAATTAATGG TATTATCGGC
GGCGTTGGCG GCTGGTTTGG TTTTGCACAT GCCTCGCTGG AGTCCATTTT AGGTTACCTG
TTGGCCCCAT TGGCGTGGGT GATGGGGGTT GACTGGAGTG ATGCAAATCT TGCCGGGAGT
TTGATTGGGC AGAAGCTGGC GATCAATGAA TTTGTCGCTT ATCTCAATTT CTCGCCATAT
CTGCAAACGG GTGGCACTCT GGATGCTAAA ACCGTGGCGA TTATTTCTTT CGCGTTGTGC
GGTTTCGCTA ACTTTGGTTC TATCGGGGTG GTGGTGGGGG CGTTTTCTGC GGTTGCGCCA
CACCGTGCGC CGGAAATCGC CCAACTTGGT TTACGCGCGC TGGCGGCGGC GACACTTTCT
AACCTGATGA GTGCTACTAT TGCAGGATTC TTTATTGGTT TAGCGTAG
 
Protein sequence
MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP AGRWIAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDHLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GVATVVMAVA IIALINGIIG GVGGWFGFAH ASLESILGYL
LAPLAWVMGV DWSDANLAGS LIGQKLAINE FVAYLNFSPY LQTGGTLDAK TVAIISFALC
GFANFGSIGV VVGAFSAVAP HRAPEIAQLG LRALAAATLS NLMSATIAGF FIGLA
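
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the GC calculation assumes only unambiguous A, C, G, T bases.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (A/C/G/T assumed)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG"
last_line = "AACCTGATGA GTGCTACTAT TGCAGGATTC TTTATTGGTT TAGCGTAG"
first = join_blocks(first_line)
last = join_blocks(last_line)
```

Joining all 21 printed lines the same way should give a single string whose length equals the Gene Length field, and round(gc_fraction(seq) * 100) can then be compared against the reported GC content.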