Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0801 |
Symbol | |
ID | 6272407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 751109 |
End bp | 752359 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724977 |
Product | nucleoside transporter, NupC family |
Protein accession | YP_001879504 |
Protein GI | 187734164 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG TCAGTGAATA AAAAGAGCAT CAGTTTGCGC ACGGTTGGAG CCGCACTGCT GCTGCAAATC GCTATTGGTG GCATCATGCT CTACTTCCCA CCGGGAAAGT GGGCAGTAGA ACAGGCGGCA TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG CTGGTTGGGC CGAAAATGGA TGTCCTGTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC GTACTTCCGG CGATTATTTT CGTTACTGCG CTCATCAGTC TGCTGTACTA CATTGGCGTG ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCTCTTAA CATCAGCAAA ATCGAATCTT TTGTCGCAGT CACAACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC TACCTGTTAG CGGCATCGCT GATGGCGATC CCAGGTGGTA TTTTGTTTGC ACGTATTCTT AGCCCGGCCA CCGAGCCTTC GCAGGTCACA TTTGAAAATC TGTCGTTCAG CGAAACGCCG CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT GGTGTAGCGA CGGTGGTAAT GGCGTTTGTC GCAATTATTG CGCTGATCAA CGGCATTATC GGCGGAATTG GTGGCTGGTT TGGTTTCGCC AATGCCTCTC TGGAAAGTAT TTTTGGCTAT GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA TCTTGCGGGT AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GCGCATTTTC GGCTATTTCG CCAAAACGCG CGCCGGAAAT CGCCCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT TCCAACCTGA TGAGTGCGAC TATTGCCGGG TTCTTTATTG GTTTAGCTTG A
|
Protein sequence | MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
|
| |