Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0805 |
Symbol | |
ID | 6271421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 754016 |
End bp | 755263 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724981 |
Product | nucleoside transporter, NupC family |
Protein accession | YP_001879508 |
Protein GI | 187731113 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC GTGATTGGCG GGATTATGCT TTGGTTACCG GCAGGGCGTT GGATCGCTGA AAAAGTCGCT TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC GTCAAACCCT TTATCGATCA TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC GGTGTAGCGA CAGTTGTTAT GGCAGTCGCC ATCATTGCGT TAATTAATGG TATTATCGGC GGCGTTGGCG GCTGGTTTGG TTTTGCACAT GCCTCGCTGG AGTCCATTTT AGGTTACCTG TTGGCCCCAT TGGCGTGGGT GATGGGGGTT GACTGGAGTG ATGCAAATCT TGCCGGGAGT TTGATTGGGC AGAAGCTGGC GATCAATGAA TTTGTCGCTT ATCTCAATTT CTCGCCATAT CTGCAAACGG GTGGCACTCT GGATGCTAAA ACCGTGGCGA TTATTTCTTT CGCGTTGTGC GGTTTCGCTA ACTTTGGTTC TATCGGGGTG GTGGTGGGGG CGTTTTCTGC GGTTGCGCCA CACCGTGCGC CGGAAATCGC CCAACTTGGT TTACGCGCGC TGGCGGCGGC GACACTTTCT AACCTGATGA GTGCTACTAT TGCAGGATTC TTTATTGGTT TAGCGTAG
|
Protein sequence | MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP AGRWIAEKVA FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDHLNR NELFTAICSG MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP PKSIIEAAAT GAMTGLKIAA GVATVVMAVA IIALINGIIG GVGGWFGFAH ASLESILGYL LAPLAWVMGV DWSDANLAGS LIGQKLAINE FVAYLNFSPY LQTGGTLDAK TVAIISFALC GFANFGSIGV VVGAFSAVAP HRAPEIAQLG LRALAAATLS NLMSATIAGF FIGLA
|
| |