Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3200 |
Symbol | proX |
ID | 6272788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2985629 |
End bp | 2986621 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727115 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_001881569 |
Protein GI | 187733067 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT GCTGCCGATC TGCCGGGCAA AGGTATTACT GTTAATCCAG TTCAAAGTAC CATCACCGAA GAAACCTTCC AGACGCTGCT GGTCAGCCGT GCGCTGGAGA AATTAGGCTA TACCGTCAAC AAACCCAGCG AAGTAGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC GCCAAACTGT TCGATACCAA CGGCGACGGT AAAGCGGATT TAACCGGTTG TAACCCTGGC TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CAACACCGTG ACGCATAATC AGGGGAACTA CGCAGCGATG ATGGCCGACA CCATCAGTCG CTACAAAGAG GGCAAACCGG TGTTTTATTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAATGCC GATACCAAAC TGCCGAATGG CGCGAATTAC GGCTTCCCGG TCAGCACCAT GCATATCGTT GCCAACAAAG CCTGGACCGA GAAAAATCCG GCGGCAGCCA AACTGTTTGC CATTATGCAG TTGCCAGTGG CCGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA GGCGATATTC AGGGCCACGT TGATGGCTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC TGGGTGAATG AGGCGCTGGC AGCGCAGAAG TAA
|
Protein sequence | MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTNTV THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA DTKLPNGANY GFPVSTMHIV ANKAWTEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE GDIQGHVDGW IKAHQQQFDG WVNEALAAQK
|
| |