Gene Information |
Locus tag | EcSMS35_2801 |
Symbol | proX |
ID | 6144425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2882619 |
End bp | 2883611 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617670 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_001744830 |
Protein GI | 170681945 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
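The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 2882619
END_BP = 2883611


def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Return the amino acid count for a CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 993
    print(protein_length(length))  # 330
```

Both values agree with the Gene Length (993 bp) and Protein Length (330 aa) rows.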
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.56693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0222237 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT GCTGCCGATC TGCCGGGCAA AGGCATTACT GTTAATCCAG TTCAGAGTAC CATCACTGAA GAAACCTTCC AGACGCTGCT GGTCAGTCGT GCACTGGAGA AATTAGGTTA TACCGTCAAT AAACCCAGCG AAGTGGATTA CAACGTTGGC TACACCTCAC TTGCTTCCGG CGATGCAACC TTCACCGCCG TGAACTGGAC GCCGCTGCAT GACAACATGT ATGAAGCTGC CGGTGGCGAT AAGAAATTTT ATCGTGAAGG AGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT AAGAAAACCG CCGACCAGTA TAAAATTACC AACATCGCGC AACTTAAAGA TCCGAAGATC GCCAAACTGT TCGATACCAA CGGCGACGGT AAAGCGGATT TAACCGGCTG TAACCCAGGC TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGGACTGAC CAATACCGTG ACGCATAATC AGGGGAACTA CGCAGCGATG ATGGCTGACA CCATCAGTCG TTACAAAGAG GGTAAGCCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAACGA ACTGAAACCG GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAATGCC GATACCAAAC TGCCGAATGG CGCGAATTAC GGCTTCCCGG TCAGTACCAT GCATATCGTT GCCAACAAAG CCTGGGCTGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG TTACCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA GGCGATATTC AGGGCCACGT TGATGGTTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC TGGGTGAATG AAGCGCTGGC AGCGCAGAAG TAA
|
Protein sequence | MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYGLTNTV THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE GDIQGHVDGW IKAHQQQFDG WVNEALAAQK
|
| |
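The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the coding sequence. It assumes only the standard library; the helper names are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in the record above.
    excerpt = clean_sequence("ATGCGACATA GCGTACTTTT TGCGACAGCG")
    print(translate(excerpt))  # MRHSVLFATA
```

Running the same functions on the full gene sequence is one way to cross-check the GC content and protein sequence rows.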