Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3011 |
Symbol | proX |
ID | 6484236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2934921 |
End bp | 2935916 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642738327 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_002042056 |
Protein GI | 194443724 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.00437042 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACATA CTGTGATATT TGCCTCAGCG TTTGCCACCC TTGTCACCGC CAGCGCTTTC GCTGCCGACC TGCCTGGCAA AGGCATTACC GTTCAACCTA TCCAAAGCAC GATTTCTGAA GAGTCCTTCC AGACGTTACT GGTCAGCCGG GCGCTGGAAA AGTTGGGTTA TACCGTAAAT AAGCCGAGTG AAGTCGATTA CAACGTAGGC TACACCTCCA TCGCCTCTGG CGATGCGACG TTTACCGCCG TGAACTGGCA GCCGCTGCAT GATGATATGT ATGCCGCAGC AGGCGGAGAC AAAAAATTTT ACCGCGAGGG CGTCTTCGTC TCCGGCGCTG CGCAGGGCTA TCTGATCGAT AAAAAAACCA CCGAGCAGTA CAACATCACC AATATCGCTC AGCTAAAAGA TCCGAAAATC GCCAAAATCT TCGACACCAA TGGCGACGGA AAAGCGGACA TGATGGGCTG CTCGCCGGGC TGGGGCTGCG AAGCCGTGAT CAATCACCAG AATAAAGCGT TTGATCTGCA AAAAACCGTT GAGGTCAGCC ACGGTAACTA TGCGGCGATG ATGGCTGATA CCATTACCCG TTTTAAAGAA GGCAAGCCGG TGCTGTATTA CACCTGGACC CCGTACTGGG TGAGCGACGT AATGAAGCCG GGTAAAGATG TCGTCTGGCT ACAGGTGCCG TTCTCCTCCC TGCCGGGCGA ACAGAAAAAT ATTGATACTA AACTGCCGAA CGGCGCGAAC TATGGGTTCC CGGTTAATAC TATGCATATT GTCGCCAATA AGGCATGGGC GGAGAAAAAC CCGGCGGCGG CGAAACTGTT CGCCATCATG AAGCTGCCGC TGGCGGATAT CAACGCGCAG AACGCCATGA TGCATGCCGG TAAATCGTCT GAAGCCGATG TTCAGGGCCA CGTAGACGGC TGGATCAACG CCCACCAGCA GCAGTTTGAC GGCTGGGTGA AAGAAGCGCT GGCCGCGCAG AAATAA
|
Protein sequence | MRHTVIFASA FATLVTASAF AADLPGKGIT VQPIQSTISE ESFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSIASGDAT FTAVNWQPLH DDMYAAAGGD KKFYREGVFV SGAAQGYLID KKTTEQYNIT NIAQLKDPKI AKIFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLQKTV EVSHGNYAAM MADTITRFKE GKPVLYYTWT PYWVSDVMKP GKDVVWLQVP FSSLPGEQKN IDTKLPNGAN YGFPVNTMHI VANKAWAEKN PAAAKLFAIM KLPLADINAQ NAMMHAGKSS EADVQGHVDG WINAHQQQFD GWVKEALAAQ K
|
| |