Gene Information |
Locus tag | SeD_A3117 |
Symbol | proX |
ID | 6875441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3002941 |
End bp | 3003936 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642786141 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_002216787 |
Protein GI | 198244108 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
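The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(3002941, 3003936)
print(length, protein_length_aa(length))  # 996 331
```

Both results match the Gene Length (996 bp) and Protein Length (331 aa) fields.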
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.293613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACATA CCGTGATATT TGCCTCAGCG TTTGCCACCC TTGTCACCGC CAGCGCTTTC GCTGCCGACC TGCCTGGCAA AGGCATTACC GTCCAACCTA TCCAAAGCAC GATTTCTGAA GAGACCTTCC AGACGTTACT GGTCAGTCGG GCGCTGGAAA AGTTGGGTTA TACCGTAAAT AAGCCGAGTG AAGTCGATTA CAACGTAGGC TATACCTCCA TCGCCTCTGG CGATGCGACG TTTACCGCCG TGAACTGGCA GCCGCTGCAT GATGATATGT ATGCCGCAGC AGGCGGAGAC AACAAGTTTT ACCGCGAGGG CGTCTTCGTC TCCGGCGCTG CGCAGGGCTA TCTGATCGAT AAAAAAACCG CCGAGCAGTA CAACATCACC AATATCGCTC AGCTAAAAGA TCCGAAAATC GCCAAAATCT TCGATACCAA TGGCGACGGA AAAGCGGACA TGATGGGCTG CTCGCCGGGC TGGGGCTGCG AAGCCGTGAT CAATCATCAG AATAAAGCGT TTGATCTGCA AAAAACCGTT GAGGTCAGCC ACGGTAACTA TGCGGCGATG ATGGCTGATA CCATTACCCT TTTTAAAGAA GGCAAGCCGG TGCTGTATTA CACCTGGACC CCGTACTGGG TGAGCGACGT AATGAAGCCG GGTAAAGATG TCGTCTGGCT ACAGGTGCCG TTCTCCTCCC TGCCGGGCGA ACAGAAAAAT ATTGATACTA AACTGCCGAA CGGCGCGAAC TATGGGTTCC CGGTCAATAC TATGCATATT GTCGCCAATA AGGCATGGGC GGAGAAAAAC CCGGCGGCGG CGAAACTGTT CGCCATCATG AAGCTGCCGC TGGCGGATAT CAACGCGCAG AACGCCATGA TGCATGCCGG TAAATCGTCT GAAGCCGATG TTCAGGGCCA CGTAGACGGC TGGATCAACG CCCACCAGCA GCAGTTTGAC GGCTGGGTGA AAGAAGCGCT GGCCGCGCAG AAATAA
|
Protein sequence | MRHTVIFASA FATLVTASAF AADLPGKGIT VQPIQSTISE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSIASGDAT FTAVNWQPLH DDMYAAAGGD NKFYREGVFV SGAAQGYLID KKTAEQYNIT NIAQLKDPKI AKIFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLQKTV EVSHGNYAAM MADTITLFKE GKPVLYYTWT PYWVSDVMKP GKDVVWLQVP FSSLPGEQKN IDTKLPNGAN YGFPVNTMHI VANKAWAEKN PAAAKLFAIM KLPLADINAQ NAMMHAGKSS EADVQGHVDG WINAHQQQFD GWVKEALAAQ K
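The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the gene sequence is shown below. The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares; table 11 differs in the set of permitted start codons, which this sketch does not model (the gene here starts with ATG, so the result is unaffected).

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCGACATA CCGTGATATT TGCCTCAGCG"))  # MRHTVIFASA
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows a comparison against the Protein sequence and GC content fields of the record.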