Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2994 |
Symbol | proX |
ID | 6489041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2932791 |
End bp | 2933786 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642743150 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_002046774 |
Protein GI | 194449221 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.00724532 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACATA CCGTGATATT TGCCTCAGCG TTTGCCACCC TTGTCACCGC CAGCGCTTTC GCTGCCGACC TGCCTGGCAA AGGCATTACC GTCCAACCTA TCCAAAGCAC GATTTCTGAA GAGACCTTCC AGACGTTACT GGTCAGTCGG GCGCTGGAAA AGTTGGGTTA TACCGTAAAT AAGCCGAGTG AAGTCGATTA CAACGTAGGC TACACCTCCA TCGCCTCTGG CGATGCGACG TTTACCGCCG TGAACTGGCA GCCGCTGCAT GATGATATGT ATGCCGCAGC AGGCGGAGAC AAAAAATTTT ACCGCGAGGG CGTCTTCGTC TCCGGCGCTG CGCAGGGCTA TCTGATCGAT AAAAAAACCG CCGAGCAGTA CAACATCACC AATATCGCTC AGCTAAAAGA TCCGAAAATC GCCAAAATCT TCGATACCAA TGGCGACGGA AAAGCGGACA TGATGGGCTG CTCGCCGGGC TGGGGCTGCG AAGCCGTGAT CAATCACCAG AATAAAGCGT TTGATCTGCA AAAAACCGTT GAGGTCAGCC ACGGTAACTA TGCGGCGATG ATGGCTGATA CCATTACCCG TTTTAAAGAA GGCAAGCCGG TGCTGTATTA CACCTGGACC CCGTACTGGG TGAGCGACGT AATGAAGCCG GGTAAAGATG TCGTCTGGCT ACAGGTGCCG TTCTCCTCTC TGCCGGGCGA ACAGAAAAAT ATTGATACTA AACTGCCGAA CGGCGCGAAC TATGGGTTCC CGGTTAATAC CATGCATATT GTCGCCAATA AGGCATGGGC GGAGAAAAAC CCGGCGGCGG CGAAACTGTT CGCCATCATG AAGCTGCCGC TGGCGGATAT CAACGCGCAG AACGCCATGA TGCATGCCGG TAAATCGTCT GAAGCCGATG TTCAGGGCCA CGTGGACGGC TGGATCAACG CCCACCAGCA GCAGTTTGAC GGCTGGGTGA AAGAAGCGCT GGCCGCGCAG AAATAA
|
Protein sequence | MRHTVIFASA FATLVTASAF AADLPGKGIT VQPIQSTISE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSIASGDAT FTAVNWQPLH DDMYAAAGGD KKFYREGVFV SGAAQGYLID KKTAEQYNIT NIAQLKDPKI AKIFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLQKTV EVSHGNYAAM MADTITRFKE GKPVLYYTWT PYWVSDVMKP GKDVVWLQVP FSSLPGEQKN IDTKLPNGAN YGFPVNTMHI VANKAWAEKN PAAAKLFAIM KLPLADINAQ NAMMHAGKSS EADVQGHVDG WINAHQQQFD GWVKEALAAQ K
|
| |