Gene Information |
Locus tag | SNSL254_A3010 |
Symbol | |
ID | 6482367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2933787 |
End bp | 2934851 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642738326 |
Product | glycine betaine transporter membrane protein |
Protein accession | YP_002042055 |
Protein GI | 194444232 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4176] ABC-type proline/glycine betaine transport system, permease component |
TIGRFAM ID | |
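The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2933787, 2934851)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```

Under these assumptions the result agrees with the Gene Length (1065 bp) and Protein Length (354 aa) values listed in the record.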
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.00677598 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGATC AAACGAATCC GTGGGATACC GCACAGGTGG CCGATACTAC GACGCAAACG GCTGATGCCT GGGGAACACC GGCAGGCGTA GCCACGGACG GCGGCAGTAC CGACTGGTTG AACAGCGCGC CCGCGCCAGC CCCTGAACAC TTTTCTCTTC TGGACCCGTT CCATAAGACG CTTATCCCGC TGGATAGCTG GGTCACAGAG GGAATCGACT GGGTCGTCAC CCATTTCCGT CCCCTTTTTC AGGGGATTCG TGTGCCGGTG GATTACATCC TTAACGGCTT TCAGCAACTG CTGCTGGGAA TGCCCGCCCC TGTGGCGATT ATTCTCTTTG CGCTGATTGC CTGGCAGGTT TCCGGTGTGG GCATGGGGAT CGCGGCGCTG ATATCGCTGA TCGCCATCGG CGCGATCGGC GCCTGGTCGC AGGCGATGAT TACCCTGGCG CTGGTGCTGA CCGCCCTGTT GTTCTGCGTC GTGATCGGAT TACCGATGGG AATCTGGCTG GCGCGCAGCC CGCGCGCGGC CAAAATAGTT CGTCCGCTGC TGGATGCGAT GCAGACCACG CCCGCGTTTG TCTATCTGGT GCCGATTGTC ATGTTATTCG GCATCGGTAA CGTGCCGGGC GTGGTGGTGA CGATTATTTT TGCTCTACCG CCGATTATAC GCCTGACGAT CCTGGGCATT AACCAGGTGC CTGCCGACTT AATTGAAGCG TCGCGCTCGT TCGGCGCCAG CCCGCGCCAA ATGTTGTTCA AAGTGCAACT ACCGCTGGCG ATGCCCACCA TTATGGCAGG CGTTAATCAG ACGCTGATGC TGGCTCTCTC AATGGTCGTC ATCGCCTCGA TGATTGCGGT CGGTGGGCTT GGCCAGATGG TACTACGCGG CATTGGTCGT CTTGATATGG GGCTGGCAAC CGTCGGCGGC GTCGGCATTG TGATTCTCGC CATCATTCTG GACCGTCTGA CGCAGGCCGT CGGGCGCGAT TCGCGTAGCC GCGGTAACCG TCGCTGGTAT ACCACCGGTC CTGTTGGGCT AATCACCCGC CCTTTCGTTA AGTAA
|
Protein sequence | MADQTNPWDT AQVADTTTQT ADAWGTPAGV ATDGGSTDWL NSAPAPAPEH FSLLDPFHKT LIPLDSWVTE GIDWVVTHFR PLFQGIRVPV DYILNGFQQL LLGMPAPVAI ILFALIAWQV SGVGMGIAAL ISLIAIGAIG AWSQAMITLA LVLTALLFCV VIGLPMGIWL ARSPRAAKIV RPLLDAMQTT PAFVYLVPIV MLFGIGNVPG VVVTIIFALP PIIRLTILGI NQVPADLIEA SRSFGASPRQ MLFKVQLPLA MPTIMAGVNQ TLMLALSMVV IASMIAVGGL GQMVLRGIGR LDMGLATVGG VGIVILAIIL DRLTQAVGRD SRSRGNRRWY TTGPVGLITR PFVK
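The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be joined before any further analysis. The following is a minimal sketch of that clean-up together with a simple GC-content calculation. The helper names are hypothetical, and the example string is only the first three blocks of the gene sequence, so its GC value is not expected to match the 59% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above.
example = "ATGGCTGATC AAACGAATCC GTGGGATACC"
seq = clean_sequence(example)
print(len(seq))         # 30
print(gc_percent(seq))  # 50.0
```

Applying the same two helpers to the complete Gene sequence field would give a value that can be compared with the GC content field of the record.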