Gene SNSL254_A3010 details


Gene Information

Locus tag: SNSL254_A3010
Symbol:
ID: 6482367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2933787
End bp: 2934851
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 59%
IMG OID: 642738326
Product: glycine betaine transporter membrane protein
Protein accession: YP_002042055
Protein GI: 194444232
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4176] ABC-type proline/glycine betaine transport system, permease component
TIGRFAM ID:
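
The start, end, and length fields in this record agree with one another, and a little arithmetic shows how. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2933787, 2934851)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under those assumptions the computed values match the Gene Length and Protein Length fields of the record.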


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.00677598
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGATC AAACGAATCC GTGGGATACC GCACAGGTGG CCGATACTAC GACGCAAACG 
GCTGATGCCT GGGGAACACC GGCAGGCGTA GCCACGGACG GCGGCAGTAC CGACTGGTTG
AACAGCGCGC CCGCGCCAGC CCCTGAACAC TTTTCTCTTC TGGACCCGTT CCATAAGACG
CTTATCCCGC TGGATAGCTG GGTCACAGAG GGAATCGACT GGGTCGTCAC CCATTTCCGT
CCCCTTTTTC AGGGGATTCG TGTGCCGGTG GATTACATCC TTAACGGCTT TCAGCAACTG
CTGCTGGGAA TGCCCGCCCC TGTGGCGATT ATTCTCTTTG CGCTGATTGC CTGGCAGGTT
TCCGGTGTGG GCATGGGGAT CGCGGCGCTG ATATCGCTGA TCGCCATCGG CGCGATCGGC
GCCTGGTCGC AGGCGATGAT TACCCTGGCG CTGGTGCTGA CCGCCCTGTT GTTCTGCGTC
GTGATCGGAT TACCGATGGG AATCTGGCTG GCGCGCAGCC CGCGCGCGGC CAAAATAGTT
CGTCCGCTGC TGGATGCGAT GCAGACCACG CCCGCGTTTG TCTATCTGGT GCCGATTGTC
ATGTTATTCG GCATCGGTAA CGTGCCGGGC GTGGTGGTGA CGATTATTTT TGCTCTACCG
CCGATTATAC GCCTGACGAT CCTGGGCATT AACCAGGTGC CTGCCGACTT AATTGAAGCG
TCGCGCTCGT TCGGCGCCAG CCCGCGCCAA ATGTTGTTCA AAGTGCAACT ACCGCTGGCG
ATGCCCACCA TTATGGCAGG CGTTAATCAG ACGCTGATGC TGGCTCTCTC AATGGTCGTC
ATCGCCTCGA TGATTGCGGT CGGTGGGCTT GGCCAGATGG TACTACGCGG CATTGGTCGT
CTTGATATGG GGCTGGCAAC CGTCGGCGGC GTCGGCATTG TGATTCTCGC CATCATTCTG
GACCGTCTGA CGCAGGCCGT CGGGCGCGAT TCGCGTAGCC GCGGTAACCG TCGCTGGTAT
ACCACCGGTC CTGTTGGGCT AATCACCCGC CCTTTCGTTA AGTAA
 
Protein sequence
MADQTNPWDT AQVADTTTQT ADAWGTPAGV ATDGGSTDWL NSAPAPAPEH FSLLDPFHKT 
LIPLDSWVTE GIDWVVTHFR PLFQGIRVPV DYILNGFQQL LLGMPAPVAI ILFALIAWQV
SGVGMGIAAL ISLIAIGAIG AWSQAMITLA LVLTALLFCV VIGLPMGIWL ARSPRAAKIV
RPLLDAMQTT PAFVYLVPIV MLFGIGNVPG VVVTIIFALP PIIRLTILGI NQVPADLIEA
SRSFGASPRQ MLFKVQLPLA MPTIMAGVNQ TLMLALSMVV IASMIAVGGL GQMVLRGIGR
LDMGLATVGG VGIVILAIIL DRLTQAVGRD SRSRGNRRWY TTGPVGLITR PFVK
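
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step follows, assuming the amino acid assignments of table 11 are the same as those of the standard code (the two tables differ in their permitted start codons, not in the codon-to-residue mapping). The `translate` helper is hypothetical and is not the tool that produced this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in this record.
print(translate("ATGGCTGATC AAACGAATCC GTGGGATACC"))
```

The first ten residues produced this way are MADQTNPWDT, matching the start of the protein sequence listed in this record.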