Gene SNSL254_A1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1220 
SymbolputP 
ID6482844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1214319 
End bp1215827 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content54% 
IMG OID642736621 
Productsodium/proline symporter 
Protein accessionYP_002040379 
Protein GI194443976 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.405046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATTA GCACACCGAT GTTGGTGACA TTCTGTGTCT ATATTTTTGG CATGATATTG 
ATTGGGTTTA TCGCCTGGCG CTCAACCAAA AACTTTGATG ACTATATTCT TGGCGGTCGC
AGCCTGGGGC CGTTTGTTAC GGCTTTATCA GCCGGCGCGT CGGATATGAG CGGCTGGCTG
TTAATGGGGC TGCCTGGCGC TATCTTTCTG TCGGGGATCT CTGAAAGCTG GATCGCCATT
GGCCTGACGT TAGGCGCATG GATTAACTGG AAGCTGGTGG CCGGGCGCCT GCGCGTGCAT
ACCGAATTTA ACAATAACGC GCTCACGCTA CCGGACTATT TTACCGGTCG GTTTGAAGAT
AAGAGCCGAG TCCTGCGTAT TATTTCCGCG CTGGTCATTC TGCTGTTTTT CACTATCTAT
TGCGCATCAG GTATTGTCGC TGGGGCACGA CTGTTCGAAA GCACCTTCGG TATGAGCTAT
GAAACCGCAC TGTGGGCGGG GGCCGCGGCA ACCATTATTT ATACCTTTAT CGGCGGGTTT
CTTGCCGTTA GCTGGACGGA TACCGTTCAG GCCAGCCTGA TGATTTTTGC GTTAATCCTG
ACGCCGGTGA TGGTTATTGT CGGCGTAGGC GGTTTTAGCG AGTCGCTGGA GGTGATCAAG
CAAAAGAGCA TCGAGAATGT CGACATGCTC AAGGGGCTGA ATTTTGTCGC TATTATTTCT
CTGATGGGCT GGGGACTGGG TTACTTCGGT CAGCCGCATA TCCTGGCGCG CTTTATGGCG
GCGGATTCCC ATCACAGTAT TGTTCATGCG CGTCGTATCA GTATGACCTG GATGATTCTG
TGTCTGGCGG GCGCGGTGGC GGTGGGCTTC TTTGGCATTG CGTACTTTAA CAATAACCCC
GCGCTGGCCG GGGCGGTGAA CCAAAACTCA GAACGCGTAT TTATTGAACT GGCGCAGATC
CTGTTTAACC CGTGGATTGC CGGTGTTCTG CTGTCTGCTA TCCTGGCGGC GGTGATGTCG
ACGTTGAGCT GTCAGTTGCT GGTATGCTCC AGCGCGATTA CAGAAGATTT ATATAAGGCT
TTTCTGCGTA AAAGCGCCAG CCAGCAAGAG CTGGTATGGG TAGGGCGAGT GATGGTGCTG
GTGGTAGCGC TGATCGCCAT TGCGCTGGCG GCGAATCCCG ATAACCGTGT GCTGGGGCTG
GTGAGCTACG CCTGGGCTGG ATTCGGCGCG GCATTTGGAC CTGTTGTCCT GTTTTCTGTG
ATGTGGTCGC GTATGACACG TAACGGCGCG CTGGCGGGAA TGATTATTGG CGCGGTGACG
GTTATCGTCT GGAAACAATA TGGCTGGCTG GATCTGTATG AGATCATCCC TGGCTTCATT
TTCGGCAGCC TGGGGATCGT AATCTTTAGC CTGCTTGGCA AAGCGCCGAC AGCAGCGATG
CAGGAACGCT TTGCAAAAGC GGACGCGCAT TATCATTCCG CGCCGCCGTC GAAGCTACAG
GCGGAATAA
 
Protein sequence
MAISTPMLVT FCVYIFGMIL IGFIAWRSTK NFDDYILGGR SLGPFVTALS AGASDMSGWL 
LMGLPGAIFL SGISESWIAI GLTLGAWINW KLVAGRLRVH TEFNNNALTL PDYFTGRFED
KSRVLRIISA LVILLFFTIY CASGIVAGAR LFESTFGMSY ETALWAGAAA TIIYTFIGGF
LAVSWTDTVQ ASLMIFALIL TPVMVIVGVG GFSESLEVIK QKSIENVDML KGLNFVAIIS
LMGWGLGYFG QPHILARFMA ADSHHSIVHA RRISMTWMIL CLAGAVAVGF FGIAYFNNNP
ALAGAVNQNS ERVFIELAQI LFNPWIAGVL LSAILAAVMS TLSCQLLVCS SAITEDLYKA
FLRKSASQQE LVWVGRVMVL VVALIAIALA ANPDNRVLGL VSYAWAGFGA AFGPVVLFSV
MWSRMTRNGA LAGMIIGAVT VIVWKQYGWL DLYEIIPGFI FGSLGIVIFS LLGKAPTAAM
QERFAKADAH YHSAPPSKLQ AE