Gene SeD_A1200 details

Gene Information

Locus tag: SeD_A1200
Symbol: putP
ID: 6874229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1190480
End bp: 1191988
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 54%
IMG OID: 642784382
Product: sodium/proline symporter
Protein accession: YP_002215055
Protein GI: 198243177
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter
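
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1190480
END_BP = 1191988
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1509, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 502, matches Protein Length
```

Under those assumptions, both derived values agree with the reported Gene Length and Protein Length.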


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.133838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTA GCACACCGAT GTTGGTGACA TTCTGTGTCT ATATTTTTGG CATGATATTG 
ATTGGGTTTA TCGCCTGGCG CTCAACCAAA AACTTTGATG ACTATATTCT TGGCGGTCGC
AGCCTGGGGC CGTTTGTTAC GGCTTTATCA GCCGGCGCGT CGGATATGAG CGGCTGGCTG
TTAATGGGGC TGCCTGGCGC TATCTTTCTG TCGGGGATCT CTGAAAGCTG GATCGCCATT
GGCCTGACGT TAGGCGCATG GATTAACTGG AAGCTGGTGG CCGGGCGCCT GCGCGTGCAT
ACCGAATTTA ACAATAACGC GCTCACGCTA CCGGACTATT TTACCGGTCG GTTTGAAGAT
AAGAGCCGAG TCCTGCGTAT TATTTCCGCG CTGGTCATTC TGCTGTTTTT CACTATCTAT
TGCGCATCAG GTATTGTCGC TGGGGCACGA CTGTTCGAAA GCACCTTCGG TATGAGCTAT
GAAACCGCAC TGTGGGCGGG GGCCGCGGCA ACCATTATTT ATACCTTTAT CGGCGGGTTT
CTTGCCGTTA GCTGGACGGA TACCGTTCAG GCCAGCCTGA TGATTTTTGC GTTAATCCTG
ACGCCGGTGA TGGTTATTGT CGGCGTAGGC GGTTTTAGCG AGTCGCTGGA GGTGATCAAG
CAAAAGAGCA TCGAGAATGT CGACATGCTC AAGGGGCTGA ATTTTGTCGC TATTATTTCT
CTGATGGGCT GGGGACTGGG TTACTTCGGT CAGCCGCATA TCCTGGCGCG CTTTATGGCG
GCGGATTCCC ATCACAGTAT TGTTCATGCG CGTCGTATCA GTATGACCTG GATGATTCTG
TGTCTGGCGG GCGCGGTGGC GGTGGGCTTC TTTGGCATTG CGTACTTTAA CAATAACCCC
GCGCTGGCCG GGGCGGTGAA CCAAAACTCA GAACGCGTAT TTATTGAACT GGCGCAGATC
CTGTTTAACC CGTGGATTGC CGGTGTTCTG CTGTCTGCTA TCCTGGCGGC GGTGATGTCG
ACGTTGAGCT GTCAGTTGCT GGTATGCTCC AGCGCGATTA CGGAAGATTT ATATAAGGCT
TTTCTGCGTA AAAGCGCCAG CCAGCAAGAG CTGGTATGGG TAGGGCGAGT GATGGTGCTG
GTGGTAGCGC TGATCGCCAT TGCGCTGGCG GCGAACCCCG ATAACCGTGT GCTGGGGCTG
GTGAGCTACG CCTGGGCTGG ATTCGGCGCG GCATTTGGGC CTGTTGTCCT GTTTTCTGTG
ATGTGGTCGC GTATGACACG TAACGGCGCG CTGGCGGGAA TGATTATTGG CGCGGTGACG
GTTATCGTCT GGAAACAATA TGGCTGGCTG GATCTGTATG AGATTATCCC TGGCTTCATT
TTCGGCAGCC TGGGGATCGT AATCTTTAGC CTGCTTGGCA AAGCGCCGAC AGCAGCGATG
CAGGAACGCT TTGCAAAAGC GGACGCGCAT TATCATTCCG CGCCGCCGTC GAAGCTACAG
GCGGAATAA
 
Protein sequence
MAISTPMLVT FCVYIFGMIL IGFIAWRSTK NFDDYILGGR SLGPFVTALS AGASDMSGWL 
LMGLPGAIFL SGISESWIAI GLTLGAWINW KLVAGRLRVH TEFNNNALTL PDYFTGRFED
KSRVLRIISA LVILLFFTIY CASGIVAGAR LFESTFGMSY ETALWAGAAA TIIYTFIGGF
LAVSWTDTVQ ASLMIFALIL TPVMVIVGVG GFSESLEVIK QKSIENVDML KGLNFVAIIS
LMGWGLGYFG QPHILARFMA ADSHHSIVHA RRISMTWMIL CLAGAVAVGF FGIAYFNNNP
ALAGAVNQNS ERVFIELAQI LFNPWIAGVL LSAILAAVMS TLSCQLLVCS SAITEDLYKA
FLRKSASQQE LVWVGRVMVL VVALIAIALA ANPDNRVLGL VSYAWAGFGA AFGPVVLFSV
MWSRMTRNGA LAGMIIGAVT VIVWKQYGWL DLYEIIPGFI FGSLGIVIFS LLGKAPTAAM
QERFAKADAH YHSAPPSKLQ AE
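
Both sequences above are laid out in blocks of 10 residues, 60 per line, so the spacing has to be stripped before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used here for illustration. The stop-codon set assumes the standard stops used by translation table 11 (TAA, TAG, TGA).

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTATTA GCACACCGAT GTTGGTGACA TTCTGTGTCT ATATTTTTGG CATGATATTG"
last_line = "GCGGAATAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))                  # 60 bases per full line
print(head.startswith("ATG"))     # True: the gene begins with an ATG start codon
print(tail[-3:] in STOP_CODONS)   # True: the gene ends with the TAA stop codon
print(f"{gc_fraction(head):.2f}") # GC fraction of the first line only
```

Running `gc_fraction` on the full cleaned gene sequence, rather than a single line, would be the way to compare against the GC content field reported in the record.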