Gene SeHA_C1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1235 
SymbolputP 
ID6488347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1216709 
End bp1218217 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content54% 
IMG OID642741473 
Productsodium/proline symporter 
Protein accessionYP_002045124 
Protein GI194451512 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0211084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value0.60164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATTA GCACACCGAT GTTGGTGACA TTCTGTGTCT ATATTTTTGG CATGATATTG 
ATTGGGTTTA TCGCCTGGCG CTCAACCAAA AACTTTGATG ACTATATTCT TGGCGGTCGC
AGCCTGGGGC CGTTTGTTAC GGCTTTATCA GCCGGCGCGT CGGATATGAG CGGCTGGCTG
TTAATGGGGC TGCCTGGCGC TATCTTTCTG TCGGGGATCT CTGAAAGCTG GATCGCCATT
GGCCTGACGT TAGGCGCATG GATTAACTGG AAGCTGGTGG CCGGGCGCCT GCGCGTGCAT
ACCGAATTTA ACAATAACGC GCTCACGCTA CCGGACTATT TTACCGGTCG GTTTGAAGAT
AAGAGCCGAG TCCTGCGTAT TATTTCCGCG CTGGTCATTC TGCTGTTTTT CACTATCTAT
TGCGCATCAG GTATTGTCGC TGGGGCACGA CTGTTCGAAA GCACCTTCGG TATGAGCTAT
GAAACCGCAC TGTGGGCGGG GGCCGCGGCA ACCATTATTT ATACCTTTAT CGGCGGGTTT
CTTGCCGTTA GCTGGACGGA TACCGTTCAG GCCAGCCTGA TGATTTTTGC GTTAATCCTG
ACGCCGGTGA TGGTTATTGT CGGCGTAGGC GGTTTTAGCG AGTCGCTGGA GGTGATCAAG
CAAAAGAGCA TCGAGAATGT CGACATGCTC AAGGGGCTGA ATTTTGTCGC TATTATTTCT
CTGATGGGCT GGGGACTGGG TTACTTCGGT CAGCCGCATA TCCTGGCGCG CTTTATGGCG
GCGGATTCCC ATCACAGTAT TGTTCATGCG CGTCGTATCA GTATGACCTG GATGATTCTG
TGTCTGGCGG GCGCGGTGGC GGTGGGCTTC TTTGGCATTG CGTACTTTAA CAATAACCCC
GCGCTGGCCG GGGCGGTGAA CCAAAACTCA GAACGCGTAT TTATTGAACT GGCGCAGATC
CTGTTTAACC CGTGGATTGC CGGTGTTCTG CTGTCTGCTA TCCTGGCGGC GGTGATGTCG
ACGTTGAGCT GTCAGTTGCT GGTATGCTCC AGCGCGATTA CGGAAGATTT ATATAAGGCT
TTTCTGCGTA AAAGCGCCAG CCAGCAAGAG CTGGTATGGG TAGGGCGAGT GATGGTGCTG
GTGGTAGCGC TGATCGCCAT TGCGCTGGCG GCGAACCCCG ATAACCGTGT GCTGGGGCTG
GTGAGCTACG CCTGGGCTGG ATTCGGCGCG GCATTTGGAC CTGTTGTCCT GTTTTCTGTG
ATGTGGTCGC GTATGACACG TAACGGCGCG CTGGCGGGAA TGATTATTGG CGCGGTGACG
GTTATCGTCT GGAAACAATA TGGCTGGCTG GATCTGTATG AGATTATCCC TGGCTTCATT
TTCGGCAGCC TGGGGATCGT AATCTTTAGC CTGCTTGGCA AAGCGCCGAC AGCAGCGATG
CAGGAACGCT TTGCAAAAGC GGACGCGCAT TATCATTCCG CGCCGCCGTC GAAGCTACAG
GCGGAATAA
 
Protein sequence
MAISTPMLVT FCVYIFGMIL IGFIAWRSTK NFDDYILGGR SLGPFVTALS AGASDMSGWL 
LMGLPGAIFL SGISESWIAI GLTLGAWINW KLVAGRLRVH TEFNNNALTL PDYFTGRFED
KSRVLRIISA LVILLFFTIY CASGIVAGAR LFESTFGMSY ETALWAGAAA TIIYTFIGGF
LAVSWTDTVQ ASLMIFALIL TPVMVIVGVG GFSESLEVIK QKSIENVDML KGLNFVAIIS
LMGWGLGYFG QPHILARFMA ADSHHSIVHA RRISMTWMIL CLAGAVAVGF FGIAYFNNNP
ALAGAVNQNS ERVFIELAQI LFNPWIAGVL LSAILAAVMS TLSCQLLVCS SAITEDLYKA
FLRKSASQQE LVWVGRVMVL VVALIAIALA ANPDNRVLGL VSYAWAGFGA AFGPVVLFSV
MWSRMTRNGA LAGMIIGAVT VIVWKQYGWL DLYEIIPGFI FGSLGIVIFS LLGKAPTAAM
QERFAKADAH YHSAPPSKLQ AE