Gene SNSL254_A2614 details


Gene Information

Locus tag: SNSL254_A2614
Symbol: nupG
ID: 6482451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2532584
End bp: 2533840
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 49%
IMG OID: 642737947
Product: nucleoside permease NupG
Protein accession: YP_002041681
Protein GI: 194442824
COG category:
COG ID:
TIGRFAM ID: [TIGR00889] nucleoside transporter
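
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither assumption is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon,
    if counted in the gene length, does not encode a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2532584, 2533840)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```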


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTATTA CGTCCCGCTT AAAAGTCATG TCGTTCTTGC AATATTTTAT CTGGGGGAGC 
TGGCTGGTTA CCCTGGGCTC TTACATGATC AACACTCTGG ATTTTACCGG CGCGAATGTC
GGTATGGTCT ACAGCTCAAA AGGACTGGCA GCGATTATCA TGCCGGGCAT TATGGGGATC
ATTGCTGATA AATGGCTGCG CGCTGAGCGA GCCTACATGC TTTGCCATCT GGTTTGCGCG
GGGGCGTTAT TGTACGCCAC CACCGTTACC GATCCCCAGA CGATGTTCTG GGTGATGTTG
GTTAATGCGA TGGCGTATAT GCCAACGATT GCATTATCCA ATAGCGTTTC GTACTCCTGT
CTGGCGAAAG CAGGTCAGGA TCCGGTAACG TCATTTCCGC CTGTGCGCGT TTTCGGCACA
ATAGGTTTTA TTGTTGCGAT GTGGACGGTG AGCCTGATGG GGCTGGAACT GAGCAGTGCG
CAATTATACA TCGCTTCTGG CGCATCGTTA TTGCTGGCCC TGTATGCGCT GACGTTACCG
AAAATTCCGG TAGCCGAGAA GAAGGCGAAC ACCACGCTTG CCAGTAAGCT CGGACTGGAT
GCTTTTGTTC TGTTTAAAAA TCCACGCATG GCAATCTTCT TTTTGTTTGC GATGATGTTG
GGGGCGGTGC TGCAAATTAC CAATGTCTTC GGTAATCCGT TCCTGCATGA TTTTGCCCGT
AATCCTGAGT TTGCCGACAG CTTTGTGGTG AAGTATCCCT CTATCTTGCT TTCAGTTTCG
CAGATGGCGG AAGTGGGCTT TATCCTCACC ATTCCGTTCT TCCTTAAACG CTTTGGTATT
AAAACGGTAA TGCTGATGAG TATGCTGGCG TGGACGCTGC GTTTCGGCTT CTTTGCCTTT
GGCGATCCAT CCCCGTTTGG CTTTGTGCTA CTGCTGCTGT CGATGATTGT TTATGGCTGC
GCATTTGATT TCTTCAACAT CTCAGGGTCA GTATTTGTAG AGCAGGAGGT GGACTCAAGT
ATTCGCGCCA GCGCGCAGGG GCTATTTATG ACCATGGTTA ACGGCGTGGG GGCGTGGATT
GGGTCTCTTT TAAGCGGTAT GGCCGTGGAT TATTTTTCTA TTGATGGTGT AAAAGATTGG
CAAACCATTT GGCTGGTTTT TGCCGCCTAC GCTCTGGCAT TGGCCGTTAT TTTTGCATTG
TTCTTTAAAT ATCAGCACCA TCCAGAAAAA CTGTCGACCA AATCATTAGC ACATTAA
 
Protein sequence
MGITSRLKVM SFLQYFIWGS WLVTLGSYMI NTLDFTGANV GMVYSSKGLA AIIMPGIMGI 
IADKWLRAER AYMLCHLVCA GALLYATTVT DPQTMFWVML VNAMAYMPTI ALSNSVSYSC
LAKAGQDPVT SFPPVRVFGT IGFIVAMWTV SLMGLELSSA QLYIASGASL LLALYALTLP
KIPVAEKKAN TTLASKLGLD AFVLFKNPRM AIFFLFAMML GAVLQITNVF GNPFLHDFAR
NPEFADSFVV KYPSILLSVS QMAEVGFILT IPFFLKRFGI KTVMLMSMLA WTLRFGFFAF
GDPSPFGFVL LLLSMIVYGC AFDFFNISGS VFVEQEVDSS IRASAQGLFM TMVNGVGAWI
GSLLSGMAVD YFSIDGVKDW QTIWLVFAAY ALALAVIFAL FFKYQHHPEK LSTKSLAH
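
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It uses only the Python standard library. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1. The helper names are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence that begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments; it differs
# from table 1 only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop.

    Raises KeyError on codons containing characters other than A/C/G/T.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGGTATTA CGTCCCGCTT AAAAGTCATG")
print(translate(first_blocks))  # MGITSRLKVM
```

Running the same functions on the full gene sequence should reproduce the listed protein sequence and a GC fraction close to the reported 49%, provided the listing is complete.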