Gene SeD_A2786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2786 
SymbolnupG 
ID6872973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2659791 
End bp2661047 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content49% 
IMG OID642785840 
Productnucleoside permease NupG 
Protein accessionYP_002216490 
Protein GI198245553 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.055348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATTA CGTCCCGCTT AAAAGTCATG TCGTTCTTGC AATATTTTAT CTGGGGGAGC 
TGGCTGGTTA CCCTGGGCTC TTACATGATC AACACTCTGG ATTTTACCGG CGCGAATGTC
GGTATGGTCT ACAGCTCAAA AGGACTGGCA GCGATTATCA TGCCGGGCAT TATGGGGATC
ATTGCTGATA AATGGCTGCG CGCTGAGCGA GCCTATATGC TTTGCCATCT GGTTTGCGCG
GGGGCGTTAT TGTACGCCAC CACCGTTACC GATCCCCAGA CGATGTTCTG GGTGATGTTG
GTTAATGCGA TGGCGTATAT GCCAACGATT GCATTATCCA ATAGCGTTTC GTACTCCTGT
CTGGCGAAAG CAGGTCAGGA TCCGGTAACG TCATTTCCGC CCGTGCGCGT TTTCGGCACA
ATAGGTTTTA TTGTTGCGAT GTGGACGGTG AGCCTGATAG GGCTGGAACT GAGCAGTGCG
CAATTATACA TCGCTTCTGG CGCATCGTTA TTGCTGGCCC TGTATGCGCT GACGTTACCG
AAAATTCCGG TAGCCGAGAA GAAGGCGAAC ACCACGCTTG TCAGTAAGCT CGGACTGGAT
GCTTTTGTTC TGTTTAAAAA TCCACGCATG GCAATCTTCT TTTTGTTTGC GATGATGTTG
GGGGCGGTGC TGCAAATTAC CAATGTCTTC GGTAATCCAT TCCTGCATGA TTTTGCCCGT
AATCCTGAGT TTGCCGACAG CTTTGTGGTG AGGTATCCCT CTATCTTGCT TTCAGTTTCG
CAGATGGCGG AAGTGGGCTT TATCCTCACC ATTCCGTTTT TCCTTAAACG CTTTGGTATT
AAAACAGTAA TGCTGATGAG CATGCTGGCG TGGACGCTGC GTTTCGGCTT CTTTGCCTTT
GGCGATCCAT CCCCGTTTGG CTTTGTGCTA CTGCTGCTGT CGATGATTGT TTATGGCTGC
GCATTTGATT TCTTCAACAT CTCAGGGTCA GTATTTGTAG AGCAGGAGGT GGACTCAAGT
ATTCGCGCCA GCGCGCAGGG GCTATTTATG ACCATGGTTA ACGGCGTGGG GGCGTGGATT
GGGTCTCTTT TAAGCGGTAT GGCCGTGGAT TATTTTTCTA TTGATGGTGT AAAAGATTGG
CAAACCATTT GGCTGGTTTT TGCCGCCTAC GCTCTGGCAT TGGCCGTTAT TTTTGCATTG
TTCTTTAAAT ATCAGCACCA TCCAGAAAAA CTGTCGACCA AATCATTAGC ACATTAA
 
Protein sequence
MGITSRLKVM SFLQYFIWGS WLVTLGSYMI NTLDFTGANV GMVYSSKGLA AIIMPGIMGI 
IADKWLRAER AYMLCHLVCA GALLYATTVT DPQTMFWVML VNAMAYMPTI ALSNSVSYSC
LAKAGQDPVT SFPPVRVFGT IGFIVAMWTV SLIGLELSSA QLYIASGASL LLALYALTLP
KIPVAEKKAN TTLVSKLGLD AFVLFKNPRM AIFFLFAMML GAVLQITNVF GNPFLHDFAR
NPEFADSFVV RYPSILLSVS QMAEVGFILT IPFFLKRFGI KTVMLMSMLA WTLRFGFFAF
GDPSPFGFVL LLLSMIVYGC AFDFFNISGS VFVEQEVDSS IRASAQGLFM TMVNGVGAWI
GSLLSGMAVD YFSIDGVKDW QTIWLVFAAY ALALAVIFAL FFKYQHHPEK LSTKSLAH