Gene SeHA_C2680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2680 
SymbolnupG 
ID6491130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2591350 
End bp2592606 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content49% 
IMG OID642742858 
Productnucleoside permease NupG 
Protein accessionYP_002046485 
Protein GI194447355 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATTA CGTCCCGCTT AAAAGTCATG TCGTTCTTGC AATATTTTAT CTGGGGGAGC 
TGGCTGGTTA CCCTGGGCTC TTACATGATC AACACTCTGG ATTTTACCGG CGCGAATGTC
GGTATGGTCT ACAGCTCAAA AGGACTGGCA GCGATTATCA TGCCGGGTAT TATGGGGATC
ATTGCTGATA AATGGCTGCG CGCTGAGCGA GCCTACATGC TTTGCCATCT GGTTTGCGCG
GGGGCGTTAT TGTACGCCAC CACCGTTACC GATCCCCAGA CGATGTTCTG GGTGATGTTG
GTGAATGCGA TGGCGTATAT GCCAACAATT GCATTATCCA ATAGCGTTTC GTACTCCTGT
CTGGCGAAAG CAGGTCAGGA TCCGGTAACG TCATTTCCGC CTGTGCGCGT TTTCGGCACA
ATAGGTTTTA TTGTTGCGAT GTGGACGGTG AGCCTGATGG GGCTGGAACT GAGCAGTGCG
CAATTATACA TCGCTTCTGG CGCATCGTTA TTGCTGGCCC TGTATGCGCT GACGTTACCG
AAAATTCCGG TAGCCGAGAA GAAGGCGAAC ACCACGCTTG CCAGTAAGCT CGGACTGGAT
GCTTTTGTTC TGTTTAAAAA TCCACGCATG GCAATCTTCT TTTTGTTTGC GATGATGTTG
GGGGCGGTGC TGCAAATTAC CAATGTCTTC GGTAATCCGT TCCTGCATGA TTTTGCCCGT
AATCCTGAGT TTGCCGACAG CTTTGTGGTG AAGTATCCCT CTATCTTGCT TTCAGTTTCG
CAGATGGCGG AAGTGGGCTT TATCCTCACC ATTCCGTTCT TCCTTAAACG CTTTGGTATT
AAAACGGTAA TGCTGATGAG TATGCTGGCG TGGACGCTGC GTTTCGGCTT CTTTGCCTTT
GGCGATCCAT CCCCGTTTGG CTTTGTGCTA CTGCTGCTGT CGATGATTGT TTATGGCTGC
GCATTTGATT TCTTCAACAT CTCAGGGTCA GTATTTGTAG AGCAGGAGGT GGACTCAAGT
ATTCGCGCCA GCGCGCAGGG GCTATTTATG ACCATGGTTA ACGGCGTGGG GGCGTGGATT
GGGTCTCTTT TAAGCGGTAT GGCCGTGGAT TATTTTTCTA TTGATGGCGT AAAAGATTGG
CAAACTATCT GGCTGGTCTT TGCCGCCTAC GCTCTGGCAT TGGCCGTTAT TTTTGCATTG
TTCTTTAAAT ATCAGCACCA TCCAGAAAAA CTGTCGACCA AATCATTAGC ACATTAA
 
Protein sequence
MGITSRLKVM SFLQYFIWGS WLVTLGSYMI NTLDFTGANV GMVYSSKGLA AIIMPGIMGI 
IADKWLRAER AYMLCHLVCA GALLYATTVT DPQTMFWVML VNAMAYMPTI ALSNSVSYSC
LAKAGQDPVT SFPPVRVFGT IGFIVAMWTV SLMGLELSSA QLYIASGASL LLALYALTLP
KIPVAEKKAN TTLASKLGLD AFVLFKNPRM AIFFLFAMML GAVLQITNVF GNPFLHDFAR
NPEFADSFVV KYPSILLSVS QMAEVGFILT IPFFLKRFGI KTVMLMSMLA WTLRFGFFAF
GDPSPFGFVL LLLSMIVYGC AFDFFNISGS VFVEQEVDSS IRASAQGLFM TMVNGVGAWI
GSLLSGMAVD YFSIDGVKDW QTIWLVFAAY ALALAVIFAL FFKYQHHPEK LSTKSLAH