Gene SeHA_C3354 details


Gene Information

Locus tag: SeHA_C3354
Symbol: nupG
ID: 6491357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3263822
End bp: 3265078
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 51%
IMG OID: 642743487
Product: nucleoside permease NupG
Protein accession: YP_002047103
Protein GI: 194450077
COG category:
COG ID:
TIGRFAM ID: [TIGR00889] nucleoside transporter
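
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3263822
end_bp = 3265078

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1257
print(protein_length)  # 418
```

Both results agree with the listed Gene Length (1257 bp) and Protein Length (418 aa).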


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.650502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value: 0.93107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTA AGCTGCAGCT TAAAATACTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGC 
TGGCTCACTA CCCTGGGCTC GTATATGTTC GTCACCTTAA AATTTGACGG CGCATCTATT
GGCGCAGTTT ATAGTTCACT GGGGATTGCC GCCGTCTTTA TGCCGACCTT GCTAGGCATT
GTGGCTGACA AATGGCTGAG CGCGAAATGG GTCTATGCCC TGTGTCATGT TGTCGGCGCC
ATCACGCTAT TCATGGCCGC GGAAGTCACT ACGCCTGGGG CGATGTTCTT TGTGATCCTG
CTTAACTCGT TGGCCTATAT GCCGACGTTG GGCTTGATCA ATACCATATC GTATTACCGC
CTGCAGTCTG CCGGCATGGA TATTGTGACT GACTTTCCGC CTATCCGTAT CTGGGGCACC
ATTGGCTTTA TTCTGGCGAT GTGGGGCGTG AGTTTCTCCG GTTTCGAGCT GAGCCATATG
CAGCTTTATA TCGGCGCGAC GCTTTCCGTT CTGCTGGTAC TGTTTACCTT TACCCTGCCG
CACATTCCGG TGGCGAACCA ACAGAAAAAC CAGAGCTGGA CATCAATGCT GGGCCTTGAC
GCTTTTGCGC TGTTTAAAAA TAAGCGGATG GCGATTTTCT TCATCTTCTC CATGATGCTG
GGCGCGGAAC TGCAGATCAC TAACATGTTT GGCAACACCT TCCTGCATAG CTTTGATAAA
GATCCGCTAT TCGCCAGTAG CTTTATCGTG CAGCACGCCT CGGTGATGAT GTCGATTTCG
CAGATTTCTG AAACGTTATT CATCCTGACC ATTCCGTTCT TCCTGAGCCG CTATGGTATT
AAGAACGTTA TGCTTATCAG TATTGTGGCG TGGATGCTGC GTTTCGGCCT GTTCGCTTAT
GGCGACCCGA CGCCGTTCGG TACCGTTCTG CTGGTATTGT CGATGATTGT GTACGGCTGC
GCCTTCGACT TCTTCAACAT TTCTGGCTCG GTGTTTGTCG AAAAAGAAGT ACGCCCGGAA
ATCCGCGCCA GCGCGCAGGG GATGTTCCTG ATGATGACCA ATGGCTTCGG CTGTATCCTG
GGCGGCATTG TGAGCGGTAA AGTGGTGGAG TATTACACTC AAAACGGCAT TACCGACTGG
CAGACCGTGT GGCTCATCTT CGCAGGCTAC TCGCTGGTGC TGGCCTTCGC GTTCGTAGCC
TTGTTCAAAT ACAAACACGT TCGCGTTCCG GCAAGTTCGC AACCCGTTGC ACATTAA
 
Protein sequence
MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPTLLGI 
VADKWLSAKW VYALCHVVGA ITLFMAAEVT TPGAMFFVIL LNSLAYMPTL GLINTISYYR
LQSAGMDIVT DFPPIRIWGT IGFILAMWGV SFSGFELSHM QLYIGATLSV LLVLFTFTLP
HIPVANQQKN QSWTSMLGLD AFALFKNKRM AIFFIFSMML GAELQITNMF GNTFLHSFDK
DPLFASSFIV QHASVMMSIS QISETLFILT IPFFLSRYGI KNVMLISIVA WMLRFGLFAY
GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVRPE IRASAQGMFL MMTNGFGCIL
GGIVSGKVVE YYTQNGITDW QTVWLIFAGY SLVLAFAFVA LFKYKHVRVP ASSQPVAH
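
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; it does not handle alternative start codons or ambiguous bases. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here and do not come from the database.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: the amino-acid assignments
# of translation table 11 are the same as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
first_30 = "ATGAATCTTA AGCTGCAGCT TAAAATACTC"
print(translate(first_30))  # MNLKLQLKIL

# Last 9 bases of the gene sequence: two codons followed by the stop codon.
last_9 = "GCACATTAA"
print(translate(last_9))  # AH
```

The two outputs match the first ten and the last two residues of the protein sequence. Passing the full gene sequence to `gc_percent` is one way to compare against the listed GC content.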