Gene ECH74115_4268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4268 
SymbolnupG 
ID6971932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3951705 
End bp3952961 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content50% 
IMG OID643388006 
Productnucleoside permease NupG 
Protein accessionYP_002272445 
Protein GI209400129 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.218652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTA AGCTGCAGCT GAAAATCCTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGT 
TGGCTGACGA CCCTCGGTTC CTATATGTTT GTTACCCTGA AGTTTGACGG TGCTTCTATT
GGCGCAGTTT ATAGCTCACT GGGTATCGCC GCGGTCTTTA TGCCTGCGCT GCTGGGGATT
GTGGCCGACA AATGGTTAAG TGCGAAATGG GTATATGCCA TTTGCCACAC CATTGGCGCT
ATCACGCTGT TCATGGCGGC ACAGGTCACG ACGCCGGAGG CGATGTTCCT TGTGATATTG
ATTAACTCGT TTGCTTATAT GCCAACGCTT GGGTTAATCA ACACCATCTC TTACTATCGC
CTGCAAAATG CCGGGATGGA TATCGTTACT GACTTCCCGC CAATCCGTAT CTGGGGCACC
ATCGGCTTTA TCATGGCAAT GTGGGTGGTG AGCCTGTCTG GCTTCGAATT AAGCCACATG
CAGCTGTATA TTGGCGCAGC ACTTTCCGCC ATTCTGGTTC TGTTTACCCT GACTCTGCCG
CATATTCCGG TTGCTAAACA GCAAGCGAAT CAGAGCTGGA CAACCCTGCT GGGCCTCGAT
GCATTCGCGC TGTTTAAAAA CAAGCGTATG GCAATCTTCT TCATCTTCTC AATGCTGCTG
GGCGCGGAAC TGCAGATTAC CAACATGTTC GGTAATACCT TCCTGCACAG CTTCGACAAA
GATCCGATGT TTGCCAGCAG CTTTATTGTG CAGCATGCGT CAATCATCAT GTCGATTTCG
CAGATCTCTG AAACCCTGTT CATTCTGACC ATCCCGTTCT TCTTAAGCCG CTACGGTATT
AAGAACGTAA TGATGATCAG TATTGTGGCG TGGATCCTGC GTTTTGCGCT GTTTGCTTAC
GGCGACCCGA CTCCGTTCGG CACCGTACTG CTGGTTCTAT CGATGATTGT TTACGGCTGT
GCGTTCGACT TCTTCAACAT CTCTGGTTCG GTGTTTGTCG AAAAAGAAGT TAGCCCGGCA
ATTCGCGCCA GTGCACAAGG GATGTTCCTG ATGATGACTA ACGGCTTCGG CTGTATCCTC
GGCGGCATCG TGAGCGGTAA AGTTGTTGAG ATGTACACCC AAAACGGTAT TACCGACTGG
CAGACCGTAT GGTTGATTTT CGCTGGTTAC TCCGTGGTTC TGGCCTTCGC GTTCATGGCG
ATGTTCAAAT ATAAACACGT TCGTGTCCCG ACAGGTACAC AGACGGTTAG CCACTAA
 
Protein sequence
MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPALLGI 
VADKWLSAKW VYAICHTIGA ITLFMAAQVT TPEAMFLVIL INSFAYMPTL GLINTISYYR
LQNAGMDIVT DFPPIRIWGT IGFIMAMWVV SLSGFELSHM QLYIGAALSA ILVLFTLTLP
HIPVAKQQAN QSWTTLLGLD AFALFKNKRM AIFFIFSMLL GAELQITNMF GNTFLHSFDK
DPMFASSFIV QHASIIMSIS QISETLFILT IPFFLSRYGI KNVMMISIVA WILRFALFAY
GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVSPA IRASAQGMFL MMTNGFGCIL
GGIVSGKVVE MYTQNGITDW QTVWLIFAGY SVVLAFAFMA MFKYKHVRVP TGTQTVSH