Gene EcSMS35_3108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3108 
SymbolnupG 
ID6144908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3192520 
End bp3193776 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content51% 
IMG OID641617976 
Productnucleoside permease NupG 
Protein accessionYP_001745127 
Protein GI170679662 
COG category 
COG ID 
TIGRFAM ID[TIGR00889] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00661547 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTTA AGCTGCAGCT GAAAATCCTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGT 
TGGCTGACGA CCCTCGGCTC CTATATGTTT GTTACCCTGA AGTTTGACGG TGCTTCTATC
GGCGCAGTTT ATAGTTCACT GGGTATCGCC GCGGTCTTTA TGCCTGCGCT GCTGGGGATT
GTGGCCGACA AATGGTTAAG TGCGAAATGG GTATATGCCA TTTGCCACAC CATTGGCGCT
ATCACGCTGT TCATGGCGGC ACAGGTCACG ACGCCGGAAG CGATGTTCCT TGTGATATTG
ATTAACTCGT TTGCTTATAT GCCAACGCTT GGGTTAATCA ACACCATCTC TTACTATCGC
CTGCAAAATG CCGGGATGGA TATCGTTACT GACTTCCCGC CAATCCGTAT CTGGGGCACC
ATTGGCTTTA TCATGGCAAT GTGGGTGGTG AGCCTGTCTG GCTTCGAATT AAGCCACATG
CAGCTGTATA TTGGCGCAGC TCTTTCCGCC GTTCTGGTTC TGTTTACCCT GACTCTGCCG
CACATTCCGG TTGCTAAACA GCAAGCGAAT CAGAGCTGGA CAACCCTGCT GGGCCTCGAT
GCATTCGCGC TGTTTAAAAA CAAGCGTATG GCAATCTTCT TCATCTTCTC AATGCTGCTG
GGCGCGGAAC TGCAGATTAC CAACATGTTC GGTAACACCT TCCTGCACAG TTTCGACAAA
GATCCGATGT TTGCCAGCAG CTTCATCGTG CAGCATGCGT CAATCATCAT GTCGATTTCG
CAGATCTCTG AAACGCTGTT CATTCTGACC ATCCCGTTCT TCTTAAGCCG CTACGGCATT
AAGAACGTAA TGATGATCAG TATCGTGGCG TGGATCCTGC GTTTTGCGCT GTTTGCTTAC
GGTGACCCGA CTCCGTTCGG TACCGTGCTG CTGGTTCTGT CGATGATTGT TTACGGCTGC
GCATTCGACT TCTTCAACAT CTCTGGTTCG GTGTTTGTCG AAAAAGAAGT TAGCCCGGCA
ATTCGCGCCA GTGCGCAGGG GATGTTCCTG ATGATGACTA ACGGCTTCGG CTGTATCCTC
GGCGGCATCG TGAGCGGTAA AGTGGTTGAG ATGTACACCC AAAACGGCAT TACCGACTGG
CAGACCGTAT GGCTGATTTT CGCGGGTTAC TCCGTGGTTC TGGCCTTCGC GTTCATGGCG
ATGTTCAAAT ATAAACACGT TCGTGTCCCG ACAGGCACAC AGACGGTTAG CCACTAA
 
Protein sequence
MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPALLGI 
VADKWLSAKW VYAICHTIGA ITLFMAAQVT TPEAMFLVIL INSFAYMPTL GLINTISYYR
LQNAGMDIVT DFPPIRIWGT IGFIMAMWVV SLSGFELSHM QLYIGAALSA VLVLFTLTLP
HIPVAKQQAN QSWTTLLGLD AFALFKNKRM AIFFIFSMLL GAELQITNMF GNTFLHSFDK
DPMFASSFIV QHASIIMSIS QISETLFILT IPFFLSRYGI KNVMMISIVA WILRFALFAY
GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVSPA IRASAQGMFL MMTNGFGCIL
GGIVSGKVVE MYTQNGITDW QTVWLIFAGY SVVLAFAFMA MFKYKHVRVP TGTQTVSH