Gene Information |
Locus tag | EcSMS35_3108 |
Symbol | nupG |
ID | 6144908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3192520 |
End bp | 3193776 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617976 |
Product | nucleoside permease NupG |
Protein accession | YP_001745127 |
Protein GI | 170679662 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00889] nucleoside transporter |
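The start, end, gene length, and protein length fields above can be checked against each other with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3192520
end_bp = 3193776

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted
# in the reported protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # 1257 0 418
```

Under these assumptions the computed values match the reported 1257 bp and 418 aa.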
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00661547 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAATCTTA AGCTGCAGCT GAAAATCCTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGT TGGCTGACGA CCCTCGGCTC CTATATGTTT GTTACCCTGA AGTTTGACGG TGCTTCTATC GGCGCAGTTT ATAGTTCACT GGGTATCGCC GCGGTCTTTA TGCCTGCGCT GCTGGGGATT GTGGCCGACA AATGGTTAAG TGCGAAATGG GTATATGCCA TTTGCCACAC CATTGGCGCT ATCACGCTGT TCATGGCGGC ACAGGTCACG ACGCCGGAAG CGATGTTCCT TGTGATATTG ATTAACTCGT TTGCTTATAT GCCAACGCTT GGGTTAATCA ACACCATCTC TTACTATCGC CTGCAAAATG CCGGGATGGA TATCGTTACT GACTTCCCGC CAATCCGTAT CTGGGGCACC ATTGGCTTTA TCATGGCAAT GTGGGTGGTG AGCCTGTCTG GCTTCGAATT AAGCCACATG CAGCTGTATA TTGGCGCAGC TCTTTCCGCC GTTCTGGTTC TGTTTACCCT GACTCTGCCG CACATTCCGG TTGCTAAACA GCAAGCGAAT CAGAGCTGGA CAACCCTGCT GGGCCTCGAT GCATTCGCGC TGTTTAAAAA CAAGCGTATG GCAATCTTCT TCATCTTCTC AATGCTGCTG GGCGCGGAAC TGCAGATTAC CAACATGTTC GGTAACACCT TCCTGCACAG TTTCGACAAA GATCCGATGT TTGCCAGCAG CTTCATCGTG CAGCATGCGT CAATCATCAT GTCGATTTCG CAGATCTCTG AAACGCTGTT CATTCTGACC ATCCCGTTCT TCTTAAGCCG CTACGGCATT AAGAACGTAA TGATGATCAG TATCGTGGCG TGGATCCTGC GTTTTGCGCT GTTTGCTTAC GGTGACCCGA CTCCGTTCGG TACCGTGCTG CTGGTTCTGT CGATGATTGT TTACGGCTGC GCATTCGACT TCTTCAACAT CTCTGGTTCG GTGTTTGTCG AAAAAGAAGT TAGCCCGGCA ATTCGCGCCA GTGCGCAGGG GATGTTCCTG ATGATGACTA ACGGCTTCGG CTGTATCCTC GGCGGCATCG TGAGCGGTAA AGTGGTTGAG ATGTACACCC AAAACGGCAT TACCGACTGG CAGACCGTAT GGCTGATTTT CGCGGGTTAC TCCGTGGTTC TGGCCTTCGC GTTCATGGCG ATGTTCAAAT ATAAACACGT TCGTGTCCCG ACAGGCACAC AGACGGTTAG CCACTAA
Protein sequence | MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPALLGI VADKWLSAKW VYAICHTIGA ITLFMAAQVT TPEAMFLVIL INSFAYMPTL GLINTISYYR LQNAGMDIVT DFPPIRIWGT IGFIMAMWVV SLSGFELSHM QLYIGAALSA VLVLFTLTLP HIPVAKQQAN QSWTTLLGLD AFALFKNKRM AIFFIFSMLL GAELQITNMF GNTFLHSFDK DPMFASSFIV QHASIIMSIS QISETLFILT IPFFLSRYGI KNVMMISIVA WILRFALFAY GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVSPA IRASAQGMFL MMTNGFGCIL GGIVSGKVVE MYTQNGITDW QTVWLIFAGY SVVLAFAFMA MFKYKHVRVP TGTQTVSH
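The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline. The codon table is built by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `translate`, `first_blocks`, and `last_codons` are hypothetical helpers introduced only for this example, and the snippet uses just the first three 10-base blocks and the final 15 bases of the gene sequence.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing in the record) is ignored.
    Characters other than A, C, G, T raise a KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record.
first_blocks = "ATGAATCTTA AGCTGCAGCT GAAAATCCTC"
last_codons = "ACGGTTAGCCACTAA"

print(translate(first_blocks))  # MNLKLQLKIL
print(translate(last_codons))   # TVSH
```

Both fragments agree with the corresponding ends of the protein sequence in the record. The final codon, TAA, is a stop codon and contributes no residue.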