Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4268 |
Symbol | nupG |
ID | 6971932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3951705 |
End bp | 3952961 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643388006 |
Product | nucleoside permease NupG |
Protein accession | YP_002272445 |
Protein GI | 209400129 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00889] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.218652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTA AGCTGCAGCT GAAAATCCTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGT TGGCTGACGA CCCTCGGTTC CTATATGTTT GTTACCCTGA AGTTTGACGG TGCTTCTATT GGCGCAGTTT ATAGCTCACT GGGTATCGCC GCGGTCTTTA TGCCTGCGCT GCTGGGGATT GTGGCCGACA AATGGTTAAG TGCGAAATGG GTATATGCCA TTTGCCACAC CATTGGCGCT ATCACGCTGT TCATGGCGGC ACAGGTCACG ACGCCGGAGG CGATGTTCCT TGTGATATTG ATTAACTCGT TTGCTTATAT GCCAACGCTT GGGTTAATCA ACACCATCTC TTACTATCGC CTGCAAAATG CCGGGATGGA TATCGTTACT GACTTCCCGC CAATCCGTAT CTGGGGCACC ATCGGCTTTA TCATGGCAAT GTGGGTGGTG AGCCTGTCTG GCTTCGAATT AAGCCACATG CAGCTGTATA TTGGCGCAGC ACTTTCCGCC ATTCTGGTTC TGTTTACCCT GACTCTGCCG CATATTCCGG TTGCTAAACA GCAAGCGAAT CAGAGCTGGA CAACCCTGCT GGGCCTCGAT GCATTCGCGC TGTTTAAAAA CAAGCGTATG GCAATCTTCT TCATCTTCTC AATGCTGCTG GGCGCGGAAC TGCAGATTAC CAACATGTTC GGTAATACCT TCCTGCACAG CTTCGACAAA GATCCGATGT TTGCCAGCAG CTTTATTGTG CAGCATGCGT CAATCATCAT GTCGATTTCG CAGATCTCTG AAACCCTGTT CATTCTGACC ATCCCGTTCT TCTTAAGCCG CTACGGTATT AAGAACGTAA TGATGATCAG TATTGTGGCG TGGATCCTGC GTTTTGCGCT GTTTGCTTAC GGCGACCCGA CTCCGTTCGG CACCGTACTG CTGGTTCTAT CGATGATTGT TTACGGCTGT GCGTTCGACT TCTTCAACAT CTCTGGTTCG GTGTTTGTCG AAAAAGAAGT TAGCCCGGCA ATTCGCGCCA GTGCACAAGG GATGTTCCTG ATGATGACTA ACGGCTTCGG CTGTATCCTC GGCGGCATCG TGAGCGGTAA AGTTGTTGAG ATGTACACCC AAAACGGTAT TACCGACTGG CAGACCGTAT GGTTGATTTT CGCTGGTTAC TCCGTGGTTC TGGCCTTCGC GTTCATGGCG ATGTTCAAAT ATAAACACGT TCGTGTCCCG ACAGGTACAC AGACGGTTAG CCACTAA
|
Protein sequence | MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPALLGI VADKWLSAKW VYAICHTIGA ITLFMAAQVT TPEAMFLVIL INSFAYMPTL GLINTISYYR LQNAGMDIVT DFPPIRIWGT IGFIMAMWVV SLSGFELSHM QLYIGAALSA ILVLFTLTLP HIPVAKQQAN QSWTTLLGLD AFALFKNKRM AIFFIFSMLL GAELQITNMF GNTFLHSFDK DPMFASSFIV QHASIIMSIS QISETLFILT IPFFLSRYGI KNVMMISIVA WILRFALFAY GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVSPA IRASAQGMFL MMTNGFGCIL GGIVSGKVVE MYTQNGITDW QTVWLIFAGY SVVLAFAFMA MFKYKHVRVP TGTQTVSH
|
| |