Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3310 |
Symbol | nupG |
ID | 5589872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3320858 |
End bp | 3322114 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640926947 |
Product | nucleoside permease NupG |
Protein accession | YP_001464319 |
Protein GI | 157157375 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00889] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTTA AGCTGCAGCT GAAAATCCTC TCTTTTCTGC AGTTCTGTCT GTGGGGAAGT TGGCTGACGA CCCTCGGCTC CTATATGTTT GTTACCCTGA AGTTTGACGG TGCTTCTATT GGCGCAGTTT ATAGCTCACT GGGTATCGCC GCGGTCTTTA TGCCTGCGCT GCTGGGGATT GTGGCCGACA AATGGTTAAG TGCGAAATGG GTATATGCCA TTTGCCACAC CATTGGCGCT ATCACGCTGT TCATGGCGGC ACAGGTCACG ACACCGGAGG CGATGTTCCT TGTGATATTG ATTAACTCGT TTGCTTATAT GCCAACGCTT GGGTTAATCA ACACCATCTC TTACTATCGC CTGCAAAATG CCGGGATGGA TATCGTTACT GACTTCCCGC CAATCCGTAT CTGGGGCACC ATCGGCTTTA TCATGGCAAT GTGGGTGGTG AGCCTGTCTG GCTTCGAATT AAGCCACATG CAGCTGTATA TTGGCGCAGC ACTTTCCGCC ATTCTGGTTC TGTTTACCCT GACTCTGCCG CATATTCCGG TTGCTAAACA GCAAGCGAAT CAGAGCTGGA CAACCCTGCT GGGCCTCGAT GCATTCGCGC TGTTTAAAAA CAAGCGTATG GCAATCTTCT TTATCTTCTC AATGCTGCTG GGCGCGGAAC TGCAGATTAC CAACATGTTC GGTAATACCT TCCTGCACAG CTTCGACAAA GATCCGATGT TTGCCAGCAG CTTTATTGTG CAGCATGCGT CAATCATCAT GTCGATTTCG CAGATCTCTG AAACCCTGTT CATTCTGACC ATCCCGTTCT TCTTAAGCCG CTACGGTATT AAGAACGTAA TGATGATCAG TATTGTGGCG TGGATCCTGC GTTTTGCGCT GTTTGCTTAC GGCGACCCGA CTCCGTTCGG TACTGTACTG CTGGTACTGT CGATGATCGT TTACGGTTGC GCATTCGACT TCTTCAACAT CTCTGGTTCG GTGTTTGTCG AAAAAGAAGT TAGCCCGGCA ATTCGCGCCA GTGTACAAGG GATGTTCCTG ATGATGACTA ACGGCTTCGG CTGTATCCTC GGCGGCATCG TGAGCGGTAA AGTTGTTGAG ATGTACACCC AAAACGGCAT TACCGACTGG CAGACCGTAT GGTTGATTTT CGCTGGTTAC TCCGTGGTTC TGGCCTTCGC GTTCATGGCG ATGTTCAAAT ATAAACACGT TCGTGTCCCG ACAGGCACAC AGACGGTTAG CCACTAA
|
Protein sequence | MNLKLQLKIL SFLQFCLWGS WLTTLGSYMF VTLKFDGASI GAVYSSLGIA AVFMPALLGI VADKWLSAKW VYAICHTIGA ITLFMAAQVT TPEAMFLVIL INSFAYMPTL GLINTISYYR LQNAGMDIVT DFPPIRIWGT IGFIMAMWVV SLSGFELSHM QLYIGAALSA ILVLFTLTLP HIPVAKQQAN QSWTTLLGLD AFALFKNKRM AIFFIFSMLL GAELQITNMF GNTFLHSFDK DPMFASSFIV QHASIIMSIS QISETLFILT IPFFLSRYGI KNVMMISIVA WILRFALFAY GDPTPFGTVL LVLSMIVYGC AFDFFNISGS VFVEKEVSPA IRASVQGMFL MMTNGFGCIL GGIVSGKVVE MYTQNGITDW QTVWLIFAGY SVVLAFAFMA MFKYKHVRVP TGTQTVSH
|
| |