Gene Information |
Locus tag | ECH74115_3297 |
Symbol | |
ID | 6972024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3029185 |
End bp | 3030435 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387109 |
Product | nucleoside transporter, NupC family |
Protein accession | YP_002271573 |
Protein GI | 209396884 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
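
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3029185
end_bp = 3030435

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, and the final codon is a stop
# codon that is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1251
print(protein_length)  # 416
```

Both results match the Gene Length (1251 bp) and Protein Length (416 aa) fields of the record.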
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0458092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.103773 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTACTG TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC GTGATTGGCG GCATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT TTTGGTGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCT CTGGTCGGAC CGAAAATGGA CACCTTATTT GATGGCGCAG GATTTATCTT TGGTTTCAGG GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG ATTGAGTCAT TCGTCGCGGT CACTACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATT GTGAAGCCCT TTATCGATCG GCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGCGGC ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTATGCCG CACTGGGCGT GCCTGTGGAA TATTTGCTGG CGGCATCGTT AATGGCGATC CCCGGGGGGA TCTTGTTTGC CCGCCTGCTA AGCCCGGCTA CGGAATCTTC GCAGGTTTCT TTTAATAACC TCTCTTTCAC CGAAACACCG CCAAAAAGCA TTATTGAAGC CGCCGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCA GGTGTGGCGA CAGTAGTGAT GGCATTCGTC GCCATCATTG CGTTAATTAA CGGTATTATC GGCGGCGTTG GCGGCTGGTT TGGTTTTGAA CATGCCTCGC TGGAGTCCAT TGTAGGTTAT CTGCTGGCCC CACTGGCGTG GGTAATGGGT GTTGACTGGA GTGATGCGAA TCTTGCCGGG AGTTTGATTG GACAGAAACT GGCAATAAAT GAATTTGTCG CTTATCTCAA TTTCTCACCC TATCTGCAAA CGGCTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC CTTCGCGTTG TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG CCACACCGTG CGCCGGAAAT CGCCCAGCTT GGTTTACGCG CGCTGGCGGC GGCGACACTT TCTAACCTGA TGAGTGCCAC CATTGCCGGG TTCTTTATTG GTTTAGCGTA G
|
Protein sequence | MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFE HASLESIVGY LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTAGTLDA KTVAIISFAL CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
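
The relationship between the gene sequence and the protein sequence above can be illustrated by translating short fragments of the gene. This is a sketch only, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the two fragments are the first 30 and last 21 bases copied from the Gene sequence field.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base
# varying slowest and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Fragments copied from the Gene sequence field of this record.
first_30 = "ATGGATGTCA TGAGAAGTGT TCTGGGAATG"
last_21 = "TTCTTTATTG GTTTAGCGTA G"

print(translate(first_30))  # MDVMRSVLGM
print(translate(last_21))   # FFIGLA
```

The two outputs match the first ten and last six residues of the Protein sequence field, and the final codon TAG is the stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.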