Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3300 |
Symbol | |
ID | 6969084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3032375 |
End bp | 3033625 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387112 |
Product | nucleoside transporter, NupC family |
Protein accession | YP_002271576 |
Protein GI | 209397477 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00807637 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0252399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG TCAGTGAATA AAAAGAGCAT CAGTTTGCGT ACGGTTGCAG CCGCACTACT GCTGCAAATT GCTCTTGGTG GCATCATGCT CTATTTTCCG CCGGGAAAGT GGGCTGTAGA ACAGGCGGCA TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCTGGTA GCGCCTTCAT TTTTGGTTCG CTGGTCGGGC CGAAAATGGA TGTCCTCTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC GTACTTCCGG CGATTATCTT CGTCACTGCG CTCATTAGTC TGCTGTACTA CATTGGCGTG ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCCCTCAA CATCAGCAAA ATCGAATCTT TTGTCGCAGT CACCACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC GTTAAACCGT TTATCGATCG CATGAATCGA AACGAGTTGT TTACGGCTAT TTGTAGCGGG ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC TACCTGTTGG CGGCATCGCT GATGGCGATC CCCGGCGGTA TTTTGTTTGC ACGTATTCTT AGTCCAGCAA CCGAGCTTTC GCAGGTCACG TTTGAAAATC TGTCGTTCAG CGAAACGCCG CCAAAAAGCT TTATCGAAGC AGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT GGTGTGGCGA CGGTGGTAAT GGCATTTGTG GCAATTATTG CGCTGATCAA CGGCATTATC GGCGGAATTG GCGGCTGGTT TGGCTTCGCC AATGCTTCTC TGGAAAGTAT TTTTGGCTAT GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA CCTTGCGGGT AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GGGCATTTTC GGCTATTTCG CCAAAACGCG CACCGGAAAT CGCTCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT TCCAACCTGA TGAGCGCGAC CATTGCCGGG TTCTTTATTG GTCTGGCGTA A
|
Protein sequence | MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVAAALLLQI ALGGIMLYFP PGKWAVEQAA LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATELSQVT FENLSFSETP PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
|
| |