Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2297 |
Symbol | |
ID | 5591814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2295071 |
End bp | 2296321 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921425 |
Product | NupC family nucleoside transporter |
Protein accession | YP_001458961 |
Protein GI | 157161643 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC GTGATTGGCG GGATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC GTCAAACCCT TTATCGATCG TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC GGTGTAGCGA CAGTTGTTAT GGCATTTGTC GCCATCATTG CGTTAATTAA TGGTATTATC GGCGGCGTTG GCGGCTGGTT TGGTTTTGCA CATGCCTCGC TGGAGTCCAT TTTAGGTTAC CTGTTGGCCC CATTGGCGTG GGTGATGGGG GTTGACTGGA GTGATGCAAA TCTTGCCGGG AGTTTGATTG GGCAGAAGCT GGCGATCAAT GAATTTGTCG CTTATCTCAA TTTCTCGCCA TATCTGCAAA CGGGTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC TTTCGCGTTG TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG CCACACCGTG CGCCGGAAAT CGCCCAACTT GGTTTACGCG CGCTGGCGGC GGCGACACTT TCTAACCTGA TGAGTGCTAC TATTGCAGGA TTCTTTATTG GTTTAGCGTA G
|
Protein sequence | MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFA HASLESILGY LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTGGTLDA KTVAIISFAL CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
|
| |