Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2308 |
Symbol | |
ID | 6142796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2338555 |
End bp | 2339805 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617182 |
Product | NupC family nucleoside transporter |
Protein accession | YP_001744355 |
Protein GI | 170684054 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.903484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000933793 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGGCAATTGC ATTTTTGCTG TCAGTAAACA AGAAGAGGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC GTGATTGGCG GCATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT TTTGGCGTCC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGTGCAG GATTTATCTT TGGTTTCAGA GTATTACCGG CAATTATCTT CGTCACCGCG CTGGTGAGTA TTCTCTACTA CATCGGTGTG ATGGGGATCT TAATTCGCAT TCTCGGCGGT ATCTTCCAGA AAGCATTAAA TATCAGCAAG ATCGAGTCAT TCGTCGCGGT CACTACCATT TTCCTCGGGC AAAACGAAAT TCCGGCAATC GTCAAACCCT TTATCGATCG GCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CACTGGGCGT ACCTGTGGAA TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA AGCCCGGCTA CGGAATCTTC GCAGGTTTCT TTTAATAACC TCTCTTTCAC CGAAACACCG CCAAAAAGTA TTATTGAGGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCTGCG GGTGTGGCAA CAGTGGTGAT GGCATTTGTT GCCATCATTG CGTTAATTAA CGGTATTATC GGCGGCGTTG GCGGCTGGTT TGGTTTTGAG CAAGCCTCGC TGGAGTCCAT TTTAGGTTAT CTGCTGGCCC CGCTGGCGTG GGTGATGGGG GTTGACTGGA GTGATGCAAA TCTTGCCGGG AGTTTGATTG GGCAGAAGTT GGCGATCAAT GAATTTGTCG CTTACCTCAA TTTCTCACCT TATCTGCAAA CAGCTGGCAC GCTCGATGCT AAAACCGTGG CGATTATTTC CTTCGCGTTG TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTAGTGGTTG GGGCATTTTC TGCGGTTGCG CCACACCGCG CGCCGGAAAT CGCCCAGCTT GGTTTACGCG CGCTGGCGGC GGCGACACTT TCTAACCTGA TGAGTGCCAC CATTGCAGGA TTCTTTATTG GCTTAGCGTA G
|
Protein sequence | MDVMRSVLGM VVLLAIAFLL SVNKKRISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFE QASLESILGY LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTAGTLDA KTVAIISFAL CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
|
| |