Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2311 |
Symbol | |
ID | 6143241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2341742 |
End bp | 2342992 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617185 |
Product | NupC family nucleoside transporter |
Protein accession | YP_001744358 |
Protein GI | 170681919 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0705452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000255025 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTTTGTTG TCAGTGAATA AAAAGAGCAT CAGTTTGCGC ACGGTTGGAG CCGCGTTGCT GCTACAAATC GCCATTGGCG GAATCATGCT CTATTTTCCA CCAGGAAAAT GGGCGGTAGA ACAGGCGGCA TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG CTGGTTGGGC CGAAAATGGA TGTCCTCTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC GTACTCCCGG CGATTATCTT CGTCACTGCG CTCATTAGTC TGCTGTACTA CATTGGCGTG ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCCCTCAA CATCAGCAAA ATCGAATCTT TTGTTGCGGT TACCACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG ATGGCGTCCA TTGCGGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC TACCTGTTAG CGGCATCGCT GATGGCGATC CCTGGCGGGA TTTTGTTTGC ACGTATTCTT AGCCCGGCAA CCGAGCCTTC GCAGGTCACA TTTGAAAATT TATCGTTCAG CGAAACGCCG CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT GGTGTGGCGA CGGTGGTAAT GGTATTTGTG GCAATTATTG CGCTGATCAA CGGCATTATC GGCGGAATTG GCGGCTGGTT TGGCTTCGCC AATGCTTCTC TGGAAAGTAT TTTTGGCTAT GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA CCTTGCGGGT AGCCTGATTG GGCAGAAACT GGCAATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GGGCATTTTC GGCTATTTCG CCAAAACGCG CACCGGAAAT CGCTCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT TCCAACCTGA TGAGCGCGAC CATTGCCGGG TTCTTTATTG GTTTAGCTTG A
|
Protein sequence | MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP PKSFIEAAAS GAMTGLKIAA GVATVVMVFV AIIALINGII GGIGGWFGFA NASLESIFGY VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
|
| |