Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2544 |
Symbol | nupC |
ID | 6143290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2603776 |
End bp | 2604978 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617416 |
Product | nucleoside transporter NupC |
Protein accession | YP_001744587 |
Protein GI | 170683520 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000306295 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGCG TCCTTCATTT TGTACTGGCA CTTGCCGTTG TTGCGATTCT CGCACTGCTG GTAAGCAGCG ACCGCAAAAA AATTCGTATC CGTTATGTTA TTCAACTGCT TGTTATCGAA GTGTTACTGG CGTGGTTCTT CCTGAACTCC GACGTTGGTT TAGGCTTCGT GAAAGGCTTC TCCGAAATGT TCGAAAAACT GCTCGGATTT GCCAACGAAG GGACTAACTT CGTCTTTGGT AGCATGAATG ATCAAGGCCT GGCATTCTTC TTTCTGAAAG TGCTGTGCCC AATCGTCTTT ATCTCTGCAT TGATCGGTAT TCTCCAGCAC ATTCGCGTGT TGCCAGTGAT TATCCGCGCA ATTGGTTTCC TGCTTTCCAA AGTCAACGGT ATGGGCAAAC TGGAATCCTT TAACGCCGTC AGCTCCCTGA TTCTGGGTCA GTCTGAAAAC TTTATTGCCT ATAAAGATAT CCTCGGCAAA ATCTCCCGCA ATCGTATGTA CACCATGGCA GCAACGGCGA TGTCCACCGT GTCGATGTCC ATCGTTGGTG CATACATGAC CATGCTGGAG CCGAAATATG TCGTTGCGGC GCTGGTACTG AACATGTTCA GCACCTTTAT CGTGCTGTCG CTGATCAACC CTTACCGTGT TGATGCCAGT GAAGAAAACA TTCAGATGTC CAACCTGCAC GAAGGTCAGA GCTTCTTCGA AATGCTGGGT GAATACATTC TGGCAGGTTT CAAAGTTGCC ATTATCGTTG CCGCAATGCT GATCGGCTTT ATCGCCCTGA TCGCCGCGCT GAACGCACTG TTTGCTACCG TGACTGGCTG GTTTGGCTAC AGCATCTCCT TCCAGGGCAT CCTGGGCTAC ATCTTCTATC CGATTGCATG GGTGATGGGT GTTCCTTCCA GTGAAGCACT GCAAGTGGGC AGTATCATGG CGACCAAACT GGTTTCCAAC GAGTTCGTTG CGATGATGGA TCTGCAGAAA ATTGCTTCCA CGCTCTCTCC GCGTGCTGAA GGCATCATCT CTGTGTTCCT GGTTTCCTTC GCTAACTTCT CATCAATCGG GATTATCGCA GGTGCAGTTA AAGGCCTGAA TGAAGAGCAA GGTAACGTGG TTTCTCGCTT CGGTCTGAAA CTGGTTTACG GCTCTACCCT GGTGAGTGTG CTGTCTGCGT CAATCGCAGC ACTGGTGCTG TAA
|
Protein sequence | MDRVLHFVLA LAVVAILALL VSSDRKKIRI RYVIQLLVIE VLLAWFFLNS DVGLGFVKGF SEMFEKLLGF ANEGTNFVFG SMNDQGLAFF FLKVLCPIVF ISALIGILQH IRVLPVIIRA IGFLLSKVNG MGKLESFNAV SSLILGQSEN FIAYKDILGK ISRNRMYTMA ATAMSTVSMS IVGAYMTMLE PKYVVAALVL NMFSTFIVLS LINPYRVDAS EENIQMSNLH EGQSFFEMLG EYILAGFKVA IIVAAMLIGF IALIAALNAL FATVTGWFGY SISFQGILGY IFYPIAWVMG VPSSEALQVG SIMATKLVSN EFVAMMDLQK IASTLSPRAE GIISVFLVSF ANFSSIGIIA GAVKGLNEEQ GNVVSRFGLK LVYGSTLVSV LSASIAALVL
|
| |