Gene Information |
Locus tag | ECH74115_3624 |
Symbol | nupC |
ID | 6970185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3345291 |
End bp | 3346493 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387419 |
Product | nucleoside transporter NupC |
Protein accession | YP_002271878 |
Protein GI | 209400703 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
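
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3345291, 3346493)
print(length_bp)                  # 1203
print(protein_length(length_bp))  # 400
```

Under these assumptions the values agree with the reported gene length of 1203 bp and protein length of 400 aa.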
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00341076 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.575462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGCG TCCTTCATTT TGTACTGGCA CTTGCCGTTG TTGCGATCCT CGCACTGCTG GTAAGCAGCG ACCGCAAAAA AATTCGTATC CGTTATGTTA TTCAACTGCT TGTTATCGAA GTGTTACTGG CGTGGTTCTT CCTGAACTCC GACGTTGGTT TAGGCTTCGT GAAAGGCTTC TCCGAAATGT TCGAAAAACT GCTCGGATTT GCCAACGAAG GGACTAACTT CGTCTTTGGT AGCATGAATG ATCAAGGCCT GGCATTCTTC TTCCTGAAAG TGCTGTGCCC AATCGTCTTT ATCTCTGCAC TGATCGGTAT TCTCCAGCAC ATTCGCGTGT TGCCGGTGAT TATCCGCGCA ATTGGTTTCC TGCTCTCCAA AGTCAACGGC ATGGGCAAAC TGGAATCCTT TAACGCCGTC AGCTCCCTGA TTCTGGGTCA GTCTGAAAAC TTTATTGCCT ATAAAGATAT CCTTGGCAAA ATCTCCCGTA ACCGTATGTA CACCATGGCA GCAACGGCGA TGTCCACCGT TTCTATGTCC ATCGTTGGTG CATACATGAC CATGCTGGAA CCAAAATACG TCGTTGCAGC GCTGGTACTG AACATGTTCA GCACCTTTAT CGTGCTGTCG CTGATCAACC CTTACCGTGT TGATGCCTGT GAAGAAAACA TTCAGATGTC CAACCTGCAC GAAGGTCAGA GCTTCTTCGA AATGCTGGGT GAATACATTC TGGCAGGTTT CAAAGTTGCC ATTATCGTTG CCGCAATGCT GATTGGCTTT ATCGCCCTGA TCGCCGCGCT GAACGCACTG TTTGCTACCG TGACTGGCTG GATTGGCTAC AGCATCTCCT TCCAGGGCAT CCTGGGCTAC ATCTTCTATC CGATTGCATG GGTGATGGGT GTTCCTTCCA GTGAAGCACT GCAAGTGGGC AGTATCATGG CGACCAAACT GGTTTCCAAC GAGTTCGTTG CGATGATGGA TCTTCAGAAA ATTGCCTCTA CACTCTCTCC GCGTGCTGAA GGCATCATCT CTGTGTTCCT GGTTTCCTTC GCTAACTTCT CTTCAATCGG GATTATCGCA GGTGCAGTTA AAGGCCTGAA TGAAGAGCAA GGTAACGTGG TTTCTCGCTT CGGTCTGAAG CTGGTTTACG GCTCTACCCT GGTGAGTGTG CTGTCTGCGT CAATCGCAGC ACTGGTGCTG TAA
|
Protein sequence | MDRVLHFVLA LAVVAILALL VSSDRKKIRI RYVIQLLVIE VLLAWFFLNS DVGLGFVKGF SEMFEKLLGF ANEGTNFVFG SMNDQGLAFF FLKVLCPIVF ISALIGILQH IRVLPVIIRA IGFLLSKVNG MGKLESFNAV SSLILGQSEN FIAYKDILGK ISRNRMYTMA ATAMSTVSMS IVGAYMTMLE PKYVVAALVL NMFSTFIVLS LINPYRVDAC EENIQMSNLH EGQSFFEMLG EYILAGFKVA IIVAAMLIGF IALIAALNAL FATVTGWIGY SISFQGILGY IFYPIAWVMG VPSSEALQVG SIMATKLVSN EFVAMMDLQK IASTLSPRAE GIISVFLVSF ANFSSIGIIA GAVKGLNEEQ GNVVSRFGLK LVYGSTLVSV LSASIAALVL
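
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a hedged illustration rather than the pipeline used to produce this record: it builds the codon table from the conventional TCAG ordering (table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), strips the 10-character block spacing used above, and translates until the first stop codon. A helper for GC percentage is included so the result can be compared against the reported 49%. All names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order (NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGGACCGCG TCCTTCATTT TGTACTGGCA"))  # MDRVLHFVLA
```

Passing the full gene sequence to `translate` and `gc_percent` allows the listed protein sequence and GC content to be checked against the nucleotide data.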