Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4197 |
Symbol | |
ID | 6142696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4297574 |
End bp | 4298797 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619020 |
Product | cytosine/purines/uracil/thiamine/allantoin family permease |
Protein accession | YP_001746148 |
Protein GI | 170680197 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1457] Purine-cytosine permease and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00118639 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTAAAA AAGAAGAGAA TCTGAATACG GCATCAGGAT TGCGTATTGC CATGATTTTG CTGGGTATTG CCGTCACACC TGTGCTGTTG TCATCCTCAA GCCTCGGCAA TCAACTTTCC AGCGGCAGTT TAATTAGCGT CGTATTGTTA GGCGGCGTCA TTCTGACCTT ACTTTCAGCC ATCACCATTA GCGTGGGAGA AAAAGCCCGC CTACCAACGT ATGGCATTGT GAAATATTCG TTTGGCGAAA AAGGGGCCAT CGCCATTAAC ATTTTGATGG CGATAAGTCT GTTCGGCTGG ATTGCCGTTA CCGCCAATAT GTTTGGTCAT TCGGTACATG ACTTACTGGC TCAACATGGA CTGGAAGTTC CACTGGCACT GTTAGTGGCG GCTGGCTGTG TCATTTTTGT CGCCTCTACG GCATTTGGCT TTGCCGTTCT GGGAAAAATT GCCCAGGTTG CCGTGCCGGT TATCGCGCTG GTGCTGTGTT ACATCCTCTA TGTGGCAACC CATACCGAAG TGGCAGTACC AGCGGCGATT GTGGAGATGA ATACAGGTGT CGCCGTTTCC ACCGTTGTTG GCACCATTAT TGTGCTGGTT GCCACACTGC CTGATTTCGG TAGTTTTGTG CATAACCGCA AACATGCGCT GATTGCCGCA GGCGTGACGT TTCTGGTTGC CTACCCTCTG CTCTACTGGG CGGGTGCAAC GCCGAGCGCC ATTAGTGGTC AGGGATCTTT ACTGGGTGCG ATGGCGGTAT TCGGTGCGGT TCTGCCTGCG GCGCTGTTGT TGATTTTCGC CTGCGTCACC GGTAACGCGG GCAATATGTT CCAGGGCACG CTGGTGGTTT CCACACTGCT TACCCGCTTT CCCAAATGGC AGATTACCGT GGCGCTGGGT ATCCTTTCCG CCATCGTAGG CAGTATGGAT ATTATGGCGT GGTTTATTCC GTTTCTGCTG TTCCTGGGTA TCGCCACGCC ACCCGTTGCC GGAATTTATA TCGCTGACTT TTTCTTTTAT CGCCGTAATG GCTATCAAGA GTCAGTGTTA GCCCAGGAGT CACAGATTAA AGTGCTGACA TTCGCAGCAT GGATCATAGG CGCAGCGGTT GGCTTTATGA CCGTAAAAGG CTTATTCACC CTGACGACGA TCCCTTCGGT AGACTCGATT CTGGTGGCAT GTATCGCTTA TGCGATTCTC AGTCGGGCAA GTCAACACCG CTAA
|
Protein sequence | MRKKEENLNT ASGLRIAMIL LGIAVTPVLL SSSSLGNQLS SGSLISVVLL GGVILTLLSA ITISVGEKAR LPTYGIVKYS FGEKGAIAIN ILMAISLFGW IAVTANMFGH SVHDLLAQHG LEVPLALLVA AGCVIFVAST AFGFAVLGKI AQVAVPVIAL VLCYILYVAT HTEVAVPAAI VEMNTGVAVS TVVGTIIVLV ATLPDFGSFV HNRKHALIAA GVTFLVAYPL LYWAGATPSA ISGQGSLLGA MAVFGAVLPA ALLLIFACVT GNAGNMFQGT LVVSTLLTRF PKWQITVALG ILSAIVGSMD IMAWFIPFLL FLGIATPPVA GIYIADFFFY RRNGYQESVL AQESQIKVLT FAAWIIGAAV GFMTVKGLFT LTTIPSVDSI LVACIAYAIL SRASQHR
|
| |