Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2314 |
Symbol | fruA |
ID | 6144771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2345377 |
End bp | 2347068 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617188 |
Product | PTS system fructose-specific transporter subunits IIBC |
Protein accession | YP_001744361 |
Protein GI | 170681268 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component [COG1445] Phosphotransferase system fructose-specific component IIB |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00422844 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00031102 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGC TGCTGATTAT TGACGCTAAT CTCGGTCAGG CACGCGCCTA TATGGCGAAG ACCCTGCTGG GCGCGGCGGC GCGAAAAGCA AAACTGGAAA TCATCGACAA TCCGAACGAC GCAGAAATGG CGATTGTTCT CGGTGATTCC ATCCCGAATG ACAGCGCGCT GAACGGTAAA AATGTCTGGC TGGGCGATAT TTCCCGGGCA GTTGCGCACC CTGAGCTGTT CCTGAGTGAA GCCAAAGGCC ATGCGAAACC TTACACTGCG CCGGTCGCTG CGACAGCACC GGTTGCCGCC AGCGGTCCGA AACGCGTAGT TGCGGTGACT GCTTGCCCGA CTGGCGTAGC ACACACCTTT ATGGCGGCTG AAGCCATTGA AACCGAAGCG AAAAAACGTG GCTGGTGGGT GAAAGTTGAA ACCCGTGGTT CTGTTGGCGC GGGTAATGCA ATCACCCCTG AAGAAGTCGC AGCAGCGGAT CTGGTGATTG TGGCGGCAGA TATCGAAGTG GATCTGGCGA AATTTGCTGG TAAACCGATG TATCGCACCT CTACCGGTCT GGCGCTGAAG AAAACTGCGC AGGAACTGGA TAAAGCGGTT GCTGAAGCAA CGCCGTATGA ACCAGCGGGC AAAGCTCAAA CAGCGACCAC TGAAGGTAAG AAAGAGAGTG CAGGCGCTTA CCGTCACTTG CTGACGGGCG TCTCTTACAT GCTGCCGATG GTCGTTGCAG GTGGTCTGTG TATCGCGCTT TCTTTTGCTT TTGGTATCGA AGCGTTTAAA GAGCCGGGTA CGTTGGCTGC GGCGCTGATG CAGATTGGTG GTGGTTCAGC CTTTGCGTTG ATGGTGCCGG TACTGGCAGG TTATATTGCC TTTTCCATTG CCGATCGTCC GGGTCTCACG CCGGGTCTGA TTGGCGGTAT GCTGGCGGTC AGCACCGGTT CTGGCTTCAT TGGCGGTATT ATTGCGGGCT TCCTGGCTGG TTACATTGCG AAGTTAATCA GTACGCAATT GAAACTGCCA CAGAGTATGG AGGCGCTGAA ACCGATTCTG ATCATTCCGC TAATTTCCAG TCTGGTGGTC GGTCTGGCGA TGATCTACCT GATCGGTAAA CCGGTTGCTG GCATTCTCGA AGGGTTGACT CACTGGCTGC AGACCATGGG GACTGCGAAT GCGGTTCTGC TGGGGGCGAT CCTCGGTGGC ATGATGTGTA CTGACATGGG CGGTCCGGTA AACAAAGCAG CGTACGCATT CGGTGTGGGT CTGCTGAGTA CTCAAACCTA TGGCCCGATG GCGGCGATTA TGGCGGCAGG TATGGTGCCA CCGCTGGCAA TGGGCCTGGC AACAATGGTG GCGCGTCGCA AATTCGACAA GGCGCAGCAG GAAGGGGGCA AAGCTGCTCT GGTACTGGGA CTGTGCTTTA TTTCGGAAGG TGCAATTCCG TTTGCTGCCC GTGATCCGAT GCGTGTGCTG CCGTGCTGTA TCGTGGGTGG GGCGCTGACT GGCGCAATCT CAATGGCGAT TGGTGCGAAA CTGATGGCAC CGCACGGTGG TCTGTTTGTT CTGCTGATCC CTGGCGCGAT TACGCCGGTA TTGGGTTACC TGGTAGCCAT TATTGCCGGT ACGCTGGTGG CGGGGTTGGC CTATGCCTTC CTGAAACGTC CGGAAGTGGA CGCAGTAGCG AAAGCAGCGT AA
|
Protein sequence | MKTLLIIDAN LGQARAYMAK TLLGAAARKA KLEIIDNPND AEMAIVLGDS IPNDSALNGK NVWLGDISRA VAHPELFLSE AKGHAKPYTA PVAATAPVAA SGPKRVVAVT ACPTGVAHTF MAAEAIETEA KKRGWWVKVE TRGSVGAGNA ITPEEVAAAD LVIVAADIEV DLAKFAGKPM YRTSTGLALK KTAQELDKAV AEATPYEPAG KAQTATTEGK KESAGAYRHL LTGVSYMLPM VVAGGLCIAL SFAFGIEAFK EPGTLAAALM QIGGGSAFAL MVPVLAGYIA FSIADRPGLT PGLIGGMLAV STGSGFIGGI IAGFLAGYIA KLISTQLKLP QSMEALKPIL IIPLISSLVV GLAMIYLIGK PVAGILEGLT HWLQTMGTAN AVLLGAILGG MMCTDMGGPV NKAAYAFGVG LLSTQTYGPM AAIMAAGMVP PLAMGLATMV ARRKFDKAQQ EGGKAALVLG LCFISEGAIP FAARDPMRVL PCCIVGGALT GAISMAIGAK LMAPHGGLFV LLIPGAITPV LGYLVAIIAG TLVAGLAYAF LKRPEVDAVA KAA
|
| |