Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4288 |
Symbol | |
ID | 6147274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4391040 |
End bp | 4392110 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619109 |
Product | putative fructose-specific phosphotransferase system protein FrvX |
Protein accession | YP_001746233 |
Protein GI | 170683436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.427406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTG AGTTGCTGCA ACAGTTGTGC GAAGCCAGCG CCGTCAGCGG CGATGAACAG GAAGTTCGCG ACATTCTGAT AAACACGCTG GAACCTTGCG TTAATGAGAT CACCTTTGAT GGTCTGGGCA GCTTTGTTGC CCGTAAGGGG AATAAAGGGC CAAAAGTTGC TGTTGTCGGG CATATGGATG AAGTCGGCTT TATGGTCACC CACATCGACA AGAGCGGTTT TCTGCGTTTT ACCACCATTG GCGGCTGGTG GAATCAGTCG ATGCTCAACC ACCGGGTAAC CATACGGACA CACAAGGGAG TGAAAATCCC TGGTGTAATT GGCTCCGTCG CCCCACATGC GTTAACCGAA AAGCAAAAGC AACAACCGCT GTCATTTGAT GAGATGTTCA TTGATATTGG CGCGAACAGT CGCGAAGAGG TGGAAAAACG CGGCGTGGAA ATTGGCAATT TTATTAGCCC GGAAGCCAAT TTTGCCTGCT GGGGCGAAGA TAAAGTGGTC GGCAAGGCGT TGGATAACCG CATCGGCTGC GCAATGATGG CCGAGCTACT ACAGACAGTA AATAACCCCG AAATTACGCT TTATGGCGTT GGCAGTGTGG AAGAAGAAGT TGGGCTACGC GGGGCACAAA CCTCGGCTGA ACACATTAAA CCGGATGTGG TGATCGTGCT GGATACCGCC GTCGCGGGCG ATGTTCCGGG CATTGATAAC ATTAAATACC CGCTGAAACT GGGCCAGGGG CCGGGGCTGA TGCTGTTTGA CAAGCGCTAC TTCCCCAACC AGAAACTGGT GGCGGCGTTA AAAAACTGTG CCGCACATAA CGATTTACCG CTGCAATGTT CCACCATGAA AACCGGAGCG ACGGATGGCG GGCGCTACAA CGTGATGGGC GGCGGGCGTC CAGTTGTCGC GCTGTGTCTG CCAACGCGTT ATCTGCACGC CAATAGCGGT ATGATTTCAA AAGCCGATTA CGATGCTCTG CTCACGTTGA TAAGGGATTT TCTGACCACC TTAACCGCGG AGAAAGTCAA CGCGTTTAGC CAGTTCCGTC AGGTGGATTA A
|
Protein sequence | MNIELLQQLC EASAVSGDEQ EVRDILINTL EPCVNEITFD GLGSFVARKG NKGPKVAVVG HMDEVGFMVT HIDKSGFLRF TTIGGWWNQS MLNHRVTIRT HKGVKIPGVI GSVAPHALTE KQKQQPLSFD EMFIDIGANS REEVEKRGVE IGNFISPEAN FACWGEDKVV GKALDNRIGC AMMAELLQTV NNPEITLYGV GSVEEEVGLR GAQTSAEHIK PDVVIVLDTA VAGDVPGIDN IKYPLKLGQG PGLMLFDKRY FPNQKLVAAL KNCAAHNDLP LQCSTMKTGA TDGGRYNVMG GGRPVVALCL PTRYLHANSG MISKADYDAL LTLIRDFLTT LTAEKVNAFS QFRQVD
|
| |