Gene Information |
Locus tag | EcSMS35_0047 |
Symbol | |
ID | 6145129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 50989 |
End bp | 52320 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641614948 |
Product | major facilitator family transporter |
Protein accession | YP_001742164 |
Protein GI | 170681587 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
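The coordinate and length fields above agree with each other, and a short sketch can check that. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 50989
end_bp = 52320


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded, assuming an unspliced CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1332 443
```

The two printed values match the Gene Length (1332 bp) and Protein Length (443 aa) fields.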
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.454742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCGT CCAGAAACTT TGACGATCTT AAATTCTCCT CTATTCACCG CCGCATTTTG CTGTGGGGAA GCGGTGGTCC GTTTCTGGAT GGTTATGTAC TGGTAATGAT TGGCGTGGCG CTGGAGCAAC TGACTCCGGC GCTGAAACTG GACGCTGACT GGATTGGCTT GCTGGGCGCG GGAACGCTCG CCGGGCTGTT CGTTGGCACA TCGCTGTTTG GCTATATCTC CGATAAAGTC GGACGGCGCA AAATGTTCCT CATTGATATC ATCGCCATCG GCGTGATATC TGTGGCGACG ATGTTTGTCT CATCCCCCGT CGAACTGTTG GTGATGCGGG TACTTATCGG CATTGTCATC GGTGCGGATT ATCCCATCGC CACCTCAATG ATCACCGAGT TCTCCAGTAC CCGTCAGCGG GCGTTTTCCA TCAGCTTTAT TGCCGCGATG TGGTATGTCG GCGCGACCTG CGCCGATCTG GTCGGTTACT GGCTTTATGA TGTGGAAGGT GGTTGGCGCT GGATGCTGGG TAGCGCAGCG ATCCCCTGTT TGTTGATTTT GATTGGTCGA TTCGAACTGC CTGAATCTCC CCGCTGGTTG TTACGCAAAG GGCGAGTAAA AGAGTGCGAA GAGATGATGA TAAAACTGTT TGGCGAACCG GTGGCTTTCG ATGAAGAGCA GCCGCAGCAA ACCCGTTTTC GCGATCTGTT TAATCGCCGC CATTTTCCTT TTGTTCTGTT TGTTGCCGCC ATCTGGACCT GCCAGGTGAT CCCCATGTTC GCCATTTACA CCTTTGGCCC GCAAATCGTT GGTTTGTTGG GATTGGGAGT TGGCAAAAAC GCGGCGTTGG GGAATGTGGT GATTAGCCTG TTCTTTATGC TTGGCTGTAT TCCGCCGATG CTGTGGTTAA ACACCGCCGG ACGGCGTCCA TTGTTGATTG GCAGTTTTGC CATGATGACG CTGGCGCTGG CGGTTTTGGG GCTGATCCCG GATATGGGGA TCTGGCTGGT AGTGATGGCA TTTGCGGTGT ATGCCTTTTT CTCTGGCGGG CCGGGTAATT TGCAGTGGCT CTATCCTAAT GAACTCTTCC CGACGGATAT CCGCGCCTCT GCCGTGGGCG TGATTATGTC CTTAAGCCGT ATTGGCACCA TTGTTTCGAC CTGGGCACTG CCAATCTTTA TCAATAATTA CGGCATCAGT AGCACCATGC TGATGGGGGC GGGTATCTCG CTATTTGGCT TGTTGATTTC CGTAGCGTTT GCTCCGGAGA CTCGAGGGAT GTCACTGGCG CAGACCAGCA ATATGACGAT CCGCGGGCAG AGAATGGGGT AA
|
Protein sequence | MQPSRNFDDL KFSSIHRRIL LWGSGGPFLD GYVLVMIGVA LEQLTPALKL DADWIGLLGA GTLAGLFVGT SLFGYISDKV GRRKMFLIDI IAIGVISVAT MFVSSPVELL VMRVLIGIVI GADYPIATSM ITEFSSTRQR AFSISFIAAM WYVGATCADL VGYWLYDVEG GWRWMLGSAA IPCLLILIGR FELPESPRWL LRKGRVKECE EMMIKLFGEP VAFDEEQPQQ TRFRDLFNRR HFPFVLFVAA IWTCQVIPMF AIYTFGPQIV GLLGLGVGKN AALGNVVISL FFMLGCIPPM LWLNTAGRRP LLIGSFAMMT LALAVLGLIP DMGIWLVVMA FAVYAFFSGG PGNLQWLYPN ELFPTDIRAS AVGVIMSLSR IGTIVSTWAL PIFINNYGIS STMLMGAGIS LFGLLISVAF APETRGMSLA QTSNMTIRGQ RMG
|
| |
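The gene and protein sequences above can be cross-checked by translating the DNA and comparing the result with the listed protein. The following is a minimal sketch, not the database's own method. It assumes the space-separated 10-base groups are only a display convention. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG. The names `translate`, `gc_percent`, and `clean` are illustrative.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCAACCGT CCAGAAACTT TGACGATCTT"
print(translate(prefix))  # MQPSRNFDDL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` on the same input can be compared with the GC content field.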