Gene Information |
Locus tag | EcSMS35_0463 |
Symbol | |
ID | 6144885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 471333 |
End bp | 472697 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615357 |
Product | major facilitator transporter |
Protein accession | YP_001742564 |
Protein GI | 170679618 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
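
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(471333, 472697)
print(length_bp)                  # 1365
print(protein_length(length_bp))  # 454
```

Under these assumptions both values agree with the Gene Length and Protein Length rows of the record.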
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGCACCGTT TTCTCGTTGC GCATGCTGGG CATGTTCATG GTTCTGCCGG TTCTGACCAC GTACGGCATG GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCAGACC GCATTGGTCG CAAACCATTA ATTGTCGGTG GGCTGGCGGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT CTCTGACTCC ATCTGGGGAA TTATTCTGGG CCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT ATGGCGCTGC TTTCCGATCT TACGCGCGAA CAAAACCGCA CCAAAGCGAT GGCGTTTATC GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACAAC CGGCATTGCG TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG GTGAAAGGCA GTTTCAGTAA AGTGCTGGCG GAACCGCGGT TGCTGAAACT CAACTTTGGC ATTATGTGTC TACACATGCT TCTGATGTCT ACGTTTGTTG CCCTGCCCGG ACAGCTGGCT GATGCGGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACGAT GCTAATCGCC TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC TTTGTCTTCT GCGTCGGATT GATCGTGGTT GCGGAAATTG TGTTGTGGAA CGCACAAACG CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TAGCGTTTAA TTTGATGGAA GCCCTTCTGC CTTCACTTAT CAGTAAAGAG TCGCCTGCAG GTTACAAAGG TACGGCGATG GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCGCT GGGCGGCTGG ATTGACGGCA TGTTTGACGG TCAGGGCGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG TGGCTGGCAG TCGCCAGTAC CATGAAAGAA CCGCCGTATG TCAGCAGCTT GCGCATTGAA ATCCCGGCGG ACATTGCCGC AAACGAAGCG TTAAAAGTAC GTTTGCTGGA AACTGAAGGC GTCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCGT ATGTGAAAAT CGACAGCAAA GTGACGAATC GCTTTGAGGT TGAACAGGCA ATTCGCCAGG CATAA
|
Protein sequence | MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHMLLMS TFVALPGQLA DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLWNAQT QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW IDGMFDGQGV FLAGAMLAAV WLAVASTMKE PPYVSSLRIE IPADIAANEA LKVRLLETEG VKEVLIAEEE HSAYVKIDSK VTNRFEVEQA IRQA
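
The relation between the two sequences can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it strips the 10-character block spacing, computes a GC fraction, and translates codons until the first stop. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the sketch uses the standard assignments and does not handle alternative starts (this gene begins with ATG). The displayed gene sequence begins with ATG and translates directly, so it appears to be given in the coding orientation even though the gene lies on the minus strand. All helper names are hypothetical, and only the first and last blocks of the sequence are used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = clean_sequence("ATGAACGATT ATAAAATGAC GCCAGGTGAG")
last_blocks = clean_sequence("ATTCGCCAGG CATAA")
print(translate(first_blocks))  # MNDYKMTPGE
print(translate(last_blocks))   # IRQA
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence row, and `gc_content` on the same string can be compared with the GC content field.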