Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1604 |
Symbol | ynfM |
ID | 6145345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1592877 |
End bp | 1594130 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616481 |
Product | major facilitator family transporter |
Protein accession | YP_001743659 |
Protein GI | 170680108 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGTA CTACAACTGT TGATGGCGCT CCGGCAAGCG ACACTGACAA GCAAAGCATT TCTCAGCCAA ATCAATTTAT TAAACGCGGT ACGCCGCAAT TTATGCGCGT CACCCTGGCG CTGTTCTCTG CCGGACTGGC AACCTTCGCA CTTCTCTATT GTGTGCAACC TATCCTTCCG GTGCTTTCGC AGGAGTTTGG CTTAACGCCC GCGAACAGTA GTATTTCACT GTCCATTTCC ACGGCGATGT TGGCTATCGG TTTGCTGTTT ACTGGCCCGT TATCCGATGC CATTGGTCGC AAACCAGTGA TGGTCACGGC GCTACTGTTG GCCTCCATTT GTACGTTACT TTCGACAATG ATGACCAGCT GGCACGGCAT TTTGATTATG CGCGCCTTGA TTGGGCTTTC GTTAAGTGGC GTAGCAGCTG TTGGCATGAC TTATCTTAGC GAGGAAATCC ACCCCAGTTT CGTGGCCTTT TCGATGGGGT TGTATATCAG CGGCAACTCA ATTGGCGGCA TGAGCGGACG CTTAATTAGC GGTGTCTTCA CGGACTTTTT CAACTGGCGA ATTGCGCTGG CGGCAATCGG TTGTGTCGCG CTGGCCTCGG CGTTAATGTT CTGGAAAATC CTCCCTGAAT CACGCCATTT TCGCCCGACT TCGCTGCGAC CTAAGACGTT GTTTATCAAC TTTCGTCTGC ACTGGCGTGA CCGGGGATTA CCGTTATTGT TCGCAGAAGG CTTTTTGCTG ATGGGGTCGT TCGTCACGCT GTTTAATTAC ATCGGCTATC GGTTGATGCT CTCCCCCTGG CATGTCAGTC AGGCTGTGGT TGGCTTATTA TCGCTGGCCT ATTTGACCGG TACATGGAGC TCACCCAAAG CCGGAACCAT GACCACCCGC TATGGGCGCG GTCCGGTGAT GTTGTTTTCG ACGGGGGTTA TGCTGTTTGG TTTACTGATG ACCTTATTCA GCTCGCTGTG GCTGATCTTT GCCGGAATGT TACTCTTCTC AGCCGGATTC TTCGCAGCCC ACTCCGTTGC CAGCAGCTGG ATCGGCCCCC GCGCAAAACG CGCTAAAGGC CAGGCCTCCT CGCTGTATCT GTTCAGTTAC TATCTGGGGT CGAGTATTGC CGGGACGCTG GGTGGTGTTT TCTGGCATAA CTATGGCTGG AACGGCGTCG GCGCATTTAT TGCTCTGATG CTGGTCATTG CTCTGCTGGT CGGGACGCGT TTGCATCATC GTCTGCACGC CTGA
|
Protein sequence | MSRTTTVDGA PASDTDKQSI SQPNQFIKRG TPQFMRVTLA LFSAGLATFA LLYCVQPILP VLSQEFGLTP ANSSISLSIS TAMLAIGLLF TGPLSDAIGR KPVMVTALLL ASICTLLSTM MTSWHGILIM RALIGLSLSG VAAVGMTYLS EEIHPSFVAF SMGLYISGNS IGGMSGRLIS GVFTDFFNWR IALAAIGCVA LASALMFWKI LPESRHFRPT SLRPKTLFIN FRLHWRDRGL PLLFAEGFLL MGSFVTLFNY IGYRLMLSPW HVSQAVVGLL SLAYLTGTWS SPKAGTMTTR YGRGPVMLFS TGVMLFGLLM TLFSSLWLIF AGMLLFSAGF FAAHSVASSW IGPRAKRAKG QASSLYLFSY YLGSSIAGTL GGVFWHNYGW NGVGAFIALM LVIALLVGTR LHHRLHA
|
| |