Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1670 |
Symbol | ynfM |
ID | 5595214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1691834 |
End bp | 1693087 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920818 |
Product | major facilitator family transporter |
Protein accession | YP_001458374 |
Protein GI | 157161056 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGTA CTACAACTGT TGATGGCGCT CCGGCAAGCG ACACTGACAA GCAAAGCATT TCTCAGCCAA ATCAATTTAT TAAACGCGGT ACGCCGCAAT TTATGCGCGT CACCCTGGCG CTGTTCTCTG CCGGACTGGC AACATTTGCA CTTCTCTATT GTGTGCAGCC TATCCTTCCG GTGCTTTCGC AGGAGTTTGG CTTAACCCCC GCGAACAGTA GTATTTCACT GTCCATTTCC ACGGCGATGT TGGCTATTGG TTTGCTGTTT ACTGGCCCGC TATCCGATGC CATTGGTCGC AAACCAGTGA TGGTCACGGC GCTACTGTTG GCCTCCATTT GTACGTTACT TTCGACAATG ATGACCAGCT GGCACGGCAT TTTGATTATG CGCGCCTTGA TTGGGCTTTC GTTAAGTGGC GTGGCAGCTG TTGGCATGAC TTATCTTAGC GAGGAAATCC ATCCCAGTTT CGTGGCCTTT TCAATGGGGT TGTATATCAG CGGCAACTCA ATTGGCGGCA TGAGCGGACG CTTAATTAGC GGTGTCTTCA CGGACTTTTT CAACTGGCGA ATTGCTCTGG CGGCAATCGG TTGTTTCGCG CTGGCCTCGG CGTTGATGTT CTGGAAAATC CTCCCTGAAT CACGCCATTT TCGCCCGACT TCGCTGCGCC CTAAGACGTT GTTTATCAAC TTTCGTCTGC ACTGGCGTGA CCGGGGATTA CCGTTATTGT TCGCAGAAGG CTTTTTGCTG ATGGGGTCGT TCGTCACGCT GTTTAATTAC ATCGGCTATC GGTTGATGCT CTCCCCCTGG CATGTCAGTC AGGCCGTGGT TGGCTTATTA TCGCTGGCTT ATTTGACCGG TACATGGAGC TCACCCAAAG CCGGAACCAT GACCACCCGC TATGGGCGTG GTCCAGTGAT GTTGTTTTCG ACGGGGGTTA TGCTGTTTGG TTTACTGATG ACCTTATTCA GCTCGCTGTG GCTGATCTTT GCCGGAATGT TACTCTTCTC AGCAGGATTC TTCGCAGCCC ACTCAGTAGC CAGCAGCTGG ATCGGCCCCC GCGCAAAACG CGCTAAAGGC CAGGCCTCCT CGCTGTATCT GTTCAGTTAC TATCTGGGGT CGAGTATTGC CGGGACGCTG GGTGGTGTTT TCTGGCATAA CTATGGCTGG AACGGCGTCG GCGCATTTAT TGCTCTGATG CTGGTCATTG CTCTGCTGGT CGGGACGCGT TTGCATCGTC GTCTGCACGC CTGA
|
Protein sequence | MSRTTTVDGA PASDTDKQSI SQPNQFIKRG TPQFMRVTLA LFSAGLATFA LLYCVQPILP VLSQEFGLTP ANSSISLSIS TAMLAIGLLF TGPLSDAIGR KPVMVTALLL ASICTLLSTM MTSWHGILIM RALIGLSLSG VAAVGMTYLS EEIHPSFVAF SMGLYISGNS IGGMSGRLIS GVFTDFFNWR IALAAIGCFA LASALMFWKI LPESRHFRPT SLRPKTLFIN FRLHWRDRGL PLLFAEGFLL MGSFVTLFNY IGYRLMLSPW HVSQAVVGLL SLAYLTGTWS SPKAGTMTTR YGRGPVMLFS TGVMLFGLLM TLFSSLWLIF AGMLLFSAGF FAAHSVASSW IGPRAKRAKG QASSLYLFSY YLGSSIAGTL GGVFWHNYGW NGVGAFIALM LVIALLVGTR LHRRLHA
|
| |