Gene Information |
Locus tag | EcSMS35_1537 |
Symbol | |
ID | 6146334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1521737 |
End bp | 1522948 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616414 |
Product | inner membrane transport protein YdhC |
Protein accession | YP_001743592 |
Protein GI | 170680618 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00710] drug resistance transporter, Bcr/CflA subfamily; [TIGR00880] Multidrug resistance protein |
|
|
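The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1521737, 1522948)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Both values agree with the Gene Length and Protein Length rows of this record.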
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0900835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0201427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTG GGAAAAGATT TTTAGTCTGG CTGGCAGGTT TGAGCGTACT CGGTTTTCTG GCAACCGATA TGTATCTGCC TGCTTTCGCC GCCATACAGG CCGACCTGCA AACGCCTGCG TCTGCTGTCA GTGCCAGCCT TAGTCTGTTC CTTGCCGGAT TTGCCGCAGC CCAGCTTCTG TGGGGGCCGC TCTCCGACCG TTATGGTCGT AAACCGGTAT TATTCATCGG CCTGACAATT TTTGCGTTAG GTAGTCTGGG GATGCTGTGG GTAGAAAACG CCGCTACGCT GCTGGTATTG CGTTTTGTAC AGGCTGTGGG TGTCTGCGCC GCGGCGGTTA TCTGGCAAGC GTTAGTGACG GATTATTATC CTTCACAGAA AGTTAACCGT ATTTTTGCGA CCATCATGCC GCTGGTGGGT CTATCTCCGG CACTGGCTCC TCTGTTAGGA AGCTGGCTGC TGGTCCATTT TTCCTGGCAG GCGATTTTCG CCACCCTGTT TGCCATTACC GTGGTGCTGA TTCTGCCTAT TTTCTGGCTC AAACCCACGA CGAAGGCCCG TAACAATAGT CAGGATGGTC TGACCTTTAC CGACCTGCTA CGTTCTAAAA CCTATCGCGG CAACGTGCTG ATATATGCGG CCTGTTCAGC CAGTTTTTTT GCATGGCTGA CCGGCTCACC GTTCATCCTT AGTGAAATGG GTTACAGCCC GGCAGTTATT GGTTTAAGTT ATGTCCCGCA AACTATCGCG TTTCTGATTG GTGGTTATGG CTGTCGCGCC GCACTGCAGA AATGGCAAGG CAAGCAGTTA TTACCGTGGT TGCTGGTACT GTTTGCTGTC AGCGTCATTG CGACCTGGGC TGCGGGCTTC ATTAGCCATG TGTCGCTAGT CGAAATCCTG ATCCCATTCT GTGTGATGGC GATTGCCAAC GGTGCGATCT ACCCGATTGT TGTCGCTCAG GCGCTGCGTC CCTTCCCACA TGCAACTGGT CGCGCCGCAG CGTTGCAGAA CACTCTGCAA CTGGGTCTGT GCTTCCTCGC AAGTCTGGTA GTTTCCTGGC TTATCAGTAT CAGCACGCCA TTGCTCACCA CCACCAGCGT GATGTTATCA ACAGTAGTGC TGGTCGCTCT GGGTTACATG ATGCAACGTT GTGAAGAAGT TGGCTGCCAG AATCATGGCA ATGCCGAAGT CGCTCATAGC GAATCACACT GA
|
Protein sequence | MQPGKRFLVW LAGLSVLGFL ATDMYLPAFA AIQADLQTPA SAVSASLSLF LAGFAAAQLL WGPLSDRYGR KPVLFIGLTI FALGSLGMLW VENAATLLVL RFVQAVGVCA AAVIWQALVT DYYPSQKVNR IFATIMPLVG LSPALAPLLG SWLLVHFSWQ AIFATLFAIT VVLILPIFWL KPTTKARNNS QDGLTFTDLL RSKTYRGNVL IYAACSASFF AWLTGSPFIL SEMGYSPAVI GLSYVPQTIA FLIGGYGCRA ALQKWQGKQL LPWLLVLFAV SVIATWAAGF ISHVSLVEIL IPFCVMAIAN GAIYPIVVAQ ALRPFPHATG RAAALQNTLQ LGLCFLASLV VSWLISISTP LLTTTSVMLS TVVLVALGYM MQRCEEVGCQ NHGNAEVAHS ESH
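The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below does that, then translates with the standard codon assignments, which translation table 11 shares with table 1 for internal codons. It is an illustration only: the helper names are hypothetical, and alternative start codons (which table 11 reads as Met at initiation) are not handled, which is acceptable here because the gene starts with ATG.

```python
# Codon table built from the compact NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence in this record.
print(translate("ATGCAACCTG GGAAAAGATT TTTAGTCTGG"))  # MQPGKRFLVW
print(translate("GAATCACACT GA"))                     # ESH
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content row to be checked the same way.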