Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4014 |
Symbol | setC |
ID | 6142948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4093311 |
End bp | 4094495 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641618839 |
Product | sugar efflux transporter C |
Protein accession | YP_001745977 |
Protein GI | 170683995 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00899] sugar efflux transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAA CGGCTACTAC TCCATCAAAA ATACTTGATC TCACTGCCGC GGCATTTTTA CTTGTCGCCT TTCTGACGGG TATTGCGGGC GCTCTCCAGA CTCCCACCCT AAGTATATTC CTCGCAGATG AACTGAAAGC CCGTCCTATA ATGGTAGGTT TTTTCTTCAC CGGTAGCGCT ATTATGGGGA TTCTGGTCAG TCAATTTCTG GCAAGGCACT CCGATAAACA AGGCGACCGT AAATTACTGA TCCTGCTATG TTGCTTATTT GGAGTGTTGG CCTGCACGCT TTTTGCGTGG AATCGCAACT ACTTCATTCT CCTCTCTACG GGCGTACTTC TGAGTAGTTT TGCTTCCACT GCAAACCCGC AAATGTTCGC CCTCGCCCGT GAACACGCCG ACAGAACAGG CCGTGAGACG GTCATGTTCA GTACATTTTT ACGTGCTCAG ATCTCGCTTG CCTGGGTTAT CGGGCCACCG CTCGCTTATG AACTGGCAAT GGGATTTAGT TTTAAAGTGA TGTATCTCAC CGCTGCCATC GCATTTGTTG TTTGCGGACT GATAGTCTGG TTGTTTTTGC CATCAATACA AAGAAATATT CCTGTCGTTA CCCAACCCGT AGAAATTTTA CCCTCCATCC ATAGAAAGCG GGATACACGG CTACTTTTTG TAGTCTGTTC AATGATGTGG GCGGCGAATA ATCTCTACAT GATAAATATG CCGCTATTTA TTATTGATGA ACTGCATCTA ACCGATAAAC TGGCTGGAGA AATGATTGGT ATCGCTGCCG GTCTGGAAAT TCCGATGATG TTAATCGCAG GCTATTACAT GAAACGTATT GGCAAGCGGC TATTAATGCT CATTGCTATC GTGAGTGGTA TGTGTTTTTA CGCCAGCGTA CTCATGGCGA CGACTCCGGC AATTGAGCTG GAATTGCAAA TTCTAAATGC CATTTTCCTT GGTATTCTCT GTGGTATCGG CATGCTTTAT TTTCAGGATC TGATGCCTGA AAAAATAGGC TCTGCGACAA CGTTATATGC AAATACCTCA CGCGTCGGCT GGATTATCGC CGGCTCTGTT GACGGAATTA TGGTTGAAAT CTGGAGCTAC CACGCGTTGT TCTGGCTGGC GATAGGGATG TTGGGTATTG CGATGATTTG CCTGCTGTTT ATTAAAGATA TTTAG
|
Protein sequence | MQKTATTPSK ILDLTAAAFL LVAFLTGIAG ALQTPTLSIF LADELKARPI MVGFFFTGSA IMGILVSQFL ARHSDKQGDR KLLILLCCLF GVLACTLFAW NRNYFILLST GVLLSSFAST ANPQMFALAR EHADRTGRET VMFSTFLRAQ ISLAWVIGPP LAYELAMGFS FKVMYLTAAI AFVVCGLIVW LFLPSIQRNI PVVTQPVEIL PSIHRKRDTR LLFVVCSMMW AANNLYMINM PLFIIDELHL TDKLAGEMIG IAAGLEIPMM LIAGYYMKRI GKRLLMLIAI VSGMCFYASV LMATTPAIEL ELQILNAIFL GILCGIGMLY FQDLMPEKIG SATTLYANTS RVGWIIAGSV DGIMVEIWSY HALFWLAIGM LGIAMICLLF IKDI
|
| |