Gene Information |
Locus tag | EcolC_1804 |
Symbol | |
ID | 6065836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2004265 |
End bp | 2005638 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641601219 |
Product | major facilitator transporter |
Protein accession | YP_001724781 |
Protein GI | 170019827 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
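The length fields above follow from the coordinates. The sketch below reproduces them, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon while Protein Length does not. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2004265
END_BP = 2005638


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1374 457
```

Both results agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields of this record.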
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.673075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0488273 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT CAAATCGCCA TTGTCATCTC CCTGCTCTCG TTTTCGTTTC TGGGCGATAT GTTTGGCTAT CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGGAGA GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTAGGT ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATCACT GCGTTGAGTG GTTTCGCTCA GGGGCAATCG CTAACGTTAA TTGCTGCGGA ACTGGTGGTA ATGGTTGTTG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTGCCCTTTT ACCTGCAAAC CGTGCTCGGG CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC GATGGTGATG GCTCCGCTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGG TTGTTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCCGCGTC ACCTGCGGAT ATCAATATTA TCTGGCCGAT GATCTTATGT GGCGCTGGAT TTGGCTTATT CCAGTCACCC AATAACCACA CCATTATTAC CTCCGCTCCG CGCGAACGTA GCGGTGGGGC CAGTGGCATG TTAGGAACGG CTCGTCTACT GGGTCAGAGT AGTGGCGCGG CGCTGGTGGC GCTGATGCTA AATCAGTTCG GTGATAATGG TACGCACGTT TCGCTGATGG CTGCGGCTAT TCTGGCAGTG ATTGCTGCCT GTGTGAGTGG TTTACGTATC ACTCAGCCAC GATCCAGGGC ATAA
Protein sequence | MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQGQS LTLIAAELVV MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSRA
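The protein sequence can be derived from the gene sequence using the translation table listed in the record (table 11). The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments that table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
# Amino acid assignments are identical in translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA"))  # MPKVQADGLP
```

The output matches the first 10-residue group of the protein sequence in the record. Passing the full gene sequence should yield the complete protein if the assumptions above hold.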