Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2911 |
Symbol | |
ID | 5592738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2913765 |
End bp | 2915102 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922028 |
Product | major facilitator family transporter |
Protein accession | YP_001459539 |
Protein GI | 157162221 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACTT CACCGGTGCG AATGGATGAT TTACCGCTTA ACCGTTTTCA CTGTCGAATT GCTGCGCTCA CTTTCGGCGC ACACCTGACC GACGGTTATG TTCTCGGCGT CATTGGTTAC GCCATTATTC AGCTTACGCC CGCCATGCAA CTGACGCCGT TTATGGCGGG AATGATCGGC GGCTCGGCGC TCCTTGGTTT GTTTCTTGGC AGCCTGGTTC TTGGGTGGAT CTCCGACCAT ATTGGTCGGC AAAAAATCTT CACCTTCAGC TTTTTGCTGA TTACGCTCGC TTCGTTCTTG CAATTTTTTG CCACCAAGCC AGAGCATCTT ATTGGGCTGC GCATTTTGAT CGGCATTGGG CTGGGAGGCG ATTACTCAGT TGGTCACACC TTGCTGGCTG AATTTTCCCC GCGCCGCCAT CGCGGTATTT TGCTGGGCGC ATTCAGCGTG GTGTGGACCG TAGGTTATGT GCTGGCAAGT ATTGCCGGAC ATCACTTTAT TTCCGAAAAC CCGGAGGCCT GGCGCTGGCT GTTGGCATCG GCAGCTCTGC CCGCGTTGTT GATTACGTTA TTACGCTGGG GAACGCCAGA ATCGCCACGC TGGCTACTGC GCCAGGGGCG TTTTGCAGAA GCTCACGCTA TCGTGCATCG CTATTTTGGT CCCCATGTTT TACTGGGCGA TGAAGTGGTA ACGGCGACCC ATAAACACAT CAAAACCTTG TTCTCTTCGC GTTACTGGCG GCGCACGGCG TTTAACAGCG TCTTCTTTGT CTGCCTCGTA ATCCCATGGT TTGTGATTTA TACCTGGCTG CCAACTATCG CCCAGACTAT TGGTCTGGAA GATGCGCTGA CTGCCAGCCT GATGCTTAAT GCGTTGTTAA TTGTAGGCGC GCTGCTGGGA TTAGTTCTGA CGCACCTGCT GGCACATCGC AAATTTTTGC TGGGAAGTTT TTTGCTGCTG GCGGCAACGC TGGTAGTGAT GGCCTGTTTG CCTGCCGGCA GTTCATTAAC GCTGCTGCTT TTTGTTCTCT TCAGCACCAC CATTTCGGCA GTCAGTAATC TGGTGGGCAT TTTGCCTGCG GAAAGTTTTC CTACTGACAT TCGCTCGCTG GGCGTCGGTT TTGCCACCGC TATGAGTCGG TTGGGGGCGG CAGTAAGTAC TGGCCTGCTA CCGTGGGTGC TGGCGCAGTG GGGAATGCAA GTCACCTTAT TGCTCCTGGC GACAGTGTTG TTAGTTGGTT TTGTCGTGAC CTGGCTATGG GCACCAGAAA CCAAAGCACT CCCGCTGGTG GCGGCGGGAA ATGTAGGAGG TGCTAATGAA CATTCTGTTA GCGTTTAA
|
Protein sequence | MNTSPVRMDD LPLNRFHCRI AALTFGAHLT DGYVLGVIGY AIIQLTPAMQ LTPFMAGMIG GSALLGLFLG SLVLGWISDH IGRQKIFTFS FLLITLASFL QFFATKPEHL IGLRILIGIG LGGDYSVGHT LLAEFSPRRH RGILLGAFSV VWTVGYVLAS IAGHHFISEN PEAWRWLLAS AALPALLITL LRWGTPESPR WLLRQGRFAE AHAIVHRYFG PHVLLGDEVV TATHKHIKTL FSSRYWRRTA FNSVFFVCLV IPWFVIYTWL PTIAQTIGLE DALTASLMLN ALLIVGALLG LVLTHLLAHR KFLLGSFLLL AATLVVMACL PAGSSLTLLL FVLFSTTISA VSNLVGILPA ESFPTDIRSL GVGFATAMSR LGAAVSTGLL PWVLAQWGMQ VTLLLLATVL LVGFVVTWLW APETKALPLV AAGNVGGANE HSVSV
|
| |