Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3672 |
Symbol | |
ID | 5594384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3656786 |
End bp | 3658045 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640922788 |
Product | major facilitator superfamily transporter |
Protein accession | YP_001460268 |
Protein GI | 157162950 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCTATAGT CATGTTTAAC TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT GTGATGGGCT TTAGCGCCTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC TTGCTGAGCC GCCCTCATGC CGGACGTTAC GCCGATTCGC TGGGACCCAA AAAGATTGTC GTCCTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGGT ATCTGACGGC AGGATTAACC GCCAGTCTGC CTGTCATCAG CCTGTTATTA CTTTGCCTGG GGCGCGTCAT CCTTGGGATT GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTATGGGGCG TTGGCGTGGT TGGCTCGCTG CATATCGGGC GGGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGATCCCGC GTCCGACGGT AAAAGCCAGT AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCG CTGGCACTGG CTTCCGCCGG ATTTGGCGTC ATCGCCACCT TTATCACGCT GTTTTATGAC GCTAAAGGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGTG GCTTAAACGT AGCGATGATT TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG GCGAAAATCG GCGTCTTACT GGCGGGGGCC GGGTTTTCGC TGGTGTTCCC GGCATTGGGT GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC GTATTTATGG ATTTATCGCT TGGCGTGACT GGACCACTGG CTGGGCTGGT GATGAGCTGG GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
|
Protein sequence | MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADSLGPKKIV VLGLCGCFLS GLGYLTAGLT ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG APLGVVFYHW GGLQALALII MGVALVAILL AIPRPTVKAS KGKPLPFRAV LGRVWLYGMA LALASAGFGV IATFITLFYD AKGWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS
|
| |