Gene Information |
Locus tag | EcHS_A4107 |
Symbol | |
ID | 5592952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4098562 |
End bp | 4099995 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923211 |
Product | major facilitator transporter |
Protein accession | YP_001460670 |
Protein GI | 157163352 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
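The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions fit the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4098562, 4099995)
print(length)                  # 1434, matching "Gene Length"
print(protein_length(length))  # 477, matching "Protein Length"
```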
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 0.338047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAA TCACCAGTCC TGCAACTTAT TCAATTAGTC GGCCACAGGA CGTAATTGAT ATTGTTAATA AGAACTCTGC AATCAACACC AGTATCGGCG TTATTTTTAT TGCTTTGGGC GGCATATTGA TTGATGCCTA CCAGGCGGCG ATGGTGGGGT TTGGTAATAA ATACATTGCC GCTCAATTTG GCATTTCTCC AGGCCTTGCC GCAACCGTTA ATGCGTCAGT ATTAATCGCC GCGTTAATTG GCGGTTTATT AGCGAACCGA GTAATAAACC GCTTTGGGCA AAAGCGAGCA TTTATTATTG GCATGGGGCT GTGCACCATC GGGGCTGCTG CGGTAGCTAT TGCGCCCAGT ATCTGGTGGG TGCTGGTGTG CCGCGTCATC ATGGGCTTTG GTTTAGGCAT CGACTTCCCT TTGGCAACCA ATGCCGTGGC AGAGCTTCGT GGTTCAACGT CGAAGAAAAC CGGAACGTCG GTCAACCTCT GGCAAATGGC CTGGTATGTT TCGACAACTG TTGTTTATTT GGTGCTCTTG CCGCTGCTTC TGTCGGGTAT CGCTGAAGAA CAATTGTGGC GTTACGGAAT ATTCATCGGA GCTATTTTTG CAGTCATCTT CATGATTTTG CGTTACTTCT TTATTGGTGA ATCCGCAATG TGGGCCGCAC GCGTCGGGCG TTACCAGGAA GCGTGCGACA TTCTGGGAAA ACGTTATGGT GTTCAGGCTC GCGTTGCGGC ATCGAGTACA ACAGAAGCGA AATTCTCGGA AAAAGCAGAG AATAAATACA GTGGTGGATA TGGCATCTTA TTTAATGATC GTTACCGCAA ACGCACCATT CTTGGCTGTG TCGTGGCAAC CATGCAGGCG TGGCAATATA ACGCCGTAGG TGTTTATCTT CCTCTTACGT TGGCGGGAAT AATAAGTGGC GGGCTTACTG GTGCGTTGAC GGGTTCTGCC GTCGTGAATG CCCTTTGTGG GGTGACAGGC GGGATGATCG GCTCGTTTAT TCTCCAACGA CTGGGTACTC GCCGACAGTC GATGTATGGA TTTGCTGTTG TGACCTTAGC ATTGCTGTCG TTAGGCGCAC TGGCAACGAC TAATCCATGG CTGTCTTTAG GGTTATTGGG ATCAATTATT TTCTTCCATT CAGCGGGTCC TGGTGGGCTG GGCATGACCA TTGCCACACT CTCTTATCCT CCCGCTATTC GCCCTACTGG GGTCGGATTT GCCCGCGCTA TTATGCGCAC AGGGGCAATT GCAGGACTCA TTTTCTGGCC GATGCTGTGG GGGGCGTTGA AAACTGAAGC GTTTTACTGG TTGGCAATCG TGCCATTCTT GGGGTTCCTG ACCTGCGTAT TGATTAATTG GGAACCACTG GGTGCAAATG TTGATGCTGA GGATGCAGAG GTTCTGGCTG AATTGAAGAA ATAA
|
Protein sequence | MSQITSPATY SISRPQDVID IVNKNSAINT SIGVIFIALG GILIDAYQAA MVGFGNKYIA AQFGISPGLA ATVNASVLIA ALIGGLLANR VINRFGQKRA FIIGMGLCTI GAAAVAIAPS IWWVLVCRVI MGFGLGIDFP LATNAVAELR GSTSKKTGTS VNLWQMAWYV STTVVYLVLL PLLLSGIAEE QLWRYGIFIG AIFAVIFMIL RYFFIGESAM WAARVGRYQE ACDILGKRYG VQARVAASST TEAKFSEKAE NKYSGGYGIL FNDRYRKRTI LGCVVATMQA WQYNAVGVYL PLTLAGIISG GLTGALTGSA VVNALCGVTG GMIGSFILQR LGTRRQSMYG FAVVTLALLS LGALATTNPW LSLGLLGSII FFHSAGPGGL GMTIATLSYP PAIRPTGVGF ARAIMRTGAI AGLIFWPMLW GALKTEAFYW LAIVPFLGFL TCVLINWEPL GANVDAEDAE VLAELKK
|
| |
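The relationship between the gene sequence and the protein sequence above can be sketched in plain Python. This is a minimal illustration and not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it strips the spaces that break the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Fragments copied from the start and the in-frame end of the gene sequence above.
print(translate("ATGAGTCAAA TCACCAGTCC"))  # MSQITS
print(translate("G AATTGAAGAA ATAA"))      # ELKK
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields of the record.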