Gene Information |
Locus tag | EcHS_A0849 |
Symbol | |
ID | 5593335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 855578 |
End bp | 856573 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640920021 |
Product | hypothetical protein |
Protein accession | YP_001457588 |
Protein GI | 157160270 |
COG category | [V] Defense mechanisms |
COG ID | [COG1566] Multidrug resistance efflux pump |
TIGRFAM ID | |
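
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(855578, 856573)
print(length)                  # 996
print(protein_length(length))  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields in this record.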
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC CTGTCGTGAT CGGATTGGCG GTAGTGGTAC TTGCCGCCGT GGTTGCCGGA GGCTACTGGT GGTATCAAAG CCGCCAGGAT AACGGCCTGA CGCTGTATGG CAACGTGGAT ATTCGTACGG TAAATCTTAG TTTCCGTGTT GGGGGGCGCG TTGAATCGCT GGCGGTGGAC GAAGGTGATG CTATCAAAGC GGGCCAGGTG CTGGGCGAAC TGGATCACAA GCCGTATGAG ATTGCCCTGA TGCAGGCGAA AGCGGGTGTT TCGGTGGCAC AGGCGCAGTA TGACCTGATG CTTGCCGGGT ATCGCGATGA AGAAATCGCT CAGGCCGCCG CAGCGGTGAA ACAGGCGCAA GCCGCCTATG ATTATGCGCA GAACTTCTAT AACCGCCAGC AAGGGTTGTG GAAAAGCCGC ACTATTTCGG CAAATGACCT GGAAAATGCC CGCTCCTCGC GCGACCAGGC GCAGGCAACG CTGAAATCAG CACAGGATAA ATTGCGTCAG TACCGTTCCG GTAACCGTGA ACAGGACATC GCTCAGGCGA AAGCCAGCCT CGAACAGGCG CAGGCGCAAC TGGCGCAGGC GGAGTTGAAT TTACAGGACT CAACGTTGAT AGCCCCGTCT GATGGCACGC TGTTAACGCG CGCGGTGGAG CCAGGCACGG TCCTCAATGA AGGTGGCACG GTGTTTACCG TTTCACTAAC GCGTCCGGTG TGGGTGCGCG CTTATGTTGA TGAACGTAAT CTTGACCAGG CCCAGCCGGG GCGCAAAGTG CTGCTTTATA CCGATGGTCG CCCGGACAAG CCGTATCACG GGCAGATTGG TTTCGTTTCG CCGACTGCTG AATTTACCCC GAAAACCGTC GAAACGCCGG ATCTGCGTAC CGACCTCGTC TATCGCCTGC GTATTGTGGT GACCGACGCC GATGATGCGT TACGCCAGGG AATGCCAGTG ACGGTACAAT TCGGTGACGA GGCAGGACAT GAATGA
|
Protein sequence | MKKPVVIGLA VVVLAAVVAG GYWWYQSRQD NGLTLYGNVD IRTVNLSFRV GGRVESLAVD EGDAIKAGQV LGELDHKPYE IALMQAKAGV SVAQAQYDLM LAGYRDEEIA QAAAAVKQAQ AAYDYAQNFY NRQQGLWKSR TISANDLENA RSSRDQAQAT LKSAQDKLRQ YRSGNREQDI AQAKASLEQA QAQLAQAELN LQDSTLIAPS DGTLLTRAVE PGTVLNEGGT VFTVSLTRPV WVRAYVDERN LDQAQPGRKV LLYTDGRPDK PYHGQIGFVS PTAEFTPKTV ETPDLRTDLV YRLRIVVTDA DDALRQGMPV TVQFGDEAGH E
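
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a sketch under stated assumptions: the gene sequence is taken to be listed in coding orientation (it begins with ATG even though the strand is "-"), and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative only.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments (assumption stated in the lead-in).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGAAAAAAC CTGTCGTGAT CGGATTGGCG"
print(translate(prefix))  # MKKPVVIGLA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.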
|
| |