Gene Information |
Locus tag | EcHS_A3973 |
Symbol | |
ID | 5590960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3967596 |
End bp | 3969023 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640923078 |
Product | DHA2 family drug:H+ antiporter-1 |
Protein accession | YP_001460555 |
Protein GI | 157163237 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
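
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are inclusive positions and that the gene span includes the stop codon (the gene sequence in this record ends in TAA). The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a gene span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3967596, 3969023)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the Gene Length (1428 bp) and Protein Length (475 aa) fields.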
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000000028828 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC CTTGCCGTGA GTCTGTTCAC ATTGGGTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA CAGCTGGTTG TCTTCCGGGT TATTCAGGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT CGGCTGGCCT TACTGCGCGC TTATCCTCGT AATGAACTTC TTCCAGTATT GAATTTTGTC GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC AGGCCTTCTT TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC TTTTTGTTGT TTGGCCTCAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGCTGT GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC GGCTGTATGA TGGCGCCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA GTCTTACGTC GTCTGGGCTA TCGCCATACA TTAGTGGGGA TCACGGTGAT TATTGGGCTA ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG TTGTTTATAT TAGGGATGGC TATGTCGACG CAGTTTACCG CGATGAATAC CATCACACTT GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA CTGTCTATCA GTTTAGGCGT TGCTATAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGAATG GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACAATGGG CATTATTACT GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
|
Protein sequence | MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA MLIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAIS AAVLRVYEGM EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE