Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1415 |
Symbol | |
ID | 5733323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1631536 |
End bp | 1632963 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278553 |
Product | EmrB/QacA family drug resistance transporter |
Protein accession | YP_001544187 |
Protein GI | 159897940 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00176967 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGA TCGTGATCGA TCAAAAAATT GCAGTTTGCG TCGTCTTTGT AGCGGCGTTG TTTATGAGCA TTATGGATGG CACAATTATC AACGTGGCAC TAGCAACCAT CCAACAAGAT TTTGGCGCGA GTAGCAGCGC GATCAACGCA ATCGTGGTGA TCTATTTAAT TTGTATTGCG GTGGTGATTC CGGCTTCGGG TTGGTTGGGC GACCGTTGGA ACACCAAATG GGTCTTTTTA ACCTCACTTG GTTTGTTTAC GTTGGCCTCG CTGGCCTGTG GCTTAGCTCA AAATATTGAG CAATTGATGC TAACTCGGGC GGCTCAAGGC ATCGCAGCAG GCGCACTAAT GCCAGTCGGC ACAACCATGC TCTTTCGGAC TTTCCCGCCG CACCAGCGCA TCCAAGTTTC GCGGGTGCTG ATTATTCCAA CGGTGATAGC ACCCGCAGTT GGGCCGGTGC TCGGCGGATT TTTAGTCGAT CATCTGTCGT GGCACTGGGT CTTTTTCGTC AATGGCCCAA TTGGCTTAGC CGCGCTGATC TTCGGGGTCA TTTGGTTGCA AGCGCCTGCC CAAGAAGAAG TTGGCGCATT CGATTGGCTG GGTTTTATTT TAGCGGGCGC TGGCTTTGCC GCCTTGCTCT ACACCTTGAC CGAAGGCGCG AGCAAAGGCT GGGGTTCGCC AATCATTTTG GTTAGTGCCG CGATTGGGGT TGGCGCACTT GCCGCGCTGG TGGTCGTTGA ATTAGCCAAA GCCAAGCCAA TGCTTGATCT GCGGCTCTTC TCCATTCGCT TGTTTCGCGT CAGCAATTTA GTAGCGATTT TTGGCTCAGC CGCCTTTACG GGCATTCTGT TTCTGATGCC CCAATTTTTG CAAAATGTCG TTGGAGCCAG TGCGCTCGAA TCGGGCTTGA CCACCTCGCC CGAAGCGATT GGCGTGGTAC TATCGAGCCA AATCGTAGCG CGACTTTACC CCAAAGTTGG CCCACGCCGC TTGACTTTTG GCGGGGTTTT AGGGGTTGCC GTGATGATGG GCTTGATGAG CACAATCGAT GCTGAAACCA ATTTGTGGTT GGTACGTGCC TTGATGTTTG GCACAGGCGT GGGGATGGCC TACCTCTTTT TGCCAATTGA GGCCGCCGTT TTTGCCCAAA TTCCGCATGC CTCAACTGGC CAAGCTTCAG CAATTTTCAG CATGCAACAA CAACTTGGTT CGGCCTTGGG CGTAGCCATT TTGGGCAGCG TGTTGGCGCT CAATCATAGC GGCACAACAG TTAATCAAAA TGCCCAAAGC TATCAGTATG CCTTCTTGGC AGCAGCAATT TTGGCATTTG TTTCAGCCTG TGTAGCGTTG TTTATTCGCG ATCGCGATGC AGCGGCGACC ATGGCCCAAC ATGGCGAGCA CAGTGAATTG AGTCATTCAA TCGCCTAA
|
Protein sequence | MKRIVIDQKI AVCVVFVAAL FMSIMDGTII NVALATIQQD FGASSSAINA IVVIYLICIA VVIPASGWLG DRWNTKWVFL TSLGLFTLAS LACGLAQNIE QLMLTRAAQG IAAGALMPVG TTMLFRTFPP HQRIQVSRVL IIPTVIAPAV GPVLGGFLVD HLSWHWVFFV NGPIGLAALI FGVIWLQAPA QEEVGAFDWL GFILAGAGFA ALLYTLTEGA SKGWGSPIIL VSAAIGVGAL AALVVVELAK AKPMLDLRLF SIRLFRVSNL VAIFGSAAFT GILFLMPQFL QNVVGASALE SGLTTSPEAI GVVLSSQIVA RLYPKVGPRR LTFGGVLGVA VMMGLMSTID AETNLWLVRA LMFGTGVGMA YLFLPIEAAV FAQIPHASTG QASAIFSMQQ QLGSALGVAI LGSVLALNHS GTTVNQNAQS YQYAFLAAAI LAFVSACVAL FIRDRDAAAT MAQHGEHSEL SHSIA
|
| |