Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3332 |
Symbol | |
ID | 5735202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4200736 |
End bp | 4201971 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280479 |
Product | major facilitator transporter |
Protein accession | YP_001546096 |
Protein GI | 159899849 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAC AATTCAAAGA GTATGCGATT GTTACCGCCG CTTATTGGGC CTTTACATTG ACCGATGGCG CGTTGCGTAT GTTGGTCTTG TTTCATTTTC ATCAACTTGG CTATAGTGCC TTGGCGGTGG CGGCGCTCTT TATTTTCTAT GAAGTATTTG GGGTGATTAC CAATTTGTTT GGTGGCTGGC TGGGCGCACG CTTTGGCCTG AATCGCACCT TACATGCTGG CTTGGTGTTA CAAGTTATCG CTTTGGGCAT GATGGTTGTG CCCAGTGCTT GGCTCAGCGT GACATATGTG ATGCTAGCGC AAGCTGGCTC GGGCATCGCC AAAGATCTGA CCAAAATGAG TGCCAAGAGC AGCATCAAAA TGTTGGTGGC TGAAGATGCT CAATCAACTT TGTTCAAATG GGTAGCTATT TTGACTGGCT CGAAAAATGC GCTCAAAGGT GTGGGCTTTT TCTTGGGCAG CCTATTGCTG AGCACCATTG GCTTTCGCAG TGCCTTCGCT GTGCTTGCGA GTTTGATTGC CGTAGCTGCC ATTGGCACAT TCAGCTTTTT GCAACACGAT TTGGGGCGTA GCAAAGCCAA GCCGAAGTGG CAACAACTAT TTGCCAAAAG CCGCGCGATC AATCTGCTTT CAGCAGCGCG TTGTTTTTTG TTTGCAGCGC GGGATGTCTG GTTTGTGGTC GGCTTGCCAG TTTTTCTGAG TGAAGTGCTT GGCTGGTCGT TTTGGCAGGC TGGCGGCTAT TTGGCCTTAT GGGTGATTGG CTATGGCCTG ATTCAAGCGC TTGCGCCAGC GATTACCCGC CGCTGGCAAG CCACGCCCAA CGGCTGGGCT GCAACAATTT TGGCTGGCTT GTTGATGCTG ATTATGGCTG CGATTGCCAG TGTTGTGCAA ATCAACCAAC AATCGGCTAT CGTGGTTGTG GTCGGCTTAA TTGGCTTTGG ATTTATTTTT GCGCTGAATT CGGCAGTCCA CTCCTACCTG ATTTTAGCCT ACACTGACCA AGCCGATGTG GCTTTGAATG TAGGTTTTTA CTATATGGCC AACGCCTTGG GCCGCCTGCT TGGCACAATT CTTTCGGGTC TGCTGTATCA AAGCTATGGG TTGGCAGGCT GTTTATGGGC AGCGGCCTTA CTGAGCGCAA TCACAGCCGT GATCTCGTTG AGCTTGCCGC GTGTGCCAAA CCAACCGCTG TTGGGCGAAC AGCAGGCCAA AGCTAGCTCG ATCTGA
|
Protein sequence | MQQQFKEYAI VTAAYWAFTL TDGALRMLVL FHFHQLGYSA LAVAALFIFY EVFGVITNLF GGWLGARFGL NRTLHAGLVL QVIALGMMVV PSAWLSVTYV MLAQAGSGIA KDLTKMSAKS SIKMLVAEDA QSTLFKWVAI LTGSKNALKG VGFFLGSLLL STIGFRSAFA VLASLIAVAA IGTFSFLQHD LGRSKAKPKW QQLFAKSRAI NLLSAARCFL FAARDVWFVV GLPVFLSEVL GWSFWQAGGY LALWVIGYGL IQALAPAITR RWQATPNGWA ATILAGLLML IMAAIASVVQ INQQSAIVVV VGLIGFGFIF ALNSAVHSYL ILAYTDQADV ALNVGFYYMA NALGRLLGTI LSGLLYQSYG LAGCLWAAAL LSAITAVISL SLPRVPNQPL LGEQQAKASS I
|
| |