Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1447 |
Symbol | |
ID | 5733311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1680909 |
End bp | 1682207 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278585 |
Product | major facilitator transporter |
Protein accession | YP_001544219 |
Protein GI | 159897972 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.674302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAC TTGATACAAA TAATGACGCG CAGGCCTCAA CTAAGCTTGC GCGTTCATTA GGTTGGACAT TGGTGATTAA ACTGCTGATC TCGCTGCATG TTGGGGTTGG CGATTTTTTG TTGCCCTTGT ACGTTCAAGC GCTGGGTGGT TCGCCCCAGG TGATTGGCAA TTTGGTGGGT TTGGGCGCGG CCTTTGGCGT ATTTGCGCGG CTCAGCCTGA GTTGGCTTGC CGATCGTGGG ATCATCGGGA TTTTGCTGCG TTTGAGTTTG GCGGCCTTAG CGAGTGGCTT TGCAATTTGG AGCTTTGCTG ATAGCAGTGT TTGGTTAATG CCTGGTCAAG CATTGCTCGC GATCGGGCGA GCAGGCTCGA CTATCTGTTT AAGTTTGTTG ATTGCCCAAT TAACCAGCCA AGGTCAACGG GGTTCAGGCT ATGGGCGGCT GACCATGGCC AGTTCATTGG CAACAATTGT TGGGGCGATC ATCGCGGGCA TTGGCTTTAT TGGCTGGGAT GCAGAAACTC GCCAGCAATT GCAACAATTC ACTTGGCTGC AAACTAGCCT CAATTATTTA CCCAATCCAA TTCCCCGTGT CGAGCTATTT CATGGCATTT ACATCGTGTT TAGTAGTAGC GTCGTCGTAG CAGGAATCTT CAGTTTACGT TCATGGCCCC ATGCAATTCC GGCAAAACGC GGTACAATAC AAGCAATATG GCGCAGTGTG TTACGTCAGC CAACAATCCG CAGTCTGTTG CTTGTGCAAA GCTGCATCAG CGCGGGCTAT AGTGCCTCGA TTCCAATGAC CGTGCCGTTA TTGACTGATC GTTTTGGGGC AAGTGTGGCG GCGGTCGCTG TGGCCTATAT TGTGCCAGGC ATTATTTATG CGCTGTTTCC AGCACGCCTT GGGCGCGTCG CTGATCGGAT TGGCTATCGA CGGGCTGCCA AGCTTGGGCT TGGCGTAAGT ATGTTAGTTT ACCTAGCAAT ACCTATCAGC CCACAACTTG CAATCACCGC AGCATTTTGG GCTTTTGAAG CGCTGGCATG GAGTTTTTAT GTACCAGCTT TGGAAGCCTT ACTTGCGGAA AGTGTGATAC CACAACAACG CGGCACAGCC TTGGCGATCT ATGGAGCGTC AGGCGCATTG ACCGCAACTG TGGCAGCACC CTTGGGCGCA CGCTTGTATA GCCATTGGAT CGCCGCACCC TTTTTATTCT CGGCCCTTTG CCTAGGTATG GCAGCCATGT TTGCTGCCCG TACCCCACCA ACCAATGCCG ATTATGCTAC CATTAAATCT CACAATTGA
|
Protein sequence | MNKLDTNNDA QASTKLARSL GWTLVIKLLI SLHVGVGDFL LPLYVQALGG SPQVIGNLVG LGAAFGVFAR LSLSWLADRG IIGILLRLSL AALASGFAIW SFADSSVWLM PGQALLAIGR AGSTICLSLL IAQLTSQGQR GSGYGRLTMA SSLATIVGAI IAGIGFIGWD AETRQQLQQF TWLQTSLNYL PNPIPRVELF HGIYIVFSSS VVVAGIFSLR SWPHAIPAKR GTIQAIWRSV LRQPTIRSLL LVQSCISAGY SASIPMTVPL LTDRFGASVA AVAVAYIVPG IIYALFPARL GRVADRIGYR RAAKLGLGVS MLVYLAIPIS PQLAITAAFW AFEALAWSFY VPALEALLAE SVIPQQRGTA LAIYGASGAL TATVAAPLGA RLYSHWIAAP FLFSALCLGM AAMFAARTPP TNADYATIKS HN
|
| |