Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1862 |
Symbol | |
ID | 5733751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2195564 |
End bp | 2196862 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279006 |
Product | major facilitator transporter |
Protein accession | YP_001544633 |
Protein GI | 159898386 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCGC TGCGGCGCTG GCAACAACAC CCAATGCTGC GCTTTGTGTT GCTTTGGTTT GGCCAAACTG GCTCGATGAT TGGCTCAAGC CTGACTGGCT TTGCCTTAGG CATTTGGGTC TATCAGCAGC ATGGCGTAGT AAGTGATTAC GCTTGGGTGC TGCTGACCAA CACCTTGCCC GCCACCTTAA TCGCGCCATG GGCCGGAGCC TTGAGCGATC GTTTTAATCG GCGCACGGTG ATGCTGATAA GCGATAGCAT CGCCGGCCTT AGCACGATTG GGCTGATCAT CTTATTTAGC ACTGGGCAAC TGGTGATTTG GCAAATTTAT CTTGCCAATA GCATCAATGC GCTTGCCCGT GCCTGTCAAT GGCCAGCCTA TGTTGCCAGC GTCCCACAAT TAGTCGATTC GACTCAACAT AATCGAGCGA ATGGCTTGAT GCAACTCAGC CATGCCTTGG CCCAATTGCT GGCTCCGCTG ATTGGTAGTA GTTTGTTGGC ATGGGTTGGC ATGCCCTTTA TCTTTACCCT CGATCTTCTG ACCTTTGGGC TGGCGTTGGT GATTACGGCG AGCACCCGTT TTGCGCCCCA ACCACAGGTG ACCGAGCATG CCAACGTGCC GTTATTGCAA ACGGCCTTGC TGGCATGGCG CGAGTTTTTG CGCTACCCCA GCCTCGTCGC TCTAACCAGC CTCATTATTT TGAGCAATTT TAGCGCTGGC AGCATCGAGG TCTTGATCAC GCCGTTGGTG CTGGAACTAC ATTCAGTGCC GACGCTGGGC ATGCTGCTGA CGTTTGGTGG TTTAGGTATG GTGGCGGGGA GTTTGTTGGC AAGTATGCTG CGCCCGCCGC GTCGTTTAGC CCGCGCGGTT TTACTGGCTG AACTGATTGG TGCATTAGCC ATGCTGTTGG CTGGTTGGCA ACCAAGCATT ATTGGCTTGG CAATCGCGGC AATCATCTAC TTTGGCATGT TGCCATGGGG CAGTGCCAAC CACACGAGCT TAATTCAACA ACAGCTGCCA AATGCGCTGC ATGGGCGGAT TTTTGCCTTG ATTAGTGCCC TCGCGTCGTT AGCACTCACG CTTGGCTTCG TTAGTGCGGG CTTGTTGGCC GATCGCTTTT TCACCCCAGC CATGCAACCC CATGGCTGGC TCAACCCGCT ATTCGCTTGG CTGGTGGGGA CGCAGCCGAG CAGTGGCATT CAACTGCAAT TTATAAGTTT GGGATTACTA ACCTTGCTGT GGACTATCGG GGTTGCTGGA TGGCATGCTC GTCAAACACC GCCATCTGAG CAACTATAG
|
Protein sequence | MISLRRWQQH PMLRFVLLWF GQTGSMIGSS LTGFALGIWV YQQHGVVSDY AWVLLTNTLP ATLIAPWAGA LSDRFNRRTV MLISDSIAGL STIGLIILFS TGQLVIWQIY LANSINALAR ACQWPAYVAS VPQLVDSTQH NRANGLMQLS HALAQLLAPL IGSSLLAWVG MPFIFTLDLL TFGLALVITA STRFAPQPQV TEHANVPLLQ TALLAWREFL RYPSLVALTS LIILSNFSAG SIEVLITPLV LELHSVPTLG MLLTFGGLGM VAGSLLASML RPPRRLARAV LLAELIGALA MLLAGWQPSI IGLAIAAIIY FGMLPWGSAN HTSLIQQQLP NALHGRIFAL ISALASLALT LGFVSAGLLA DRFFTPAMQP HGWLNPLFAW LVGTQPSSGI QLQFISLGLL TLLWTIGVAG WHARQTPPSE QL
|
| |