Gene Information |
Locus tag | Haur_0984 |
Symbol | |
ID | 5732887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1126833 |
End bp | 1128278 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278118 |
Product | major facilitator transporter |
Protein accession | YP_001543760 |
Protein GI | 159897513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
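The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1126833
end_bp = 1128278
gene_length_bp = 1446
protein_length_aa = 481

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1446 bp is 482 codons, which is 481 residues plus one stop codon.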
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.829965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAACA CACCCATGCC ATCACAGCAT TCCACCGTTG TCTCGCAATC CACAATTTGG AAGGTCATCA TTGCATCGGC TTTGGGTACC ATGATTGAAT GGTACGATTT CTACATTTTT GGCAGTCTTG CTGCGGTCAT TGCGACAAAT TTCTATCAGT CGGGCAACGA AACGGCGGCC TTGCTGGAAA CCTTTGCCAC CTTTGGGGCG GGCTTTGCGG CGCGACCGTT CGGGGCACTG GTGTTTGGGC GCATTGGCGA TATTGTTGGG CGCAAATATG CCTTTTTGGT CACGTTGCTG ATCATGGGCG GCGCAACCAC GGTCATTGGG ATTTTGCCCA CCTATGCCTC AATTGGCATC CTCGCCCCGA TCATTTTGGT GATTATTCGG ATCATCCAAG GTTTGGCGCT TGGTGGTGAA TATGGCGGTG CGGCGGTCTA TGTCGCTGAA CATGTTCCCG ACCATAAGCG CGGTTTTTAC ACCAGTTTTA TTCAAATTAC CGCCACGCTT GGCTTATTTA TCTCGTTGCT GGTAATTTTG ATTGTACGAA CCTCGATGAG CAAGGCAGCC TTTGATAGCT GGGGCTGGCG GATTCCCTTC TTGCTCTCAA TTGTCTTGGT GGGTGTTTCA GTCTACATTC GCTCGAAGAT GAGTGAATCG CCGTTGTTTA CCAAACTCAA ACATGCAGGC AAAACCTCGA AAGCTCCGCT TAAAGATAGT TTTGGCAATC GGCGCAATTG GAAAGTGATT TTGACGGTGT TGTTTGGAGC CGCTGCGGGT CAAGCAGTAA TCTGGTATAC CGCTCAATTT TACGTGAACT CGTGGCTCAA AACCCAAGCC AAAGTGCCAG CTAACACCGT TGATACAATC GTGGCGATTG CTTTGTTCTT AGGCATGCCG TTTTTCGTCG TCATGGGAGC GCTTTCAGAT AAATGGGGGC GCAAAACGGT GATGATGGCA GGCAATTTAA TCGGTGCAAT TGCGATTTAT CCCGCCTTTA TGGCCCTGAA AGCGGCGGCT GGTCCAATTA CTCCGGCGGT TCTCGATGAA GCTGGAAAGG TTATCACGCC TGCGGTCGCC AACAATCCTA ACACCGTTCT ACTCACCTTG ATCATTTTTG GGTTGGTGTT GTGTGTTTGT ATGGTGTATG GCCCGATTGC GGCCTTTTTG GTGGAATCGT TTCCTGCCAA AATTCGCTAT ACCTCGGTTT CACTGCCCTA TCATGTTGGC AACGGCTACT TTGGCGGTTG GTTGCCCTTT ATCGCCACAG CAGTGGTTAG TAGTACCGGC AATATCTATG CTGGCCTATG GTTTCCAATT GCCATCGCTT TGTTGACCTT TGTGGTTGGG ATGGTCTTGC TCAAGGAAAC CAAGGATAAT TCGCTGCATG AAGAGGCTAG CGATAACCCA ATGGCGACTG AAATGGATTT AATTGCCCAA TCATAA
|
Protein sequence | MSNTPMPSQH STVVSQSTIW KVIIASALGT MIEWYDFYIF GSLAAVIATN FYQSGNETAA LLETFATFGA GFAARPFGAL VFGRIGDIVG RKYAFLVTLL IMGGATTVIG ILPTYASIGI LAPIILVIIR IIQGLALGGE YGGAAVYVAE HVPDHKRGFY TSFIQITATL GLFISLLVIL IVRTSMSKAA FDSWGWRIPF LLSIVLVGVS VYIRSKMSES PLFTKLKHAG KTSKAPLKDS FGNRRNWKVI LTVLFGAAAG QAVIWYTAQF YVNSWLKTQA KVPANTVDTI VAIALFLGMP FFVVMGALSD KWGRKTVMMA GNLIGAIAIY PAFMALKAAA GPITPAVLDE AGKVITPAVA NNPNTVLLTL IIFGLVLCVC MVYGPIAAFL VESFPAKIRY TSVSLPYHVG NGYFGGWLPF IATAVVSSTG NIYAGLWFPI AIALLTFVVG MVLLKETKDN SLHEEASDNP MATEMDLIAQ S
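The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the database's own pipeline: it assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code for every codon and differ only in the set of permitted start codons), and it uses only the first 30 and last 36 bases of the gene sequence above. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence in the record.
gene_start = "ATGTCAAACA CACCCATGCC ATCACAGCAT"
gene_end = "ATGGCGACTG AAATGGATTT AATTGCCCAA TCATAA"

print(translate(gene_start))  # MSNTPMPSQH
print(translate(gene_end))    # MATEMDLIAQS
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. The same function can be applied to the full gene sequence, since the whitespace between the 10-base groups is stripped before translation.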