Gene Information |
Locus tag | Haur_4779 |
Symbol | |
ID | 5736623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6095956 |
End bp | 6097242 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281944 |
Product | major facilitator transporter |
Protein accession | YP_001547538 |
Protein GI | 159901291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
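The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon which is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Haur_4779).
length_bp = gene_length(6095956, 6097242)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1287 428
```

Under these assumptions the computed values agree with the listed Gene Length (1287 bp) and Protein Length (428 aa).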
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000206756 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTGAAA AAAGAACTTT AGCTCAAACA TTGCATATTC CACGGCAATG GCCATATATG TTTCGGGCGC TGAACCATCG CAATTTTCGT TTATTTTGGT TTGGCCAACT TATTTCGCTG ATCGGCTCAT GGATGCAATC GCTGGCATTG CAATGGCTCG TCTATCAACT GACTGGTTCG GCTCTTGCGA TGGGCACGGT CGCCGCATTA ACCTCATTGC CTGTGCTCTT GCTCTCGCCA TTTATGGGCG TAGTGGTTGA TCGCTATGCC AAGCGCTGGG TGGTGTTTTG GGCGCAAATG GGCGCAATGA TTTCATCGTT GATTTTGGCA ATTTTGACCT TTAGTGATAG TGTGCAATAC TGGCATGTTT TAGCCTTAGC GCTGGTCAAT GGGGTAGTTA ATGCCGTCGA TATGCCTGCA CGTCAAGCCT TTACCTCAGA AATGATCGGC AACCGTGATG ATTTGATGAA TGCGATAGGC TTGAATGCCT CGATTTTCAA CGGTGCACGC GCGATTGGGC CAGCTGTTGC GGCAATGTTG GTGAGTGCTG TTGGCCTAGC TTGGGCTTTT TTGCTGAATG GCTTGAGCTT TATCGCAGTG CTCATTGGCT TGATGATGAT GACGGGCTTG ATTGCGCCAG CACCCCGCAA AAGCAACAGC TCATCAGTCA GCGATTTTAT GGATGGCGCA CGCTATTCGT TGCAAACACC CTTGATTCGA ATTATTTTGG TGCTGGTGCT GGTGCCAAGC ATTTTGGGCT TTGGCTATAC CTCGTTGCTG CCAATTTTTG CCGACCAAAT TCTGGTTACG CCGCTCATCC CTGAAGGTTC AACCCGCTTG GGCGCGATGA TGGTGGCAAA TGGGATTGGC GCATTGCTGG CGGCGCTGCG GGTTGCCCGT ACTAATGCCC AAACTGATCG GCGTAAATTA TTGTTGAATG GGGCGCTTGG GTTTGGCCTA GGCATCTGTT TGCTAGCCTT CAACCGCTCA TTCTGGTTGG CCTTACCAAT TATGACCTTT ACAGGCTTTT CGATGGTTAG CTTTTTGGCT ACCGCCAACA CGATTTTGCA AACCACCGCG ATTGATCGGT TGCGCGGGCG CGTGATGGGC TTTTATGTGA TGACCTTGGT GGGCTTGGGC TTGGTTGGAA GTTTACAGGC TGGGTTTGTG GCTGAACATT GGGGCACGCC GATCGCTACT GGAATTGGCG GTTTGGCCTG TGTGATTTCG GCCTTGCTTG GTTTACGTTC CAAAGCCTTG TTGACCTTGG TTCCGCGTAA CGAATAA
Protein sequence | MLEKRTLAQT LHIPRQWPYM FRALNHRNFR LFWFGQLISL IGSWMQSLAL QWLVYQLTGS ALAMGTVAAL TSLPVLLLSP FMGVVVDRYA KRWVVFWAQM GAMISSLILA ILTFSDSVQY WHVLALALVN GVVNAVDMPA RQAFTSEMIG NRDDLMNAIG LNASIFNGAR AIGPAVAAML VSAVGLAWAF LLNGLSFIAV LIGLMMMTGL IAPAPRKSNS SSVSDFMDGA RYSLQTPLIR IILVLVLVPS ILGFGYTSLL PIFADQILVT PLIPEGSTRL GAMMVANGIG ALLAALRVAR TNAQTDRRKL LLNGALGFGL GICLLAFNRS FWLALPIMTF TGFSMVSFLA TANTILQTTA IDRLRGRVMG FYVMTLVGLG LVGSLQAGFV AEHWGTPIAT GIGGLACVIS ALLGLRSKAL LTLVPRNE
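The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to run two basic sanity checks on the gene sequence: GC percentage, and whether the sequence looks like a complete CDS. The helper names are hypothetical, and the start codon set is an assumption covering only the most common bacterial start codons rather than every initiator allowed by translation table 11.

```python
# Assumption: only the most common bacterial start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the sequence has a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Only the first two blocks of the gene sequence, to show the cleaning step.
example = clean_sequence("ATGCTTGAAA AAAGAACTTT")
print(example)
```

To compare against the listed GC content, paste the full "Gene sequence" value into `clean_sequence` and pass the result to `gc_percent`; the record reports the value rounded to a whole percent.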