Gene Information |
Locus tag | Haur_0019 |
Symbol | |
ID | 5736853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 22982 |
End bp | 24307 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277140 |
Product | Alpha-L-fucosidase |
Protein accession | YP_001542799 |
Protein GI | 159896552 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | |
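
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes (the record does not state this explicitly) that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(22982, 24307)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```

Both values match the Gene Length (1326 bp) and Protein Length (441 aa) reported in the record.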
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATG AACATCCGCT TATTACGGCG CGAAATCAGC GCACTGAATG GTTTTTGGCA GCACGCTTTG GCATGTTTAT CCACTGGGGG TTATATGCCA TTCCGGCACG CGGCGAGTGG GTACGCAGCC TAGAGAAAAT CAGCAACGAA GCCTATCAAC CCTACTTTGA GCAATTTAAC CCAACCGAGT ATCAGCCACG CGAATGGGCC AAGCTGGCCA AGGCGGCTGG TATGCGCTAT GCAGTGCTGA CTGCCAAGCA TCACGATGGC TTTTGCCTTT TCGATAGCCA ACTAACCGAC TATAAATCGA CGAATACTCC AGCCGGACGC GATTTGGTGC GCGAGTATGT TGCGGCCTTT CGGGCTGAGG GTTTGCAGGT TGGCTTATAT TATTCGCTGC TCGATTGGCA TCACCCCGAT TATCCTGCCT TCGACGACGA ATATCATCCG ATGCGTGGCA ACGAGGCCTA TCGTGATCAG CCGCGTGATT TTCAGCGCTA CCTCGATTAC ATGCATGGCC AAATTGCCGA ATTGCTGACC AACTATGGCA AAATTGATAT TATGTGGTTC GATTTTTCGT ATGGTGATCT GCATGGTGAG GCTTGGCGAG CCAGCGAGTT GATCAACAAA GCTCGTTCGT TGCAGCCAGA TTTAATTATT GATAATCGTT TGGCGGCGAG TGGCGAGGGC AATAATAGCT TTGGCACGCC AAATCCCGAA ATCTATGCTG GCGATTTTGC CTGCCCTGAA CAGTTGATTC CGCCAACTGG TTTGGTTGAC CAGTTTGGGC GTTCAGTGCC GTGGGAAGCC TGTATCACGC TCAATCAGCA TTGGGGTTAT TCGGCCAGCG ATCGCGATTT CAAATCGTCG CGCCAAGTCG TGCATGGCTT AATTGAATGC GTCAGCAAAA ATGGTAATTT GCTGCTGAAT GTTGGGCCAG ATGCCAAGGG CCGCATTCCG GCTGAGTGCC AGCTGATTTT GCAAGAAGTT GGGGTATGGA TGGCCGAGCA TAGCGAAAGT ATTTATGGTT GTGGTCGGAG CCAACTGGAA AAACCTGAGT GGGGACGCTA TACCCAACAT GGCTCCACCA TCTATGCCCA CATTTACGAG CGCGGGATTG GCCCGATTAA TTTTCGCGGC TTGAATGGCA AGGTCAAACG TGCTCGCCTG ATCGCCGATA ACTCCGAGGT TAAGCTTGAA TTGCCGTGGA TTGCCAAGGA CTACGCTGCC GATTTGTTTT TGAACTGGCC CTCGGCTCAA TTGCCCAACC AAACCGCGAC CGTTGTGGCG TTGGAGTTAG ACGAGGGGCC AGAGGCTAGG GGCTAG
|
Protein sequence | MTHEHPLITA RNQRTEWFLA ARFGMFIHWG LYAIPARGEW VRSLEKISNE AYQPYFEQFN PTEYQPREWA KLAKAAGMRY AVLTAKHHDG FCLFDSQLTD YKSTNTPAGR DLVREYVAAF RAEGLQVGLY YSLLDWHHPD YPAFDDEYHP MRGNEAYRDQ PRDFQRYLDY MHGQIAELLT NYGKIDIMWF DFSYGDLHGE AWRASELINK ARSLQPDLII DNRLAASGEG NNSFGTPNPE IYAGDFACPE QLIPPTGLVD QFGRSVPWEA CITLNQHWGY SASDRDFKSS RQVVHGLIEC VSKNGNLLLN VGPDAKGRIP AECQLILQEV GVWMAEHSES IYGCGRSQLE KPEWGRYTQH GSTIYAHIYE RGIGPINFRG LNGKVKRARL IADNSEVKLE LPWIAKDYAA DLFLNWPSAQ LPNQTATVVA LELDEGPEAR G
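
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch in plain Python. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). `translate`, `gc_percent`, and `clean` are hypothetical helper names; whitespace is stripped because the sequences in this record are printed in blocks of ten.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between ten-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGACCCATG AACATCCGCT T"))  # MTHEHPL
print(translate("GAGGCTAGG GGCTAG"))         # EARG
```

Passing the full Gene sequence value to `translate` should yield the Protein sequence value with the block spacing removed, and `gc_percent` on the same input can be compared with the GC content field.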