Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3892 |
Symbol | |
ID | 5735753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4884400 |
End bp | 4885563 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281043 |
Product | Alpha-galactosidase |
Protein accession | YP_001546654 |
Protein GI | 159900407 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.77449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTT CAGAACAGCC TCTCGCACCT ACGCCCCCAA TGGGTTGGAA CTCGTGGAAC ATGTTTGGTA GTACGATTCA TGAAGATTCA GTCCGTGCCA CTGCTGACGT GTTGGTTAGC TCAGGCCTCA AGGATTGTGG TTATAACTAT GTGGTGATTG ATGATTGCTG GTCAACCAAA GTTGGCCGCG ATGGCAACGG CGATTTGGTT GCCGACCCCG AAAAATTCCC CAGTGGCATC AAAGCGCTAG CCGATTATGT GCATAGCCTT GGTTTGAAAA TTGGCATCTA CTCCGATGCG GCGCATCTGA CTTGCGCTAG TTATCCTGGC AGCTTTGGCT TCGAGGAGCA AGATGCCCAA TTATGGGCTT CGTGGGGCAT CGATTTCCTC AAATATGATT TTTGTTTTGC GCCAACCGAC CAAGCCACTG CGATCGACCG TTACACCCGC ATGGGCGAGG CGTTGCGCAA AACGAAGCGC CAATTTCTCT ACTCGTTGTG TGAGTGGGGT GGCCGCAGCC CACAGCTCTG GGGTCGCTCG GTTGGCGGGC ATATGTGGCG GGTCACTGGC GATATTTTCG ATAGCTGGGT TGATATTTGG GTTGCGCCAC ACAAATATTA TGGGGTAGGC ATTGATACAG CGATTGATAT TGCCGCCAAT TTAGCCGAAT ACGCTGGCCC TGATGCCTGG AACGATTTGG ATATGCTGGT GGTTGGATTG AAGGGCAAGG GTCAAATCTC TGGCGGTGGC TTATCATTCA TCGAATATCA AACCCATATG TCGTTGTGGA CGATCGCCTG CTCGCCCTTG ATGATCGGCT GCGATATTCG CAATATGGAT CGCGATACCA CCAGTTTATT GACTAATCGC GAAGTTTTGG CCGTCAACCA AGATAGTTTG GGGATTGCTG GGCGGCGCGT GAAACAAACC GGCACGTGCG AAATTTGGAA AAAACCGCTC GCCGATGGCT CGTTGGCGGT TGCCTTGATT AATCGTGGCT CGATCGGCAG CGATTTAAGC TTGCGAGCCA GCGATATTGG CCTGCTCGAT ACGCCGAAAT CAGTACGAAA TTTGTGGGCG CAAGCAGATA TTGCCGAGTT TGGCGAGGCT TGGCAAACCC GCATTCAACC CCATGAAACA TTATTGCTCA AAATCAAAGC CTAA
|
Protein sequence | MTTSEQPLAP TPPMGWNSWN MFGSTIHEDS VRATADVLVS SGLKDCGYNY VVIDDCWSTK VGRDGNGDLV ADPEKFPSGI KALADYVHSL GLKIGIYSDA AHLTCASYPG SFGFEEQDAQ LWASWGIDFL KYDFCFAPTD QATAIDRYTR MGEALRKTKR QFLYSLCEWG GRSPQLWGRS VGGHMWRVTG DIFDSWVDIW VAPHKYYGVG IDTAIDIAAN LAEYAGPDAW NDLDMLVVGL KGKGQISGGG LSFIEYQTHM SLWTIACSPL MIGCDIRNMD RDTTSLLTNR EVLAVNQDSL GIAGRRVKQT GTCEIWKKPL ADGSLAVALI NRGSIGSDLS LRASDIGLLD TPKSVRNLWA QADIAEFGEA WQTRIQPHET LLLKIKA
|
| |