Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2408 |
Symbol | |
ID | 5734289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3070063 |
End bp | 3071406 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279549 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_001545176 |
Protein GI | 159898929 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGA TTGTTATCAG TATGTTTCCC GAAGAAGGCC ATCTTATTCC TAGTTTCAAG CTTGGCAAGA GCTTAAAAGC CCAAGGTCAT CAGGTTTATT ATTTAGCCTT GGCCGATTTT GAGGAGTATA TTCGCAAGCA GGGTTTTGAA TATTTGCCCT TGTTTGCTGA GGATTTTCCC AAGGGCTTTC GTGCTCAGCA AACCGAGCGA ATTGCGAACA CCCGTGGCCG AGCCTTTTTG AAAGAGGTTT CACAAACGGC GTTTTATCTC AAGTTGTTTC AAACTGTACA TGAAGATCAA AATCCAATCA AGCGTAGTTT GTTGGAAATT GGGACAGATT TATGTCTTTT TGATGGATTT TTGGCCCCGC TGAGCCTTAT GGCGCGGCAT GCTGGGCTGG AAGTAATTAG TCTGAGTATT AATATTAATC TTCCGCAGGC CGCCAATTAT CCGCCAGTTG TCACCAATAT TGTGCCCGAT AACACGCCTG CCTCGCTCTC TAAAGCTGGC ATGGCTTGGA AATTTAGTGG GCTAGCAATC AAAATAACCA ACCTTTTGAT TGGCTACAAT TTCCAGAAAA AGCTAACCGA ACTGGCAACC CACTTTGGGT TTTCAGCCGA TTTAGTTGTG CCAGCGGCCC TCTTTCCACG CTTGCGACCC CAACTTGAGA TACCCGAACT CGTGCTTTGC CCGCAAGCCT TTGATTTTCC ACGGCCAAGC GTTGAGCAAG GGATTTTCTA CTGTGAGCCA TCAATCGATC TTGATCGCCA AGAAGCGGCC TTTGATTGGT CGCAGATTGA CCCCAACAAG CCGCTGATTT TTTGTACCTT GGGCAGCCAA AGCCATATCT ACAAGCCAAG TCGGCGCTTT TTTCAAACCG TGATCGACAC CATGCGCAGT CGCCCCGATT GGCAATTGAT TATGGCGCTC GGCCAGAAAT TTCAAGCCCA TGAATTTGCC AATGTGCCAG CCAATGTGCA GTTGCTGCAA TGGGCTTCGG TCGAGCAAAT TTTGCCGCGC ACCAGCGTGA TGATTACCCA TGGCGGGGTT GGCACGATTA AGGAATGTGT CTATTTCAAC GTGCCAATGG TGGTTTTTCC AGGCAACCGT GATCAACCTG GCTATGCGGC GCGGGTTGTT TACCATGAGC TAGGTTTGAT GGGTTCGATG GGGAAAGTTT CGGCCCAAGC GCTGGAAACC ATGCTCAACC AAGTGATTCA AAATCCTAAC TTTAAACAAC GAGTTACGGC AATGGGCGAG GAATTTCGCG CCTTGGAAGC TGCTAGCCCA GCGCTGGAAT TAATTCAAAG CAAATTACCG CATACCCGAA CCGTTGCCTC GTAA
|
Protein sequence | MATIVISMFP EEGHLIPSFK LGKSLKAQGH QVYYLALADF EEYIRKQGFE YLPLFAEDFP KGFRAQQTER IANTRGRAFL KEVSQTAFYL KLFQTVHEDQ NPIKRSLLEI GTDLCLFDGF LAPLSLMARH AGLEVISLSI NINLPQAANY PPVVTNIVPD NTPASLSKAG MAWKFSGLAI KITNLLIGYN FQKKLTELAT HFGFSADLVV PAALFPRLRP QLEIPELVLC PQAFDFPRPS VEQGIFYCEP SIDLDRQEAA FDWSQIDPNK PLIFCTLGSQ SHIYKPSRRF FQTVIDTMRS RPDWQLIMAL GQKFQAHEFA NVPANVQLLQ WASVEQILPR TSVMITHGGV GTIKECVYFN VPMVVFPGNR DQPGYAARVV YHELGLMGSM GKVSAQALET MLNQVIQNPN FKQRVTAMGE EFRALEAASP ALELIQSKLP HTRTVAS
|
| |