Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1886 |
Symbol | |
ID | 5733775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2274270 |
End bp | 2275481 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279030 |
Product | glycosyl transferase family protein |
Protein accession | YP_001544657 |
Protein GI | 159898410 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAGTTC TATTCACCAC CAATCCTGGG GTTGGTCATC TCAACCCAAT GTTGCCGTTG GCGCATGCCT TGCAACAGGC TGGGCATAGC CTCGCTTTTG CTTCAGCTGC GGCATTTGGC CCAACCATCG AAGCCCATGG GTTTCGCTGC TTTGCGGCAG GCCTCGATTG GCTGCAATCG CAGCTTGAAC ACTATTTTCC CGAAACAAGC ACAATGTCGC TGGAAGAGTT GAGCGCTTGG TTTATTAGCG ATCTCTTCGC CGATCTGGCG GCTCAATACA TGGTTCCCGA TTTGCTAGCA ATTTGTCACG AGTGGCAGCC TGATTTGATT GTGCGCAACG ATTTTGAGTT CGGCGGTTAC ATTGCAGCTG AGTGTTTGGG CATACCCCAT GCGACGCTAA GCATTAGTTT CTTTTTACCA ATTGCCACGC TTGAGCCATT AATTGGCGAT CAATTGGCAT TTTTGCGCAG CAGCCACGGC TTAGCACCCT ACCCAGCCTT AGATCGGCTA TACCATTATG CCTATCTAGC CAGTGGTCCG TTGCGCTGCC AAGAGCAAAT CATTCCAACA ATGCATGCGA TTCAGCCTCA AACCATGATC AAACAGTCAG CCAACCAACT GCCGACTTGG GTCACACAGT TAGGCGATCG GCCAACGGTG TATGCCTCGA TGGGCACGGT CTTTAATCGT ACCGCCGATA TCTTTCCAAC GATTTTAGCG GCCCTGCGTG ATGAGCCGAT TAATCTTATT TTGACCATTG GTCGCAACCA AGATCCAGCG CAGTTTGGCC CACAACCAGC CCATATTCAC ATAGAGCAAT TTATTCCCCA AGATTTACTT TTTCCATATT GTGATCTGTT TATCACTCAT ACCTCGTTTC ACACCATGAT GTCGGCATTT AAGTGTGGCC TGCCCTTGCT GATGTTGCCG ATCTCTGCTG ATGAGCCAGT CTGTGCGTTG CGCGGCCTAG AGCTTGGCAT CGGCAGGATT ATCAAACGGC CTAAGCAATT CGACCAGTTT TTTGACGATT CAATTCCTGA GTTATCGCCA ACAGCGGTTC GCACAGCGGT ACACGAGTTA CTACATAATC CAAGTTATCG CCAAAACGCC CAACAATTGC AGGCCGAAAT TCGAGCTTTG CCCGAGCTTG AGTCTGCGGT TGCGCTGTTA ACCAAGCTTG CGGCGGAAAA ACAACCCCAA CGTGCACACT AA
|
Protein sequence | MRVLFTTNPG VGHLNPMLPL AHALQQAGHS LAFASAAAFG PTIEAHGFRC FAAGLDWLQS QLEHYFPETS TMSLEELSAW FISDLFADLA AQYMVPDLLA ICHEWQPDLI VRNDFEFGGY IAAECLGIPH ATLSISFFLP IATLEPLIGD QLAFLRSSHG LAPYPALDRL YHYAYLASGP LRCQEQIIPT MHAIQPQTMI KQSANQLPTW VTQLGDRPTV YASMGTVFNR TADIFPTILA ALRDEPINLI LTIGRNQDPA QFGPQPAHIH IEQFIPQDLL FPYCDLFITH TSFHTMMSAF KCGLPLLMLP ISADEPVCAL RGLELGIGRI IKRPKQFDQF FDDSIPELSP TAVRTAVHEL LHNPSYRQNA QQLQAEIRAL PELESAVALL TKLAAEKQPQ RAH
|
| |