Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4524 |
Symbol | |
ID | 5736375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5790056 |
End bp | 5791321 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281686 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547283 |
Protein GI | 159901036 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.186857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAATTT TAATTATTGC CTTTGGCACA CGCGGCGATG TTCAGCCGAT GGTGGCCTTG GGCTTGGCCT TGCAAGAGCG TGGGCATTCG ATAACCCTGT TGGTCAGCAG CAATTTTAAA AGCTGGGTTG AGGAGTTTGG GTTACAAGTG GCGACTGCGC GGGTCGATAT TCAGCAGATG ATGTTGAGCG ATCATGGCAA CGATTGGGTC AAACATGGGG CTAATCCAAT TAAACAGCGC AATGCGATGC GCCGTTTGTT GAAGCAACAT GCCTTGACCA TGGTTGAAGA TGCTTGGCAA GTCGCCCAAA ACTGCGATGT TTTGATCAGC AGTTTTACCT CGGATGTCTT TGCAGTGACG TTGGCTGAAG TGCTGAATGT GGTGCATATT AGCACGCCAC TGCAACCAGC CATGTTGGCC ACCCGTTGCG GCCCTGCTAG TGCGGCGGCA ATTCTACCAA ACCACGAGAG CATCATTAAT TATTGGTTTG GGCGCTGGGT GCTTGAGCCA TTTATGTGGC AAGTTGGCGG CGATTTTATT AATCAGTTTC GCCAGCAACA GCTCAAATTG CCAGCCCAAA GTGTGCGCGA ATATGCTCAA CGCTTGCGTC AAACCACGAT TATTCAAGGC TATAGCCCGG CAATCATTCC GCATCCCAGC GATTGGCCCG CCAATATTCA GACGGTTGGT TATTGGATGT TGCCGCCCGA TGAGGCTTGG CAAATGCCGC CTGAGCTTGA GCAATTTTTG GCCGATGGCC CAACTCCAAT CTATATAGGC TTTGGTAGCA TGACCGGAGC TAACCCTGAT GCTTTTACCG AATTGTTGCT CAAGGCGGTG GCACATAGCG GCCAGCGGGC AATTATCCAA ACTGGTTGGG CTGGCTTGGG CCAAATCGAA TTGCCCAAAA CTGTTTTTCG GATTGGCTCA GCGCCGCATG AACGGCTTTT TCGCCATGTC AAAGCGGCGG TACACCATGG CGGGGCTGGC ACAACGGCTG CAAGCTTAGC GGCTGGTTTG CCAACCGTCA TCGTGCCGCA CTTGGGCGAT CAACTGCGTT GGGGTCAGCG CGTGTTTGAT TTGGGCTTAG GGCCAAAGGC GATTCCGCGC AACAAACTTA CGGTTGATCG GTTGGCTTGG GCGATTTCGC AGGCCGCTAA CACGCCGAGC ATGCAACACA ATGCCCAAGC CATGGCCAAA ACCCTGCAAG CTGAGCAGGG CATCAGCCGC GCGGTCGAAA TTATTGAACA ACGGATACAA GCCTAG
|
Protein sequence | MRILIIAFGT RGDVQPMVAL GLALQERGHS ITLLVSSNFK SWVEEFGLQV ATARVDIQQM MLSDHGNDWV KHGANPIKQR NAMRRLLKQH ALTMVEDAWQ VAQNCDVLIS SFTSDVFAVT LAEVLNVVHI STPLQPAMLA TRCGPASAAA ILPNHESIIN YWFGRWVLEP FMWQVGGDFI NQFRQQQLKL PAQSVREYAQ RLRQTTIIQG YSPAIIPHPS DWPANIQTVG YWMLPPDEAW QMPPELEQFL ADGPTPIYIG FGSMTGANPD AFTELLLKAV AHSGQRAIIQ TGWAGLGQIE LPKTVFRIGS APHERLFRHV KAAVHHGGAG TTAASLAAGL PTVIVPHLGD QLRWGQRVFD LGLGPKAIPR NKLTVDRLAW AISQAANTPS MQHNAQAMAK TLQAEQGISR AVEIIEQRIQ A
|
| |