Gene Information |
Locus tag | Haur_0362 |
Symbol | |
ID | 5732213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 433120 |
End bp | 434373 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277485 |
Product | sterol 3-beta-glucosyltransferase |
Protein accession | YP_001543141 |
Protein GI | 159896894 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
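The coordinate, length, and protein-length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either assumption explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 433120
END_BP = 434373
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print(computed_length == GENE_LENGTH_BP)       # coordinates agree with Gene Length
print(computed_protein == PROTEIN_LENGTH_AA)   # length agrees with Protein Length
```

Under these assumptions both comparisons hold: 434373 - 433120 + 1 = 1254 bp, and 1254 / 3 = 418 codons, one of which is the stop codon, leaving 417 residues.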
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0442187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTA GCATCATCAG TTATGGTTCG CGCGGCGATG TTCAACCATT TGTAGCGATT GCACGGGCAT TACGCCATGT AGGGCATCAA GTTCAACTGA TTGGCCCAGC CAATTTTGCT GCGCTAAGCC ACGATGCAGG CGTGCCGTTC GTTTCGGTTG GAGTTGATAT TCAAGCCTAT TTACGTGAAC GCATCGCTAG CTTATCTGGC TCGCGCAATG TGATAGGGCT GCTCAAAAGC CTGCGTAACG AACTAAACGA ATTGATTGAA GGAATTGCCC AAGAAACATT GCAAGCCTGT CAAGGAACCG ATCTGATTCT TGGGACTGGC CCCCAGACTG CTAGTTTTGC TGAACGACTG GGGGTTCCAT TTATTGAAGC AGTGCTCCAA CCGCTAACCC CCACCCGCGC CTATCCCTCG CCAATTGCCC CGGCGTGGCT CCAACTTGGC GGATTCGCCA ACTATCTCAC GCATCTTGGT TTTGAGCAGA TTTTTTGGCA GATCTTTCGG CCTACGGTTA ATCGAGTTCG CAGCCATGTG CTTGGCTTAC CATCCTATGG CTTTACCAGC CCGTTTGGCA AAATCCGCGA GCAGGTTCCG TTGCGGCTAC ACGCCTATAG TGACTATGTT ATGCCAAGGC CAAACGATTG GGCCAAGCAA CATCAGGTCA CAGGCTTTTG GTTTCTGCCA GCACCAGCCG ATTGGTCGCC ACCAGCTGAG CTATGCGCCT TTCTGGCAGC TGGGCCGGCT CCAATTTATA TCGGCTTTGG CAGTATGATG GGCGGTGATC CACAACAATT AACCAGCATT GTGAAAGAAG CCTTAGCTCG CTCTGGCCAA CGGGGAATTT TGGCTGGCGG TTGGGGTGCA TTAGCCGAAA CCTCAGCGCC AAGCGATCAC TTATGCTTTG TTGAAAGCGT GCCGCATCAA TGGCTTTTCC CGCAAACAGC GGCAATTGTG CATCATGGCG GTGCTGGCAC CACTGGCGCA GCCTTACGCA GTGGCCGACC GTCAATCGTT GTGCCCTTTG CCTTCGATCA GACTTTCTGG GGGCGACGGG TGGCTGAGCT AGGCGTGGGC ACTGCACCCA TCGCACGTTC GCAAATCACG GTCGATCGGC TGACAGCAGC GATCAATCAG GTAACAACCC AAACCGCAAT TCGTGAACAA GCAGCCCAGC TTGGCAGCCA AATTCAGCAA GAATACGGCA CAGCCCAAGC GATTGACCAT ATTCATCGCG TATTTCGCCA TTAA
|
Protein sequence | MDISIISYGS RGDVQPFVAI ARALRHVGHQ VQLIGPANFA ALSHDAGVPF VSVGVDIQAY LRERIASLSG SRNVIGLLKS LRNELNELIE GIAQETLQAC QGTDLILGTG PQTASFAERL GVPFIEAVLQ PLTPTRAYPS PIAPAWLQLG GFANYLTHLG FEQIFWQIFR PTVNRVRSHV LGLPSYGFTS PFGKIREQVP LRLHAYSDYV MPRPNDWAKQ HQVTGFWFLP APADWSPPAE LCAFLAAGPA PIYIGFGSMM GGDPQQLTSI VKEALARSGQ RGILAGGWGA LAETSAPSDH LCFVESVPHQ WLFPQTAAIV HHGGAGTTGA ALRSGRPSIV VPFAFDQTFW GRRVAELGVG TAPIARSQIT VDRLTAAINQ VTTQTAIREQ AAQLGSQIQQ EYGTAQAIDH IHRVFRH
|
| |
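The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one possible way to normalize such a string, compute a GC fraction, and check for start and stop codons. The helper names are hypothetical, the demonstration uses only short excerpts copied from the gene sequence (not the full 1254 bp), and the record does not say how its own GC content figure was derived.

```python
def normalize_sequence(raw: str) -> str:
    """Remove whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if seq begins with ATG and ends with TAA, TAG, or TGA.

    Simplification: only the ATG start codon is checked here.
    """
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# Excerpts only: the first three and the last two blocks of the gene sequence.
first_blocks = normalize_sequence("ATGGATATTA GCATCATCAG TTATGGTTCG")
last_blocks = normalize_sequence("TATTTCGCCA TTAA")

print(first_blocks[:3], last_blocks[-3:])       # start and stop codons
print(f"{gc_fraction(first_blocks):.0%}")      # GC fraction of the excerpt only
```

To apply this to the whole gene, pass the complete Gene sequence string to `normalize_sequence` and then to `gc_fraction` and `has_start_and_stop`.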