Gene Information |
Locus tag | Haur_0373 |
Symbol | |
ID | 5732224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 446120 |
End bp | 447373 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277496 |
Product | sterol 3-beta-glucosyltransferase |
Protein accession | YP_001543152 |
Protein GI | 159896905 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
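The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that Protein Length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 446120
end_bp = 447373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1254
print(protein_length)   # 417
```

Under these assumptions the computed values agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields.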
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTATT GTATCTTGGC GCTTGGTTCG CGCGGCGATG TGCAGCCGTT TATCGCCTTG GCGCTTGGGT TGCAAACTGA GGGTCATCAG GTCGTTATCG CCGCAGCTCA TGATTATCGT AGCTTGGTTG AAAGCTATGG CTGCCGTTTT GCCCCGTTGG TTGGCTCAAT TTCGGCCTTG CTCAACCCTG AACAAATGGC GGCGGGCTTG GCAGCAGGTC GCAGCGCAAT CATCAAACAA TTTCTTCAAC AAACACCACC GATTATTCGT CAATTAATCG CTGATGCACT AGCGGCCTGC CAAACTGCCG ATTGTTTGAT CGTTTCCAGT TTGGGGATGT GGCCTGCGCT GCATCTAGCC GAGCATTTGC ACATTCCCGT AGTGTTGGTA CACCTGCACC CTTATGCTGC TAGCAGCCAA ACCGCCCATC ATTTTGCGCC GCAACTGGCT TGGGCCAGTT ATCGCCGCAT GAGCTATCGC GTGGCCGAGC AATTGCAATG GCAAGTCTTG CGCATGGCCT TCAATCAAGC ACGGCAGCAG ATTTTGCAAC GCCCAAGCCT AAGCATTGGC CAGCTTTGGC AACGGAGTCG CAATTTTCAG CCACCAACCT TGTATGCCTA TAGCGCGTTG GTTGCTCCGC CGCCAGCAAC TTGGTTTGAC GATGGCGCGA TCACTGGCTA TTGGTCACTG CCCCCAGCGG CAGATTGGCA AGCACCAACG GCATTACAGC AATTTCTGGC AGCAGGCCCA GCGCCAATCA CCATTAGTTT TGGCAGCATG TTGCATGGTC AAAAGCGCGG CAATCAATTA AGCCAATTGC TAATTACCGC CAGCCAAAAA GCCAAAGTAC GCATGATCAT CAACCAAGGC TGGGGCGATT TAGCTCAGGG CAAGTTGCCA GCCAACTGTT TAGCGATCAA TGGCCTAGCG TATGCTTGGT TATTTGAGCG GGTAGCGGCA GTTGTGCATC ATGGCGGGGC GGGCGTAACC GCTACAGCCT TAGGTGCAGG CAAGCCCGCC TTGGTCACAC CATTTTTGGG CGACCAATAT TTTTGGGGCC AGCGGGTGTA TGATCTCAAA GCGGGGCCAG CGCCTGTGCC AGCCAACCAA TTGCAAGTTG CACAGCTCGC TACTCTGCTG TGTAGCTTGA TTGAGCGCGA TGATTATCAG GCGGCGGCGC AACAACTTGC GACCCAATTA GCCCAAGAGC AAGGCGTAAC CAAGGCCATA GCTTGGTTAA AACAACGATT TTGA
|
Protein sequence | MNYCILALGS RGDVQPFIAL ALGLQTEGHQ VVIAAAHDYR SLVESYGCRF APLVGSISAL LNPEQMAAGL AAGRSAIIKQ FLQQTPPIIR QLIADALAAC QTADCLIVSS LGMWPALHLA EHLHIPVVLV HLHPYAASSQ TAHHFAPQLA WASYRRMSYR VAEQLQWQVL RMAFNQARQQ ILQRPSLSIG QLWQRSRNFQ PPTLYAYSAL VAPPPATWFD DGAITGYWSL PPAADWQAPT ALQQFLAAGP APITISFGSM LHGQKRGNQL SQLLITASQK AKVRMIINQG WGDLAQGKLP ANCLAINGLA YAWLFERVAA VVHHGGAGVT ATALGAGKPA LVTPFLGDQY FWGQRVYDLK AGPAPVPANQ LQVAQLATLL CSLIERDDYQ AAAQQLATQL AQEQGVTKAI AWLKQRF
|
| |
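The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), so no reverse complement is applied. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon table is used here. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments (it differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAACTATT GTATCTTGGC GCTTGGTTCG"
print(translate(prefix))  # MNYCILALGS
print(gc_fraction(prefix))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and the GC content field.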