Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1306 |
Symbol | |
ID | 5733199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1516664 |
End bp | 1517893 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278446 |
Product | 3,4-dihydroxy-2-butanone 4-phosphate synthase |
Protein accession | YP_001544082 |
Protein GI | 159897835 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0432513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTGG CAAGCATCGA AGAAGCACTG CAAGATTATG CAGCCGGAAA AATGGTGATC ATCGTAGACG ATGAAGATCG CGAAAACGAG GGTGATCTCG CGTGTGCTGC CCAGTTTGTT ACGCCACAGG CAATTAATTT TATGGCCAAA GAAGGCCGTG GGCTGATCTG TTTAGCCATT ACTGGTCAGC GCCTCGACGA GTTGGAAATT CCAATGATGG TGACCTCGAA TAACTCTAAG TTTGGCACCA ATTTCACGGT GTCGATCGAG GCTGCTCAAG GCGTAACCAC CGGAATTTCA GCCTATGATC GAGCGCATAC CATCCAAACC GTGATCAATC CAAACTCCAA GCCCAGCGAT ATTTCGCGGC CTGGCCATGT GTTTCCTTTA CGCGCTGCCG AGGGTGGCGT ATTGCGGCGG GTTGGCCAAA CTGAAGCCTC AGTCGATATG TCACGCTTGG CGGGTTTGTA TCCTGCTGGC GTAATTTGCG AAATTATGAA CGACGATGGC ACGATGGCTC GGATGCCTGA CCTTGAGATT TTCGCCGAAA AGCATGATCT CAAAATTATC AGCGTTGAGC AATTAATTCG CTTTCGTCGT GAGCGTGAAT TTCTAGTTAC GCGCACTGCT GAAGCTCGCT TACCAACGGC CTATGGCGAA TTTCAAATCG TCTCATATGA GAGCAAAATC AACACACCTG ATCTGCAAGC CAAGGAGGCA GTGGCCTTGA TTATGGGCGA TATTTCAACC GACGAGCCAG TGCTGGTGCG GGTGCACTCC GAATGTTTAA CTGGCGATGC CTTTGGCTCA TTGCGTTGTG ATTGCGGCCC ACAGCTTGAA AAAGCGATGC AACGGATTGC CCAAGAAGGC CGTGGCGTGT TGCTGTATTT GCGCCAAGAA GGCCGTGGAA TTGGTCTGAC CAACAAAATT CGCGCCTATA TGCTGCAAGA TCAGGGCTTG GATACGGTTG AAGCTAACGA GCGCCTAGGT TTCCCTGCTG ATTTACGCGA TTATGGCTTG GGAGCGCAAA TGCTGGCCGA TTTGGGTCTG CGCGATTTAC GCCTACTCAC CAATAATCCC AAAAAGATCA TTGGTTTTGA AGGCTATGGT TTGCATGTGG TTGAACAATT GCCCTTGGAA ACCGACCCTA ACACTGAAAA CCAAGCCTAT CTGCGCACCA AGCGCGAACG CATGGGCCAT ACGTTATCGA ATTTGAATTC GGCGGAATAG
|
Protein sequence | MPLASIEEAL QDYAAGKMVI IVDDEDRENE GDLACAAQFV TPQAINFMAK EGRGLICLAI TGQRLDELEI PMMVTSNNSK FGTNFTVSIE AAQGVTTGIS AYDRAHTIQT VINPNSKPSD ISRPGHVFPL RAAEGGVLRR VGQTEASVDM SRLAGLYPAG VICEIMNDDG TMARMPDLEI FAEKHDLKII SVEQLIRFRR EREFLVTRTA EARLPTAYGE FQIVSYESKI NTPDLQAKEA VALIMGDIST DEPVLVRVHS ECLTGDAFGS LRCDCGPQLE KAMQRIAQEG RGVLLYLRQE GRGIGLTNKI RAYMLQDQGL DTVEANERLG FPADLRDYGL GAQMLADLGL RDLRLLTNNP KKIIGFEGYG LHVVEQLPLE TDPNTENQAY LRTKRERMGH TLSNLNSAE
|
| |