Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3523 |
Symbol | |
ID | 5735384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4436121 |
End bp | 4436987 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280670 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_001546287 |
Protein GI | 159900040 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATG ATGATATTCC AGTAGGCCGA ATTTTAAGCC GCCGCGAGGC GCTCAAGCTT TTTTCAGCGG TTGGCGCAGG CATGATGTTG GCCGCCTGTG GGAGCCAAAA CTCAACTGCG ACCACCGCAC CAGCAGCCAC CGCCACAACT GGTACAAGTG CCGCAACCGC CATACCAACT GCAACGACTG CCGCAACCAG CATTGCCAGT TTGCCATCGT GTGTGGTCAA ACCAGAGATG ACAGTTGGGC CATATTTTGT CGATGAGCAA CTCAATCGCT CGGATATTCG CAGCGAGCCA TCGGATAATT CGCTACGGGC GGGCGTGCCG CTGACCCTCA ATATCAACGT TTCACAAATT AGCAGCAGCG CTTGTACGGC CTTGGCAGGG GCGATGGTCG ATATTTGGCA ATGCGATGCA GAAGGAATTT ATTCGGGGGT GACTGATGCA GGCTTTCAAA CTGAGGGCTT AAAGTTTTTG CGCGGCTACC AAATAACCGA TTCTAATGGC GATGCCAGCT TTACCACGAT TTTCCCAGGC TGGTATCAAG GGCGCACCGT GCATATTCAT GTCAAAATTC GCACCACCAG CAACACCAAC GAAGCCTATG AATTTACCTC GCAATTTTAT TTCGATACCG CCTTGACCAA TGAAATTTTG GCGAATGCAC CCTACAAAGC TGGGAGCCAA CGCGACACCA CCAACGAAAA CGATATGCAC TACGCCAATG GTGGCGAGCA AATGCTGCTT AGCTTAACCA AAACCAACGA TGGCTACACC GCTGGCTTTC CAATCGCGCT CGATTTGAGC GACGCTGAAA CTGGCCAAGC TGATCGGTTT GAGCAAATGC AAGCGCCGCC ACGCTAG
|
Protein sequence | MDNDDIPVGR ILSRREALKL FSAVGAGMML AACGSQNSTA TTAPAATATT GTSAATAIPT ATTAATSIAS LPSCVVKPEM TVGPYFVDEQ LNRSDIRSEP SDNSLRAGVP LTLNINVSQI SSSACTALAG AMVDIWQCDA EGIYSGVTDA GFQTEGLKFL RGYQITDSNG DASFTTIFPG WYQGRTVHIH VKIRTTSNTN EAYEFTSQFY FDTALTNEIL ANAPYKAGSQ RDTTNENDMH YANGGEQMLL SLTKTNDGYT AGFPIALDLS DAETGQADRF EQMQAPPR
|
| |