Gene Information |
Locus tag | Haur_2762 |
Symbol | |
ID | 5734643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3517256 |
End bp | 3518413 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279905 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001545528 |
Protein GI | 159899281 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
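The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record, but neither is stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1          # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1        # drop the stop codon
    return gene_length, protein_length


# Coordinates of Haur_2762 on NC_009972, taken from the record above.
gene_len, protein_len = cds_lengths(3517256, 3518413)
print(gene_len, protein_len)  # prints: 1158 385
```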
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000862823 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTTCT ATCACAAACT TGGCAACATT CCACACAAGC GCCATACCCA ATTTCGCAAG CCCGACGGGT CGCTGTATTC GGAACAGTTG ATGGGAACCA AGGGATTTAG CGGGGTCGAA GCCTTGCTCT ATCACCATTA TCCGCCAACT GCAATTCTCA AAGTGGAAGA TCTTGGTTCA ACCGCGATTG AACTTGAACC TGATGGAGCA CTTCGCCATC GCCATCTCAA AACCTTTGCC CATCAGCCAG GCGGCGATCC CATCGGTGGG CGACGTATGT TGTTGGTCAA CAATGACGTG CGCATGGGAA TTGTCCACCC CACCGAACCC CAAACCTATT TCTATCGCAA CGGCGAAGGC GATGAAATGT TGTTCATTCA CGAAGGTGAG GGAGTGCTCG AAACCATTTT TGGCAATATT CCCTATCGTC GTGGCGATTA CTTGGTGATT CCGATTGGCA CAACCTATCG GGTCAATACC AATGGCACGC CCACCAAAAT GTTGGTGCTC GAAACCATGG GCGAAATCAC CACGCCCAAT CGTTATCGCA ATGAACATGG CCAATTGCTT GAGCACGCCC CCTTCTGCGA ACGCGATATT CGGGTGCCGC TGGAGCTTAC CCCTCACGAC GAAAAAGGTG AATTTGCCGT CCATGTGCGG GCGCATGGCC GCATGACCAA ACATGTGCTC AATCATCATC CCTTTGATGT AGTGGGCTGG GATGGCTATC TCTATCCATT TGCCTTCAAC ATCGAAGATT TCGAGCCAAT TACGGGGCGG GTACACCAAC CGCCACCAGT TCATCAAACC TTCAGCGGGC CAAATTTTGT GGTCTGCTCG TTCGTGCCAC GGCTGTTCGA TTATCATCCT GAGGCAATTC CCGCACCCTA TAATCACTCA AATGTCGAAA GTGATGAGGT GTTGTATTAT GTCGAGGGCA ACTTTATGTC ACGGCGTGGG GTTGATCTTG GTTCGATCAC CTTGCATCCA TCGGGCATGC CGCACGGGCC GCACCCTGGC ACGGTCGAGG GTTCAATTGG CAAAGCGGCC ACTGAAGAAT TAGCGGTAAT GGTCGATACC TTCAAGCCGC TCTACCTCAC CAAAGCTGCC TTGAACCTTG AAGAACCCAA CTACACCTAT TCATGGGTCA ACCACTAA
|
Protein sequence | MPFYHKLGNI PHKRHTQFRK PDGSLYSEQL MGTKGFSGVE ALLYHHYPPT AILKVEDLGS TAIELEPDGA LRHRHLKTFA HQPGGDPIGG RRMLLVNNDV RMGIVHPTEP QTYFYRNGEG DEMLFIHEGE GVLETIFGNI PYRRGDYLVI PIGTTYRVNT NGTPTKMLVL ETMGEITTPN RYRNEHGQLL EHAPFCERDI RVPLELTPHD EKGEFAVHVR AHGRMTKHVL NHHPFDVVGW DGYLYPFAFN IEDFEPITGR VHQPPPVHQT FSGPNFVVCS FVPRLFDYHP EAIPAPYNHS NVESDEVLYY VEGNFMSRRG VDLGSITLHP SGMPHGPHPG TVEGSIGKAA TEELAVMVDT FKPLYLTKAA LNLEEPNYTY SWVNH
|
| |
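The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the tool used to produce this record: it assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code for the 64 codons (the tables differ only in which codons are permitted as starts). The names `CODON_TABLE` and `translate` are hypothetical. Whitespace is stripped first because the sequences here are printed in blocks of 10.

```python
from itertools import product

# Codon -> amino acid, built from the NCBI-style compact string
# (bases ordered T, C, A, G at each position; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break                      # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last two blocks of the gene sequence above.
print(translate("ATGCCCTTCT ATCACAAACT TGGCAACATT"))  # prints: MPFYHKLGNI
print(translate("TCATGGGTCA ACCACTAA"))               # prints: SWVNH
```

Both fragments match the corresponding ends of the protein sequence in this record; passing the full gene sequence to `translate` allows the same comparison over all 385 residues.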