Gene Haur_2762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2762 
Symbol 
ID5734643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3517256 
End bp3518413 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content52% 
IMG OID641279905 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_001545528 
Protein GI159899281 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000862823 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTTCT ATCACAAACT TGGCAACATT CCACACAAGC GCCATACCCA ATTTCGCAAG 
CCCGACGGGT CGCTGTATTC GGAACAGTTG ATGGGAACCA AGGGATTTAG CGGGGTCGAA
GCCTTGCTCT ATCACCATTA TCCGCCAACT GCAATTCTCA AAGTGGAAGA TCTTGGTTCA
ACCGCGATTG AACTTGAACC TGATGGAGCA CTTCGCCATC GCCATCTCAA AACCTTTGCC
CATCAGCCAG GCGGCGATCC CATCGGTGGG CGACGTATGT TGTTGGTCAA CAATGACGTG
CGCATGGGAA TTGTCCACCC CACCGAACCC CAAACCTATT TCTATCGCAA CGGCGAAGGC
GATGAAATGT TGTTCATTCA CGAAGGTGAG GGAGTGCTCG AAACCATTTT TGGCAATATT
CCCTATCGTC GTGGCGATTA CTTGGTGATT CCGATTGGCA CAACCTATCG GGTCAATACC
AATGGCACGC CCACCAAAAT GTTGGTGCTC GAAACCATGG GCGAAATCAC CACGCCCAAT
CGTTATCGCA ATGAACATGG CCAATTGCTT GAGCACGCCC CCTTCTGCGA ACGCGATATT
CGGGTGCCGC TGGAGCTTAC CCCTCACGAC GAAAAAGGTG AATTTGCCGT CCATGTGCGG
GCGCATGGCC GCATGACCAA ACATGTGCTC AATCATCATC CCTTTGATGT AGTGGGCTGG
GATGGCTATC TCTATCCATT TGCCTTCAAC ATCGAAGATT TCGAGCCAAT TACGGGGCGG
GTACACCAAC CGCCACCAGT TCATCAAACC TTCAGCGGGC CAAATTTTGT GGTCTGCTCG
TTCGTGCCAC GGCTGTTCGA TTATCATCCT GAGGCAATTC CCGCACCCTA TAATCACTCA
AATGTCGAAA GTGATGAGGT GTTGTATTAT GTCGAGGGCA ACTTTATGTC ACGGCGTGGG
GTTGATCTTG GTTCGATCAC CTTGCATCCA TCGGGCATGC CGCACGGGCC GCACCCTGGC
ACGGTCGAGG GTTCAATTGG CAAAGCGGCC ACTGAAGAAT TAGCGGTAAT GGTCGATACC
TTCAAGCCGC TCTACCTCAC CAAAGCTGCC TTGAACCTTG AAGAACCCAA CTACACCTAT
TCATGGGTCA ACCACTAA
 
Protein sequence
MPFYHKLGNI PHKRHTQFRK PDGSLYSEQL MGTKGFSGVE ALLYHHYPPT AILKVEDLGS 
TAIELEPDGA LRHRHLKTFA HQPGGDPIGG RRMLLVNNDV RMGIVHPTEP QTYFYRNGEG
DEMLFIHEGE GVLETIFGNI PYRRGDYLVI PIGTTYRVNT NGTPTKMLVL ETMGEITTPN
RYRNEHGQLL EHAPFCERDI RVPLELTPHD EKGEFAVHVR AHGRMTKHVL NHHPFDVVGW
DGYLYPFAFN IEDFEPITGR VHQPPPVHQT FSGPNFVVCS FVPRLFDYHP EAIPAPYNHS
NVESDEVLYY VEGNFMSRRG VDLGSITLHP SGMPHGPHPG TVEGSIGKAA TEELAVMVDT
FKPLYLTKAA LNLEEPNYTY SWVNH