Gene Phep_3280 details


Gene Information

Locus tag: Phep_3280
Symbol:
ID: 8254399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3892129
End bp: 3893286
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 44%
IMG OID: 644936932
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003093536
Protein GI: 255533164
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
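
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are both inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers listed here, but the record does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3892129, 3893286)
print(length, protein_length_aa(length))  # prints: 1158 385
```

The result matches the Gene Length (1158 bp) and Protein Length (385 aa) fields.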


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.524985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTATCT ATCATACATT AGGGACTATC CCTGCCAAAC GTCATACTGT TTTCCGTAAG 
CCCGATGGGA ACCTTTACGC TGAAGAACTT GTTTCTACAG AGGGCTTTTC CAGTTTATAC
TCGCTGGTGT ACCATTGCCA CCCTCCTACC ATTGTTAAAG CCTTAGGGGA ACCTTATTCA
GTTGAACCTA AAATTGCCAG GGAGAAACAT TTGAAACATA CCAGTCTGCT TGGCTTTAAC
ATTAAACCGG AAGATGATTA CCTGAAGAGC CGCAAGCCTG TACTGGTAAA CAGCGATCTG
CACATTTCGC TGGCTGCACC GAAAAAATCC ATGACGGATT ATTTTTATAA GAACAGCCAG
GCCGATGAAG TCATATTTAT CCATGAAGGT ACGGGGACAT TAAAGACAGG TTTTGGCAAA
ATCCGCTTTG GCTATGGCGA TTACGTGATT GTACCCAGGG GCACCATTTA CCAAATTGAA
TTTGATGATG AAAAAAACAG GTTATTTATT GTAGAGAGTT TTAGCCCGAT CCGTTCGCCC
AAGCGCTACC GCAATGAATA CGGACAGCTG ATGGAGCATT CTCCTTATTG CGAGCGTGAC
ATCAGACGGC CATCTGATCT GGAAACCATA GATGCTTATG GCGATTTTAA GGTGTTGATA
AAAAAACAGG GCCTGATTTA TCCTTATATA TACGGTACAC ATCCTTTTGA TTTTGTGGGT
TGGGATGGCT TTCATTATCC TTATGCCTTT TCTATTCATG ATTTTGAACC GATCACAGGA
AGGTTGCATC AGCCTCCCCC TGTGCACCAG ACTTTTGAAG GACACAATTT TGTGATCTGT
TCTTTTGTTC CCCGCAAATA CGATTATCAT CCTTTATCGA TACCAGCCCC CTATAACCAT
AGTAATGTAG ACAGTGATGA GGTGCTGTAT TATGTGGACG GTGATTTTAT GAGCAGGAAA
AGTGTGGTAA AAGGACAGAT TACGCTGCAT CCGGGAGGTA TTCCCCATGG GCCGCACCCG
GGCACAGTTG AGAAATCAAT AGGCAAGGAA AGTACGGAGG AACTGGCTGT GATGATAGAT
CCCTTCAGGC CCCTGATGCT GACAGAAGAT GCGTTGGCAA TAGAGGATGA GGATTACCAC
AAAAGCTGGC TGGAGTAA
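
The gene sequence is printed in blocks of ten bases, so it needs cleaning before any computation. As a minimal sketch, the spacing can be stripped and the GC percentage computed as shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical. The example runs on the first line of the sequence only; pasting the full sequence in its place should allow the GC content field of the record to be checked.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as a short example.
first_line = "ATGCCTATCT ATCATACATT AGGGACTATC CCTGCCAAAC GTCATACTGT TTTCCGTAAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```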
 
Protein sequence
MPIYHTLGTI PAKRHTVFRK PDGNLYAEEL VSTEGFSSLY SLVYHCHPPT IVKALGEPYS 
VEPKIAREKH LKHTSLLGFN IKPEDDYLKS RKPVLVNSDL HISLAAPKKS MTDYFYKNSQ
ADEVIFIHEG TGTLKTGFGK IRFGYGDYVI VPRGTIYQIE FDDEKNRLFI VESFSPIRSP
KRYRNEYGQL MEHSPYCERD IRRPSDLETI DAYGDFKVLI KKQGLIYPYI YGTHPFDFVG
WDGFHYPYAF SIHDFEPITG RLHQPPPVHQ TFEGHNFVIC SFVPRKYDYH PLSIPAPYNH
SNVDSDEVLY YVDGDFMSRK SVVKGQITLH PGGIPHGPHP GTVEKSIGKE STEELAVMID
PFRPLMLTED ALAIEDEDYH KSWLE
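
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. That difference does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCCTATCT ATCATACATT AGGGACTATC"))  # prints: MPIYHTLGTI
print(translate("AAAAGCTGGC TGGAGTAA"))               # prints: KSWLE
```

Both fragments agree with the start and end of the protein sequence listed above.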