Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3280 |
Symbol | |
ID | 8254399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3892129 |
End bp | 3893286 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644936932 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_003093536 |
Protein GI | 255533164 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.524985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTATCT ATCATACATT AGGGACTATC CCTGCCAAAC GTCATACTGT TTTCCGTAAG CCCGATGGGA ACCTTTACGC TGAAGAACTT GTTTCTACAG AGGGCTTTTC CAGTTTATAC TCGCTGGTGT ACCATTGCCA CCCTCCTACC ATTGTTAAAG CCTTAGGGGA ACCTTATTCA GTTGAACCTA AAATTGCCAG GGAGAAACAT TTGAAACATA CCAGTCTGCT TGGCTTTAAC ATTAAACCGG AAGATGATTA CCTGAAGAGC CGCAAGCCTG TACTGGTAAA CAGCGATCTG CACATTTCGC TGGCTGCACC GAAAAAATCC ATGACGGATT ATTTTTATAA GAACAGCCAG GCCGATGAAG TCATATTTAT CCATGAAGGT ACGGGGACAT TAAAGACAGG TTTTGGCAAA ATCCGCTTTG GCTATGGCGA TTACGTGATT GTACCCAGGG GCACCATTTA CCAAATTGAA TTTGATGATG AAAAAAACAG GTTATTTATT GTAGAGAGTT TTAGCCCGAT CCGTTCGCCC AAGCGCTACC GCAATGAATA CGGACAGCTG ATGGAGCATT CTCCTTATTG CGAGCGTGAC ATCAGACGGC CATCTGATCT GGAAACCATA GATGCTTATG GCGATTTTAA GGTGTTGATA AAAAAACAGG GCCTGATTTA TCCTTATATA TACGGTACAC ATCCTTTTGA TTTTGTGGGT TGGGATGGCT TTCATTATCC TTATGCCTTT TCTATTCATG ATTTTGAACC GATCACAGGA AGGTTGCATC AGCCTCCCCC TGTGCACCAG ACTTTTGAAG GACACAATTT TGTGATCTGT TCTTTTGTTC CCCGCAAATA CGATTATCAT CCTTTATCGA TACCAGCCCC CTATAACCAT AGTAATGTAG ACAGTGATGA GGTGCTGTAT TATGTGGACG GTGATTTTAT GAGCAGGAAA AGTGTGGTAA AAGGACAGAT TACGCTGCAT CCGGGAGGTA TTCCCCATGG GCCGCACCCG GGCACAGTTG AGAAATCAAT AGGCAAGGAA AGTACGGAGG AACTGGCTGT GATGATAGAT CCCTTCAGGC CCCTGATGCT GACAGAAGAT GCGTTGGCAA TAGAGGATGA GGATTACCAC AAAAGCTGGC TGGAGTAA
|
Protein sequence | MPIYHTLGTI PAKRHTVFRK PDGNLYAEEL VSTEGFSSLY SLVYHCHPPT IVKALGEPYS VEPKIAREKH LKHTSLLGFN IKPEDDYLKS RKPVLVNSDL HISLAAPKKS MTDYFYKNSQ ADEVIFIHEG TGTLKTGFGK IRFGYGDYVI VPRGTIYQIE FDDEKNRLFI VESFSPIRSP KRYRNEYGQL MEHSPYCERD IRRPSDLETI DAYGDFKVLI KKQGLIYPYI YGTHPFDFVG WDGFHYPYAF SIHDFEPITG RLHQPPPVHQ TFEGHNFVIC SFVPRKYDYH PLSIPAPYNH SNVDSDEVLY YVDGDFMSRK SVVKGQITLH PGGIPHGPHP GTVEKSIGKE STEELAVMID PFRPLMLTED ALAIEDEDYH KSWLE
|
| |