Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3281 |
Symbol | |
ID | 8254400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3893358 |
End bp | 3894488 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644936933 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_003093537 |
Protein GI | 255533165 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.277548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACAC AAACATTTGC AGAAAAAATT GCCAGAGCGC AGGATTTTCT TCCAATAAAT GGAACGGATT ATATAGAATT TTATGTGGGC AATGCCAAAC AGGCTGCACA TTATTACAAA ACGGCCTTTG GCTTCCAGTC GCTTGCCTAT GCAGGGCCTG AAACAGGTGT ACGCGACAGG GCCTCGTATG TATTACAGCA GGGAAAGATC AGACTGGTGC TCACTACAGC TTTAAAATCG GAGGGCCCGA TTGCGGAGCA TGTTAAACGG CACGGCGATG GCGTAAAAAT ACTGGCCCTT TGGGTAGATG ATGCTTACAG TGCTTTTGAA GAGACCACCA AAAGGGGCGC AAGGCCTTAC CTGGAGCCGG TAACCCACAC AGATGACCAT GGTGAGGTCC GTATGTCGGG CATCTACACC TATGGCGAGA CCGTACATAT ATTTGTTGAA CGCAAAAATT ATAGGGGCAG CTTTATGCCA GGGTATGTAG ACTGGAAAAG CAATTATAAC CCTGCAGATA CAGGTTTATT GTATATAGAC CATTGCGTGG GCAATGTAGG CTGGAACAGG ATGAACGAAA CAGTGAAGTG GTATGAGGAT GTGATGGGTT TCGTAAACAT CCTTTCTTTT GATGATAAAC AGATCAATAC GGAATATTCT GCACTAATGA GCAAAGTGAT GAGCAATGGA AATGGGTATT CTAAGTTTCC GATCAATGAG CCGGCTGAAG GGAAGAAGAA ATCGCAGATT GAAGAATATC TGGAATTTTA CGAAGGCGAA GGGGTGCAAC ACATTGCTGT GGCAACAAAA GATATCCTGA CCACAGTAAG GGACTTAAAA GCTCGAGGGG TTGAGTTTTT GAGTGCACCC CCTGAAGCCT ATTACAATAT GATGCCTGAA CGTGTGGGCG AGATTGATGA AGAAATGGCA CAGCTAAAAG AACTGGGCAT TTTGGTAGAT TGTGATGAGG AAGGTTATCT GCTGCAGATC TTTACCAAAC CTGTTGAAGA CAGACCGACC TTATTTTTTG AGATTATACA GCGCAAAGGT GCACAATCTT TTGGGGCAGG GAATTTTAAA GCCCTATTTG AGTCTTTAGA ACGCGAACAG GAACTAAGGG GGAATTTATA G
|
Protein sequence | MSTQTFAEKI ARAQDFLPIN GTDYIEFYVG NAKQAAHYYK TAFGFQSLAY AGPETGVRDR ASYVLQQGKI RLVLTTALKS EGPIAEHVKR HGDGVKILAL WVDDAYSAFE ETTKRGARPY LEPVTHTDDH GEVRMSGIYT YGETVHIFVE RKNYRGSFMP GYVDWKSNYN PADTGLLYID HCVGNVGWNR MNETVKWYED VMGFVNILSF DDKQINTEYS ALMSKVMSNG NGYSKFPINE PAEGKKKSQI EEYLEFYEGE GVQHIAVATK DILTTVRDLK ARGVEFLSAP PEAYYNMMPE RVGEIDEEMA QLKELGILVD CDEEGYLLQI FTKPVEDRPT LFFEIIQRKG AQSFGAGNFK ALFESLEREQ ELRGNL
|
| |