Gene Phep_3281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3281 
Symbol 
ID8254400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3893358 
End bp3894488 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content45% 
IMG OID644936933 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003093537 
Protein GI255533165 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.277548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAC AAACATTTGC AGAAAAAATT GCCAGAGCGC AGGATTTTCT TCCAATAAAT 
GGAACGGATT ATATAGAATT TTATGTGGGC AATGCCAAAC AGGCTGCACA TTATTACAAA
ACGGCCTTTG GCTTCCAGTC GCTTGCCTAT GCAGGGCCTG AAACAGGTGT ACGCGACAGG
GCCTCGTATG TATTACAGCA GGGAAAGATC AGACTGGTGC TCACTACAGC TTTAAAATCG
GAGGGCCCGA TTGCGGAGCA TGTTAAACGG CACGGCGATG GCGTAAAAAT ACTGGCCCTT
TGGGTAGATG ATGCTTACAG TGCTTTTGAA GAGACCACCA AAAGGGGCGC AAGGCCTTAC
CTGGAGCCGG TAACCCACAC AGATGACCAT GGTGAGGTCC GTATGTCGGG CATCTACACC
TATGGCGAGA CCGTACATAT ATTTGTTGAA CGCAAAAATT ATAGGGGCAG CTTTATGCCA
GGGTATGTAG ACTGGAAAAG CAATTATAAC CCTGCAGATA CAGGTTTATT GTATATAGAC
CATTGCGTGG GCAATGTAGG CTGGAACAGG ATGAACGAAA CAGTGAAGTG GTATGAGGAT
GTGATGGGTT TCGTAAACAT CCTTTCTTTT GATGATAAAC AGATCAATAC GGAATATTCT
GCACTAATGA GCAAAGTGAT GAGCAATGGA AATGGGTATT CTAAGTTTCC GATCAATGAG
CCGGCTGAAG GGAAGAAGAA ATCGCAGATT GAAGAATATC TGGAATTTTA CGAAGGCGAA
GGGGTGCAAC ACATTGCTGT GGCAACAAAA GATATCCTGA CCACAGTAAG GGACTTAAAA
GCTCGAGGGG TTGAGTTTTT GAGTGCACCC CCTGAAGCCT ATTACAATAT GATGCCTGAA
CGTGTGGGCG AGATTGATGA AGAAATGGCA CAGCTAAAAG AACTGGGCAT TTTGGTAGAT
TGTGATGAGG AAGGTTATCT GCTGCAGATC TTTACCAAAC CTGTTGAAGA CAGACCGACC
TTATTTTTTG AGATTATACA GCGCAAAGGT GCACAATCTT TTGGGGCAGG GAATTTTAAA
GCCCTATTTG AGTCTTTAGA ACGCGAACAG GAACTAAGGG GGAATTTATA G
 
Protein sequence
MSTQTFAEKI ARAQDFLPIN GTDYIEFYVG NAKQAAHYYK TAFGFQSLAY AGPETGVRDR 
ASYVLQQGKI RLVLTTALKS EGPIAEHVKR HGDGVKILAL WVDDAYSAFE ETTKRGARPY
LEPVTHTDDH GEVRMSGIYT YGETVHIFVE RKNYRGSFMP GYVDWKSNYN PADTGLLYID
HCVGNVGWNR MNETVKWYED VMGFVNILSF DDKQINTEYS ALMSKVMSNG NGYSKFPINE
PAEGKKKSQI EEYLEFYEGE GVQHIAVATK DILTTVRDLK ARGVEFLSAP PEAYYNMMPE
RVGEIDEEMA QLKELGILVD CDEEGYLLQI FTKPVEDRPT LFFEIIQRKG AQSFGAGNFK
ALFESLEREQ ELRGNL