Gene M446_4755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4755 
Symbol 
ID6134786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5228730 
End bp5230643 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content73% 
IMG OID641644892 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001771519 
Protein GI170742864 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.304127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCG CGATCGCGAC CGTTTGCCTG AGCGGCACCC TCGGCGAGAA GCTGGAGGCC 
ATCGCCGCGG CCGGCTTCTC GGAGGTCGAG ATCTTCGAGA ACGACCTCCT CTCCTTCAGC
GGCACGCCCC GGGACCTGCG CCGCCGGGCG GAGGATCTCG GCCTCGCCGT CGCCGTCTAC
CAGCCCTTCC GCGACTTCGA GGGGATGCCG GCGCCGCAGC GCGCCAAGGC CTTCGCCCGG
GCCGAGCGCA AGTTCGACAC GATGCAGGAG CTCGGCTGCG ACCTCCTGAT GGTGTGCTCC
AACGTCTCGC CCGACAGCCT GGGCGGGCTC GACCGGGCGG CGGAGGATTT TCGCGAACTC
GGCGAGCGCG CGGCCCGGCG CGGCATGCGG GTCGGCTACG AGGCCCTGGC CTGGGGCCGG
CACGTCAGCG ATTACCGCGA CGCCTGGGAG ATCGTGCGCC GGGCCGATCA TCCGGCCGTC
GGCTTCGTCC TCGACAGCTT CCACGTGCTC GCCCGGGGCA CCGATCTCGG GGCGATCCGC
TCCATCCCGC GGGAGAAGAT CGTCCTGGTG CAGATGGCCG ACGCGCCCCG CCTCGCCATG
GATCACCTCT CCTGGAGCCG CCACTACCGC TGCTTCCCGG GCCAGGGCGA CCTGCCGATC
CCGGCCTTCG TGGACGCGCT CGGGGCGACG GGCTTCGACG GGATCCTGTC GCTGGAGATC
TTCAGCGACC GCTTCCGGGC GGGCTCGGCC CGCGGCGTCG CCCTCGACGG GCGCCGCTCG
CTCCTCGTCA TGCTCGACGA CCTGCGCCGC CGGGCGGGCG CGCCGGAGGA TCCCGCCGGG
CGGGGGCCGG CCCCTCTCCT GCCCGCCCTG CCGCCGCGGG CCGCCTGCGA GGGGATCGAG
TTCATCGAGT TCGCCATGGA CGAGGAGGAG GCGCGGGCAT TCGAATCGGT CCTGTCCGGC
CTCGGCTTCG CCCGGACCGC CCGCCACCGC TCCAAGGCCG TGACCCGCTG GAGCCAGGGC
GCGATCAACC TCGTGGTCAA TACCGAGAAG GAGGGCTTCG CCCATTCCTT CCAGGTCACG
CACGGCGCCT CGGTCTGCGC GGTCGCGCTG CGGGTCGACG ATGCGGGCGC GGCTCTGGAG
CGGGCGCGCG CCCTCCTCGA CGAGCCGTTC CGGCAGGCGG TGGCGCCGGG CGAACTCGAC
ATCCCGGCCG TCCGCGGCGT CGGCGGCAGC CTGCTCTACC TCGTCGATCG CCGCAGCGGC
CTCGACCGCC TCTGGGACGT GGATTTCGAG CCCCTCGCTC CGGAGCCGGT CCGCGGCGCC
GGGCTCGTCG CCGTCGATCA CCTCGCCCAG AGCATGCGCC ACGAGGAGAT GCTGACCTGG
CTCCTGTTCT ACACCGGGCT GTTCGACCTC GCGAAGCTGC CCGTGCAGGA CGTGGTCGAT
CCGGGCGGCG TGGTCGAGAG CCAGGCCGTC GAGGCCCCGG GCGCGGCCCT GCGCCTCGTC
CTCAACGCCT CGCAGAGCAG CCGCACCCTC TCCTCGCGCT TCCTGTCGGA GGCGCTCGGC
GGCGGGGTGC AGCACGTGGC GCTCGCCACC GACGACATCG TCGCCACCGT CGCGTCCCTG
CGCGCCGCCG GGGTCGCGCT CCTGCCGATT CCGGAGAACT ACTACGACGA CCTGGAGGCC
CGCACCGACC TGCCGCCCGA GACCCTGGCG CGGCTGCGCG ACGGGAACAT CCTGTACGAC
CGCGAGGGCG GGGCCGAGTT CTTCCAGGTC TACACCCGCG GCCTCCTCGG CGGCGGCTTC
GCCTTCGAGA TCGTCGAGCG GCGCGGCTAT CGCGGCTACG GCGCCGCGAA TGCCCCGATC
CGGCTTGCGG CCCAGACCCG CCTGGCCCCA CATCCGGCCC TCCCCACCCG GTAA
 
Protein sequence
MKLAIATVCL SGTLGEKLEA IAAAGFSEVE IFENDLLSFS GTPRDLRRRA EDLGLAVAVY 
QPFRDFEGMP APQRAKAFAR AERKFDTMQE LGCDLLMVCS NVSPDSLGGL DRAAEDFREL
GERAARRGMR VGYEALAWGR HVSDYRDAWE IVRRADHPAV GFVLDSFHVL ARGTDLGAIR
SIPREKIVLV QMADAPRLAM DHLSWSRHYR CFPGQGDLPI PAFVDALGAT GFDGILSLEI
FSDRFRAGSA RGVALDGRRS LLVMLDDLRR RAGAPEDPAG RGPAPLLPAL PPRAACEGIE
FIEFAMDEEE ARAFESVLSG LGFARTARHR SKAVTRWSQG AINLVVNTEK EGFAHSFQVT
HGASVCAVAL RVDDAGAALE RARALLDEPF RQAVAPGELD IPAVRGVGGS LLYLVDRRSG
LDRLWDVDFE PLAPEPVRGA GLVAVDHLAQ SMRHEEMLTW LLFYTGLFDL AKLPVQDVVD
PGGVVESQAV EAPGAALRLV LNASQSSRTL SSRFLSEALG GGVQHVALAT DDIVATVASL
RAAGVALLPI PENYYDDLEA RTDLPPETLA RLRDGNILYD REGGAEFFQV YTRGLLGGGF
AFEIVERRGY RGYGAANAPI RLAAQTRLAP HPALPTR