Gene Rpic12D_3041 details


Gene Information

Locus tag: Rpic12D_3041
Symbol:
ID: 8020724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia pickettii 12D
Kingdom: Bacteria
Replicon accession: NC_012856
Strand:
Start bp: 3209849
End bp: 3210955
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 62%
IMG OID: 644831838
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_002982982
Protein GI: 241664622
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
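The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3209849
END_BP = 3210955
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 3210955 - 3209849 + 1 = 1107 bp, and 1107 / 3 - 1 = 368 aa.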


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTCA CGCCTTGGGA AAACCCGATG GGCACCGCCG GCTTCGAGTT CATCGAATAC 
GCCGCGCCGG ACCCCGTTGC CATGGGCAAG CTGTTCGAGA ACATGGGCTT TACCGCCATC
GCGAAACACC GTCACAAGAA CGTGACGCTG TACCGCCAGG GCGAGATCAA CTTCATCATC
AACGCCGAGC CCGATTCGTT CGCGCAGCGC TTTGCGCGCC TGCACGGCCC GTCGATCTGC
GCGATCGCGT TTCGCGTGCG AGACGCCGCG TTCGCCTACA AGCGTGCGCT GGAGCTGGGC
GCCTGGGGCT TCGACACCCA CAGCGGCCCG ATGGAGCTGA ACATTCCGGC CATCAAGGGC
ATCGGCGATT CGCTGATCTA CCTGGTCGAC CGCTGGACCG GCAAGAACGA CGCCAAGGCC
GGCGACATCG GCAACATCAG CATCTACGAC GTCGATTTCG TGCCCATCGC GGGCGCCAAC
CCGAACCCCA CCGGGCACGG CCTGACCTAC ATCGACCACC TGACGCACAA CGTCTACCGT
GGCCGGATGA AGGAATGGGC CGAGTTTTAC GAACGCTTCT TCAACTTCCG TGAGGTCCGC
TACTTCGACA TCGAAGGCCA GGTCACGGGC GTGAAGAGCA AGGCGATGAC GAGCCCGTGC
GGCAATATCC GCATCCCCAT CAACGAGGAA GGGACGGAGA AGGCCGGTCA GATCCAGGAG
TATCTGGACA TGTACCACGG CGAGGGCATC CAGCACATCG CGCTCGGTTC GACCAACCTG
TTCAACACGG TGGACGCGCT GCGCAGCAAG GGCATCAAGC TGCTGGACAC GATCGACACG
TATTACGAAC TGGTCGACAA GCGCATCCCC GGCCATGGCG AAGACGTGGC GGAACTGAAG
AAGCGCAAGA TCCTGATCGA CGGCGCACCG GGCGACCTTC TGCTGCAGAT CTTCTCGGAA
AACCAGCTCG GTCCGATCTT CTTCGAGTTC ATCCAGCGCA AGGGCAACCA AGGTTTTGGC
GAGGGCAACT TCAAGGCGCT CTTCGAGTCG ATCGAACTCG ACCAGATGCG CCGCGGCGTG
CTCAAGGCCG ACGACCAGCC GGCCTGA
 
Protein sequence
MQFTPWENPM GTAGFEFIEY AAPDPVAMGK LFENMGFTAI AKHRHKNVTL YRQGEINFII 
NAEPDSFAQR FARLHGPSIC AIAFRVRDAA FAYKRALELG AWGFDTHSGP MELNIPAIKG
IGDSLIYLVD RWTGKNDAKA GDIGNISIYD VDFVPIAGAN PNPTGHGLTY IDHLTHNVYR
GRMKEWAEFY ERFFNFREVR YFDIEGQVTG VKSKAMTSPC GNIRIPINEE GTEKAGQIQE
YLDMYHGEGI QHIALGSTNL FNTVDALRSK GIKLLDTIDT YYELVDKRIP GHGEDVAELK
KRKILIDGAP GDLLLQIFSE NQLGPIFFEF IQRKGNQGFG EGNFKALFES IELDQMRRGV
LKADDQPA
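The gene and protein sequences above can be checked against each other, and against the reported GC content, with a short script. This is a minimal sketch rather than the database's own method: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the two tables differ in which codons may act as start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record.
    print(translate("ATGCAATTCA CGCCTTGGGA AAACCCGATG"))  # MQFTPWENPM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives an end-to-end consistency check; `gc_percent` on the same input can be compared with the GC content field.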