Gene Ajs_0520 details

Gene Information

Locus tag: Ajs_0520
Symbol:
ID: 4672780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax sp. JS42
Kingdom: Bacteria
Replicon accession: NC_008782
Strand:
Start bp: 542387
End bp: 543505
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 66%
IMG OID: 639837649
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_984846
Protein GI: 121592950
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
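
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 542387
end_bp = 543505

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1119 0 372
```

Under these assumptions the result agrees with the stated gene length of 1119 bp and protein length of 372 aa.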


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.365808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0158629
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCG CCCTGCCCCT TCGCGCCACC CACGACGCCG ACGCCTGGGA GAACCCCATG 
GGCCTGATGG GCTTCGAGTT CGTGGAGTTC ACCTCGCCCA CCCCGGGGCT GCTGGAAGCG
GTGTTCGAGA AGCTGGGCTT TACGCTCGTG GCGCGCCACC GGTCCAAGGA CGTGCTGCTG
TACCGGCAGA ACCAGATCAA CTTCATCCTG AACCGCGAGC CGGCCAGCCA GGCCGCGTAC
TTTGGCGCCG AACACGGCCC CTCGGCCTGC GGGCTGGCCT TTCGCGTGAA GGATTCGCAC
CAGGCCTACC GGCGTGCGCT GGAGCTGGGC GCCCAGCCGA TCGAGATCCC CACCGGCCCC
ATGGAACTGC GCCTGCCGGC CATCAAGGGC ATCGGCGGCG CGCCGCTGTA CCTGATCGAC
CGCTTCGAGG ACGGCAAGTC CATCTACGAC ATCGACTTCG ACTTCCTGCC CGGCGTGGAC
CGGCGCCCGG TGGGGCACGG CCTGAACGAG ATCGACCACC TCACGCACAA CGTGTACCGC
GGACGCATGG GATTCTGGGC CAATTTCTAC GAAAAGCTCT TCAACTTCCG AGAGATCCGC
TACTTCGACA TCCAGGGCGA GTACACGGGC CTGACGTCGA AGGCCATGAC GGCGCCGGAC
GGCAAGATCC GCATCCCGCT GAACGAAGAG GCCAAGCAGG GCGGCGGGCA GATCGAGGAA
TTCCTGATGC AGTTCAACGG CGAGGGCATC CAGCACATCG CGCTGATCTG CGACTCGCTC
GTCGAGGTGG TGGACAAGCT GGGGCTGGCG GGCGTGCCCC TGGCCCCGGC GCCCAACGAC
ATCTATTACG ACATGCTGGA CACCCGCCTG CCGAGTCACG GCCAGAACGT GGCGGAACTG
CAGGCGCGCG GCATCCTGCT GGACGGCACC ACGGCCGACG GCACACCCCG TCTGCTGCTG
CAGATCTTCT CCACGCCCAT GCTGGGGCCG GTGTTCTTTG AGTTCATCGA ACGCCAGGGC
AACTACCGCG AGGGTTTCGG CGAAGGCAAC TTCAAGGCGC TGTTCGAATC GCTGGAGCGT
GACCAGATCC GGCGCGGCGT CCTGCAGACC CAGGCGTGA
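
A GC content figure like the one in the record is conventionally the share of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence formatted in 10-base blocks as above. The function name `gc_percent` is hypothetical, and whether the database rounds or handles ambiguous bases the same way is an assumption.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Drop the spaces and line breaks used for display formatting.
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGACTGCCG CCCTGCCCCT TCGCGCCACC CACGACGCCG ACGCCTGGGA GAACCCCATG"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the whole-gene value to compare against the GC content field.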
 
Protein sequence
MTAALPLRAT HDADAWENPM GLMGFEFVEF TSPTPGLLEA VFEKLGFTLV ARHRSKDVLL 
YRQNQINFIL NREPASQAAY FGAEHGPSAC GLAFRVKDSH QAYRRALELG AQPIEIPTGP
MELRLPAIKG IGGAPLYLID RFEDGKSIYD IDFDFLPGVD RRPVGHGLNE IDHLTHNVYR
GRMGFWANFY EKLFNFREIR YFDIQGEYTG LTSKAMTAPD GKIRIPLNEE AKQGGGQIEE
FLMQFNGEGI QHIALICDSL VEVVDKLGLA GVPLAPAPND IYYDMLDTRL PSHGQNVAEL
QARGILLDGT TADGTPRLLL QIFSTPMLGP VFFEFIERQG NYREGFGEGN FKALFESLER
DQIRRGVLQT QA
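
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it assumes the standard codon assignments (which translation table 11 shares for internal codons) and ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # A codon containing a non-ACGT character raises KeyError.
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence above; every full row holds
# 60 bases, so each row starts on a codon boundary.
first_row = "ATGACTGCCG CCCTGCCCCT TCGCGCCACC CACGACGCCG ACGCCTGGGA GAACCCCATG"
last_row = "GACCAGATCC GGCGCGGCGT CCTGCAGACC CAGGCGTGA"

print(translate(first_row))  # MTAALPLRATHDADAWENPM
print(translate(last_row))   # DQIRRGVLQTQA
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final TGA codon is the stop codon that accounts for the difference between 373 codons and 372 residues.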