Gene Information |
Locus tag | Vapar_4766 |
Symbol | |
ID | 7971776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 5063741 |
End bp | 5064868 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644795351 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002946637 |
Protein GI | 239817727 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
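
The coordinate and length fields in the table above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence does end in TAA). The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 5063741
END_BP = 5064868


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon, which encodes no residue


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 1128 375
```

Both results match the Gene Length (1128 bp) and Protein Length (375 aa) rows.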
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.578295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACA CCGACGCGCC CCCCTTCACG CCCTGGGAGA ACCCCATGGG AACCGACGGC TTCGAATTCA TCGAATACGC GGCGCCGGAT CCGGTCGCGA TGGGCAAGGT GTTCGAGCGC ATGGGCTTCA AGGCCGTGGC GCGGCACCGC CACAAGAACG TGCTGCTGTA CCGCCAGGGC ACGATCAATT TCATCGTCAA CGCCGAGCCC GACTCCTTCG CGCAGCGCTT TGCGCGCGAA CACGGCCCGA GCGTCTGCGC CATCGCCTTC CGCGTGCAGG ACGCCAAGCA GGCCTACGAA CGCGCGATCT CGCTCGGCGC ATGGGGCTTT GCCGACAAGG CGGGTCCGGG CGAATTGAAC ATTCCCGCCA TCAAGGGCAT CGGCGACAGC CTGATCTACC TGGTCGACCG CTGGCCCGGC AAGAACGGCG CGAAGCCGGG CGACATCGGC AACATCGGCT TCTACGACGT CGACTTCGAA CCGCTGCCGG GCGTGGCCTC GCAGGATGCG CTGGCGCCCA AGGGCAACGG CCTGACCTAC ATCGACCATC TCACGCACAA TGTGTACCGC GGCCGCATGA ACGTCTGGGC CGGCTTCTAC GAGAAGCTCT TCAACTTCCG CGAGATCAAG TACTTCGACA TCGAAGGCCA GGTCACGGGC GTGAAGAGCA AGGCCATGAC CAGCCCCTGC GGCAAGATCC GCATCCCCAT CAATGAAGAG GGCAAGGAGC AGGCCGGCCA GATCCAGGAG TACCTGGACA TGTACCGCGG CGAAGGCATC CAGCACATCG CGATGGGCTC GGACAACCTG TACGAGACCG TCGACGCGCT GCGCGCCAAC GGCGTCACCC TGCTCGACAC CATCGACACC TACTACGAGC TGGTCGACAA GCGCATTCCC GGCCATGGCG AAAGCGTGGC CGAGCTGCAG AAGCGCAAGA TCCTGATCGA CGGCAAGAAG GACGCGCTGC TGCTGCAGAT CTTCAGCGAG AACCAGCTCG GCCCGATCTT CTTCGAGTTC ATCCAGCGCA AGGGCGACGA CGGTTTCGGC AACGGCAACT TCAAGGCGCT GTTCGAAAGC ATCGAGCTCG ACCAGATGCG CCGCGGCGTG CTGTCGGCCG CAAAATAA
|
Protein sequence | MSHTDAPPFT PWENPMGTDG FEFIEYAAPD PVAMGKVFER MGFKAVARHR HKNVLLYRQG TINFIVNAEP DSFAQRFARE HGPSVCAIAF RVQDAKQAYE RAISLGAWGF ADKAGPGELN IPAIKGIGDS LIYLVDRWPG KNGAKPGDIG NIGFYDVDFE PLPGVASQDA LAPKGNGLTY IDHLTHNVYR GRMNVWAGFY EKLFNFREIK YFDIEGQVTG VKSKAMTSPC GKIRIPINEE GKEQAGQIQE YLDMYRGEGI QHIAMGSDNL YETVDALRAN GVTLLDTIDT YYELVDKRIP GHGESVAELQ KRKILIDGKK DALLLQIFSE NQLGPIFFEF IQRKGDDGFG NGNFKALFES IELDQMRRGV LSAAK
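
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is a minimal, dependency-free illustration, not the database's own pipeline: the helper names are hypothetical, and the amino acid assignments are those of NCBI translation table 11, which shares its codon-to-residue mapping with the standard code and differs only in the permitted start codons. The 10-base grouping used in the record is removed before any computation.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Note: this sketch does not recode alternative start codons (e.g. GTG, TTG)
    as methionine; the gene in this record starts with ATG, so that case
    does not arise here.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGAGCCACA CCGACGCGCC CCCCTTCACG"
print(translate(first_groups))  # MSHTDAPPFT
```

The printed residues match the first 10-residue group of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence rows to be checked in the same way.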
|
| |