Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_3119 |
Symbol | |
ID | 4082705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 3269701 |
End bp | 3270792 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638011504 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_618155 |
Protein GI | 103488594 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACC TGTTCGAAAA CCCGCTGGGC CTCGACGGCT TCGAGTTCGT CGAGTTTTCG GCGCCCGAAA AGGGCATTCT CGAGCCTGTG TTCGAACGCA TGGGCTTTAC CCGGATCGCG CGCCACCGGT CGAAAGACGT GCAATTGTGG CGTCAGGGCG ACATCAATCT GATCGCCAAT TACGAACCCC GCTCTCCCGC TGCCTATTTC GCCGCCGAAC ATGGCCCGTC GGCGTGCGGC ATGGGGTGGC GCTGCCGCGA TGCGGCCAAA GCCTATGCGG AAGCGATCGA GCGCGGCGCC GAGCCGGTCG AAACGACTCC CGGCCCGATG GAACTGCGCC TGCCCGCGAT CCGCGGCATC GGCGGCTCGA TTATCTATCT GATCGACCGC TATGGCGACG ATCTCAGCAT CTACGACATC GACTTCGTTT ACGAGGAGGG CGTCGACCGC CATCCGGTCG GCGCGGGGCT GAAGGTCATC GATCACCTGA CGCACAATGT CTATGGCGGC CGCATGGCGC ATTGGGCGGC GTTCTACGAG CGCATCGCGG GTTTTCGCGA AATCCGCTAT TTCGACATCA AGGGTGAATA TACCGGCCTC ACGTCAAAGG CGATGACCGC GCCCGACGGC AAGATACGCA TTCCGCTGAA CGAGGAGGGC GCCGGCGGCG GCGGCCAGAT CGAGGAATAT CTGCGCGCCT ACAATGGCGA GGGTATCCAG CATATCGCCT TTGCCTGCGA CGACCTCTAC GCCGCGTGGG ACAGGCTGAA AGCGCTCGGC AACCCCTTCG CGCCATCGCC GCCCGACACC TATTATGAAA TGCTCGCCGA GCGCCTGCCC GGCCATGGCG AGCCGGTCGA GGAACTGAAA TCGCGCGGCA TATTGCTCGA CGGTTCGACG ACCGAGGGCG ATCCGCGCCT GCTGCTCCAG ATTTTCGGGC AGACGGTGAT CGGCCCGGTT TTCTTCGAGT TCATCCAGCG CAAGAAGGAC GAAGGCTTCG GCGAGGGCAA TTTCACCGCG CTGTTCAAGT CGATGGAACT CGACCAGATC CGCCGCGGCG CGCTCAACGT CGAAGCGGAG CCCGCCGAAT GA
|
Protein sequence | MADLFENPLG LDGFEFVEFS APEKGILEPV FERMGFTRIA RHRSKDVQLW RQGDINLIAN YEPRSPAAYF AAEHGPSACG MGWRCRDAAK AYAEAIERGA EPVETTPGPM ELRLPAIRGI GGSIIYLIDR YGDDLSIYDI DFVYEEGVDR HPVGAGLKVI DHLTHNVYGG RMAHWAAFYE RIAGFREIRY FDIKGEYTGL TSKAMTAPDG KIRIPLNEEG AGGGGQIEEY LRAYNGEGIQ HIAFACDDLY AAWDRLKALG NPFAPSPPDT YYEMLAERLP GHGEPVEELK SRGILLDGST TEGDPRLLLQ IFGQTVIGPV FFEFIQRKKD EGFGEGNFTA LFKSMELDQI RRGALNVEAE PAE
|
| |