Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smal_3739 |
Symbol | |
ID | 6474621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stenotrophomonas maltophilia R551-3 |
Kingdom | Bacteria |
Replicon accession | NC_011071 |
Strand | - |
Start bp | 4206429 |
End bp | 4207499 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642732940 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002030121 |
Protein GI | 194367511 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.368699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTCA CCACCTTCGA GAACCCGATG GGCATCGACG GCTTCGAGTT CGTCGAATTC GCCGCCCCGG CCGGCCGTGG CCAGGAGCTG CACGAGTACT TCCGGAAGAT GGGCTTCAGC GCGGTGCTCA AGCACAAGCA GCGTCCGATT ACCGTCTATC GCCAGGGCGA CGTCAACTTC CTGGTCAATG AAGACCCGGA TTCGTTCGCT TCGGACTTCG CCGAAAAGCA CGGCCCGTGC GCCTGCGGCT TCGCCATCCG CTTCAAGAAG CCGGGCCAGG AGGTCTACCA GACCGCGCTG GGCAACGGCG CCGAAGCCAT CGCCTTCAAG CCGGACAGCA AGGCGGTCAG CGCGCCGGTC ATCAAGGGCA TCGGCGACTG CATGCTGTAC CTGGTCGACC GCTACGGCAG CGCCGGCAGC ATCTTCGATG GCGACTACGA GCTGATCGCC GGCGCCGAAC TGCGCCCGAA GGGCTTCGGC TTGACCTTCA TCGACCACCT GACCCACAAC CTGTACTTCG GCAACATGCA GCAGTGGTCG GACTACTACG AGCGCCTGTT CAACTTCCGC GAGATCCGCT ACTTCGACAT CAAGGGCCTG AAGACCGGCC TGGTGTCCAA GGCGATGACC GCGCCGGACG GCATCGTGCG CATTCCGCTG AATGAATCGT CCGACCCGAA GAGCCAGATC AACGAGTACC TGGATGCGTA CAAGGGCGAA GGCATCCAGC ACATCGCCTG CTTCACCGAG AACATCTACG AGACCGTCGA AGCGATGCGT GCGCAGGGCG TGGACTTCCT CGACACTCCG GAGACCTACT TCGACGTGAT CGACCAGCGC GTGCCGAACC ACGGTGAAGA CGTGGCGCGC CTGGCCAAGA ACAAGATCCT GATCGACGCT GATCCGGAAA CCCACCAGCG CAAGCTGCTG CAGATCTTCA CCCAGAACTG CATCGGCCCG ATCTTCTTCG AGATCATCCA GCGCAAGGGC AACGAAGGCT TTGGCGAAGG CAACTTCACC GCGCTGTTCG AAAGCATCGA GCGCGACCAG ATCCGCCGCG GCGTGCTGTA A
|
Protein sequence | MQVTTFENPM GIDGFEFVEF AAPAGRGQEL HEYFRKMGFS AVLKHKQRPI TVYRQGDVNF LVNEDPDSFA SDFAEKHGPC ACGFAIRFKK PGQEVYQTAL GNGAEAIAFK PDSKAVSAPV IKGIGDCMLY LVDRYGSAGS IFDGDYELIA GAELRPKGFG LTFIDHLTHN LYFGNMQQWS DYYERLFNFR EIRYFDIKGL KTGLVSKAMT APDGIVRIPL NESSDPKSQI NEYLDAYKGE GIQHIACFTE NIYETVEAMR AQGVDFLDTP ETYFDVIDQR VPNHGEDVAR LAKNKILIDA DPETHQRKLL QIFTQNCIGP IFFEIIQRKG NEGFGEGNFT ALFESIERDQ IRRGVL
|
| |