Gene Information |
Locus tag | TM1040_0582 |
Symbol | |
ID | 4076147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 619950 |
End bp | 621050 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638005879 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_612577 |
Protein GI | 99080423 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
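The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (TM1040_0582)
length_bp = gene_length_bp(619950, 621050)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the sketch reproduces the listed Gene Length (1101 bp) and Protein Length (366 aa).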
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.764732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000507477 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGACCTT TCCCTCATGA TGCCCCCAAA TCGGTGATCA GCGCTGAGAA CCCGGCCGGA ACCGATGGAT TTGAATTTGT AGAGTTTGCC AGCCCGAACC CCGAAGAACT GCGCGAGCTC TTTGCCAAGA TGGGGTACGA GTTGGTCGGA CGTCACAAGA CCAAGCCGGG TATCGAGCTT TGGCAGCAGG GGGACATCAC CTACATCCTC AATGCCGAGA AAGGCTCTTT TGCGGAAAAG TTCGTTGAAC TTCACGGCCC CTGTGCCCCC TCGATGGGCT GGCGTGTGGT CGATGCGCAA AAGGCGTTTG AGCACGCGGT GGCCAAGGGG GCGGAGCCCT ATGAAGGCGA TGACAAAACA ATGGATGTGC CTGCAATCAA AGGGATTGGC GGCTCGCTCA TCTACTTCAT CGACCAGTAT TACGACACCT CGCCCTATAA CGAGGAATTC GAGTGGCTGA AGCAGTCCAA ACCGCGCGGC GTCGGTTTTT ATTACCTCGA TCACCTCACG CACAATGTCT TCAAAGGCAA CATGGACAAG TGGTTCCACT TTTATGGCGA CCTGTTCAAC TTCAAGGAAA TCCGGTTCTT TGACATTCAG GGCAAGTATA CCGGCCTCTT CAGCCGTGCC TTGACCTCGC CTTGCGGCCG CATTCGCATT CCGATCAACG AGGACCGTGG CGAGACCGGG CAGATCGTTG CCTATCTCAA GAAGTACAAT GGCGAAGGCA TCCAGCACAT CGCTGTGGGC GCGCGTGACA TCTATGATGC CACTGACGAG ATCTCCGAAC GTGGCATCCA GTTCATGCCG GCCCCGCCTG CAACCTATTA CGACATGAGC CACGACCGTG TCCAAGGCCA CGAAGAGCCG CTGGATCGTA TGAAAAAGCA CGGCATCCTC ATCGACGGCG AAGGCGTGGT GGACGGGGGC GAGACACGCA TCCTGCTGCA GATCTTCTCA AAAACGGTGA TCGGGCCGAT CTTCTTTGAG TTCATCCAGC GCAAAGGCGA TGACGGCTTT GGCGAGGGCA ACTTCAAGGC GCTCTTTGAA TCGATCGAAC AGGAGCAAAT CAACAACGGT GAAATCTCCG CTGCCGAGTG A
|
Protein sequence | MGPFPHDAPK SVISAENPAG TDGFEFVEFA SPNPEELREL FAKMGYELVG RHKTKPGIEL WQQGDITYIL NAEKGSFAEK FVELHGPCAP SMGWRVVDAQ KAFEHAVAKG AEPYEGDDKT MDVPAIKGIG GSLIYFIDQY YDTSPYNEEF EWLKQSKPRG VGFYYLDHLT HNVFKGNMDK WFHFYGDLFN FKEIRFFDIQ GKYTGLFSRA LTSPCGRIRI PINEDRGETG QIVAYLKKYN GEGIQHIAVG ARDIYDATDE ISERGIQFMP APPATYYDMS HDRVQGHEEP LDRMKKHGIL IDGEGVVDGG ETRILLQIFS KTVIGPIFFE FIQRKGDDGF GEGNFKALFE SIEQEQINNG EISAAE
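The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, assuming translation table 11 assigns codons to amino acids as the standard genetic code does (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the gene are used.

```python
# Codon table built from the conventional TCAG ordering used in NCBI table listings
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record
first_blocks = "ATGGGACCTT TCCCTCATGA TGCCCCCAAA"
print(translate(first_blocks))  # MGPFPHDAPK
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the complete protein, with the final TGA codon ending translation.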