Gene Information |
Locus tag | TM1040_3467 |
Symbol | |
ID | 4075101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 491473 |
End bp | 492453 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638004976 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase HpaD |
Protein accession | YP_611701 |
Protein GI | 99078443 |
COG category | [R] General function prediction only |
COG ID | [COG2514] Predicted ring-cleavage extradiol dioxygenase |
TIGRFAM ID | [TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
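
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the unspliced CDS ends in a single stop codon. A minimal Python sketch of that check follows; the values are copied from the table, while the function names and the coordinate convention are assumptions made for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 491473
END_BP = 492453


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))
```

With the table's coordinates this gives 981 bp and 326 aa, matching the listed Gene Length and Protein Length.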
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATTC CCGCCGCAAA TCTTTATCCG CCCTTCAACA TCACCCGCCT CAGCCATGTG GAATACGGCG TCACCGACCT TGCGGCCTCC CGCACCTTCT ACGTCGACAT CCTCGGCCTG CAGGTCACTC ATGAGGACGA CAGCCGGATT TATCTGCGCG CCATGGAAGA ACGCGGCCAC CACTGCATCA TCCTGCGCCA ATCCGACCAC GCGGGCGTCG CGTGCCTCGG CTTCAAACTC TATGACACGC CGGATCTTGA GAAGGCCGCC GCCTTTTTCG AGGGCAAAAG CCTGCCGGTG GAGTGGATCG AACGCCCCTT CATGGGGCCG AGCTTGCGCA CCCGCGATCC ATGGGGTGTG CCGCTGGAGT TCTACGTCAA GATGGACCGC CTCCCGCCGA TACATCAGCA GTACAGGCTC TATAATGGCG TGAAACCCCT CCGCATCGAC CACTTCAACA TGTTTTCGGC CAATGTCGAC GCGGCGGTGG CCTTCTACGG CGAGATGGGG TTTCGCGTCA CCGAATATAC CGAGGATGAC GACTCCGGCC GCGTCTGGGC AGCCTGGATG CACCGCAAGG GCGGCGTGCA TGATGTGGCC TTCACCAATG GAACCGGCCC GCGTCTGCAT CACACCGCCT TTTGGGTACC AACCCCGCTC AACATCATCG ACCTCCTCGA TCTGATGTCG ACCACCGGCT ATGTCGCCAA TATCGAACGC GGCCCCGGCC GCCACGGCAT TTCCAACGCG TTCTTCCTCT ATGTGCGCGA CCCCGACGGC CACCGGATCG AAATCTATTG CTCGGACTAT CAGACCTGCG ATGCGGATCT GGAGCCGATC AAATGGTCCC TCACCGACCC GCAGCGCCAG ACCCTCTGGG GCGCACCCGC ACCGCGCAGC TGGTTCGAAG AAGGCTCCCT GTTCGACGGG GCCGAAACAC GCGACAGTGA TCTCAAAGCA CAACCGATCA TCGCGCCGTA A
|
Protein sequence | MPIPAANLYP PFNITRLSHV EYGVTDLAAS RTFYVDILGL QVTHEDDSRI YLRAMEERGH HCIILRQSDH AGVACLGFKL YDTPDLEKAA AFFEGKSLPV EWIERPFMGP SLRTRDPWGV PLEFYVKMDR LPPIHQQYRL YNGVKPLRID HFNMFSANVD AAVAFYGEMG FRVTEYTEDD DSGRVWAAWM HRKGGVHDVA FTNGTGPRLH HTAFWVPTPL NIIDLLDLMS TTGYVANIER GPGRHGISNA FFLYVRDPDG HRIEIYCSDY QTCDADLEPI KWSLTDPQRQ TLWGAPAPRS WFEEGSLFDG AETRDSDLKA QPIIAP
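
The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to translate it and to compute GC content in plain Python. The helper names `clean`, `translate` and `gc_percent` are hypothetical. The codon table is the standard one, whose codon-to-residue assignments translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Standard residue assignments in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    print(translate("ATGCCCATTC CCGCCGCAAA TCTTTATCCG"))
```

The first three blocks translate to MPIPAANLYP, the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.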