Gene Information |
Locus tag | RPC_0891 |
Symbol | |
ID | 3969810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 985316 |
End bp | 986662 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637924007 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_530780 |
Protein GI | 90422410 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
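
The start, end, gene length, and protein length listed above agree with one another under the usual conventions. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a final codon that is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 985316
end_bp = 986662

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # the stop codon encodes no amino acid

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1347 bp) and Protein Length (448 aa).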
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.669112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCA CCACCGCTCC CGGGCTGATC GGCCGCAGCA CGCAAGCCAT CACGCCCGGC TATATGTCGG GCTTCGGCAA TTCGTTCGAG ACCGAGGCGC TGCCCGGCGC GCTGCCGATC GGGCGCAACT CGCCGCAGCG CGCGCCCTAC GGGCTTTACG CCGAGCAATT GTCGGGCTCG CCGTTCACCG CGCCGCGCGG CTCTAACGAG CGCTCCTGGC TGTATCGCAT CCGCCCCTCG GTGCAGCACT CCGGCCGCTT CGAGAAAGCC GAGGCCGGGC TGTGGCGCTC CGCACCCTGC CATGAGCACG ACATGCCGAT CGCGCAATTG CGCTGGGACC CGCCGCCGCT GCCGCAGCGC GCGCAGACCT TTCTGCAGGG CGTCGAGACC ATGACCACGG CGGGCGACGT CAATACGCAA GCCGGCATGG CGGCGCATAT GTATTTGATC AGCGCCTCGA TGGTGAACCA GCATTTCTAC AATGCCGACG GCGAATTGAT GTTCGTGCCG CAGCAGGGTG GCTTGCGCCT CGTCACCGAA TTCGGCGTGA TCGGCGTCGC GCCCGGCGAG ATCGCGGTGA TTCCGCGCGG CGTCAAGTTT CGCGTCGAGC TGATCGACGG GCCGGCGCGC GGCTATCTGT GCGAGAATTA CGGCGGCGGC TTCACGCTGC CGGAGCGCGG CCCGATCGGG GCCAATTGCC TTGCGAACGC ACGCGACTTC CTCACGCCGG TCGCGGCTTA TGAAGATAGC GACACGCCGA CCGAGCTCTA CGTCAAATGG GGCGGCGCGC TGTGGGTGAC GCAGTTGCCG CATTCGCCGA TCGACGTGGT GGCCTGGCAC GGCAACTACG CGCCGTACAA ATATGATCTG CGCACCTTCT CGCCGGTCGG CGCGATCGGC TTCGATCATC CCGATCCGTC GATCTTCACC GTGCTGACCT CGCCCTCGGA GACCGCCGGC ACCGCCAATA TCGACTTCGT GATCTTCCCG GAGCGCTGGA TGGTGGCGGA GAACACCTTC CGCCCGCCGT GGTATCACAT GAACATCATG TCGGAATTCA TGGGGCTGAT TTATGGCGTG TACGACGCCA AGCCGCAGGG CTTTCTGCCC GGCGGCGCCT CGCTGCACAA CATGATGCTG CCGCACGGTC CGGACCGCGA GGCGTTCGAT CACGCGTCGA ACGCCGAGCT GAAGCCGGTG AAGCTCGAAG GCACCTTGGC CTTCATGTTC GAGACCCGCT ATCCGCAGCG CGTCACCGTG CACGCCGCGA CTTCCAGCAC GCTGCAGGCC GACTACGCTG AGTGCTGGCG CGGGTTGCAA AAGCGCTTCG ATCCGACCAA ACCCTGA
|
Protein sequence | MNITTAPGLI GRSTQAITPG YMSGFGNSFE TEALPGALPI GRNSPQRAPY GLYAEQLSGS PFTAPRGSNE RSWLYRIRPS VQHSGRFEKA EAGLWRSAPC HEHDMPIAQL RWDPPPLPQR AQTFLQGVET MTTAGDVNTQ AGMAAHMYLI SASMVNQHFY NADGELMFVP QQGGLRLVTE FGVIGVAPGE IAVIPRGVKF RVELIDGPAR GYLCENYGGG FTLPERGPIG ANCLANARDF LTPVAAYEDS DTPTELYVKW GGALWVTQLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLIYGV YDAKPQGFLP GGASLHNMML PHGPDREAFD HASNAELKPV KLEGTLAFMF ETRYPQRVTV HAATSSTLQA DYAECWRGLQ KRFDPTKP
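
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing and run basic sanity checks on a coding sequence, such as its GC fraction and whether it ends in a stop codon of translation table 11. The helper names (`clean`, `gc_fraction`, `looks_like_cds`) are hypothetical, and checking only for an ATG start is a simplification, since table 11 permits other start codons.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# Stop codons of translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, table 11 stop at the end."""
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used only as a short excerpt.
excerpt = "ATGAACATCA CCACCGCTCC"
print(round(gc_fraction(excerpt), 2))
print(looks_like_cds("ATG AAC TGA"))
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the GC content listed in the record.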