Gene Information |
Locus tag | RPD_1018 |
Symbol | |
ID | 4021493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1156630 |
End bp | 1157976 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961209 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_568157 |
Protein GI | 91975498 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
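The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start/End values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields for RPD_1018.
start_bp = 1156630
end_bp = 1157976
gene_length_bp = 1347
protein_length_aa = 448


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1347
print(expected_protein_length(gene_length_bp))  # 448
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.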
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.266165 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCA ACGCCGCGCC TGAGATCGTC GGCCGTGCTT CGCAGGGCGT CACGCCGGGC TACATGTCCG GCTTCGGCAA TTCGTTCGAG ACCGAGGCGC TGCCCGGCGC GTTGCCGGTC GGCCGCAATT CGCCGCAGCG TGCGGCCTAT GGGCTCTATG CCGAGCAGTT GTCCGGCTCG CCCTTCACCG CGCCGCGCGG CGCCAATGAG CGCTCGTGGC TGTATCGCAT CCGACCGTCG GTGAAGCATT CCGGCCGGTT CGCGAAAGCC GATATGGGGT TGTGGCGCTC GGCGCCTTGC CTCGAACACG ACATGCCGAT CGCTCAGCTC CGGTGGGATG CGCCGCCGAT GCCGACCGAG GAGGTGACCT TCGTGCAGGG CGTGCGGACG ATGACCACGG CCGGCGATGT GAACACCCAA GCCGGCATGG CCGCGCATAT GTACCTGATC AGCCGGTCGA TGGTCGATCA GCATTTCTAC AATGCCGATG GCGAGCTGAT GTTCGTGCCG CAGCAAGGTC GATTGCGGCT CGTCACCGAA TTCGGCGTGA TCGCGATCGA GCCGGCCGAG ATCGCGGTGA TCCCGCGCGG CGTCAAGTTC CGCGTCGAGC TGGTCGATGG TCCGGCGCGC GGTTATCTCT GCGAGAATTA CGGCGGCGCG TTCACCCTGC CGGAGCGCGG CCCGATCGGC GCCAATTGCC TCGCCAATTC GCGCGATTTC CTGACGCCGG TCGCATCCTA CGAGGACAAG GACACGCCGA CCGAGCTGTT CGTGAAATGG GGCGGGGCGC TGTGGCGGAC GAGTTTGCCG CATTCGCCGA TCGACGTGGT CGCCTGGCAC GGCAACTACG CGCCGTATAA ATACGATCTG CGAACGTTCT CGCCGGTCGG CGCGATCGGC TTCGACCATC CCGATCCGTC GATCTTCACC GTGCTGACCT CGCCGTCGGA GACCGCGGGC ACGGCGAATA TCGACTTCGT GATCTTTCCC GAGCGCTGGA TGGTGGCGGA AAACACCTTC CGCCCGCCCT GGTATCACAT GAACATCATG TCGGAGTTCA TGGGGCTGAT CTACGGCGTC TATGACGCCA AGCCGCAGGG CTTCGCTCCG GGCGGCGCGA GCCTGCACAA CATGATGCTG CCGCACGGGC CGGATCGCGA AGCGTTCGAT CATGCGTCGA ACGGCGAGCT GAAACCGGTC AAGCTCACCG GCACGATGGC CTTCATGCTG GAGACCCGCT ATCCGCAGCG CGTCACCGAA TACGCGGCGA CCGCCGACAC CTTGCAGGAT GACTACGCCG ATTGCTGGCG CGGCCTCGAG AAGCGCTTCG ATCCGAGCCG GCCATGA
|
Protein sequence | MNVNAAPEIV GRASQGVTPG YMSGFGNSFE TEALPGALPV GRNSPQRAAY GLYAEQLSGS PFTAPRGANE RSWLYRIRPS VKHSGRFAKA DMGLWRSAPC LEHDMPIAQL RWDAPPMPTE EVTFVQGVRT MTTAGDVNTQ AGMAAHMYLI SRSMVDQHFY NADGELMFVP QQGRLRLVTE FGVIAIEPAE IAVIPRGVKF RVELVDGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF LTPVASYEDK DTPTELFVKW GGALWRTSLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLIYGV YDAKPQGFAP GGASLHNMML PHGPDREAFD HASNGELKPV KLTGTMAFML ETRYPQRVTE YAATADTLQD DYADCWRGLE KRFDPSRP
|
| |
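The gene and protein sequences above can be compared by translating the DNA. The sketch below is a hedged illustration, not the database's own pipeline: it assumes the sequence is pasted in the 10-base blocks shown above, and it uses the standard codon-to-amino-acid mapping, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAACGTCA ACGCCGCGCC TGAGATCGTC"))  # MNVNAAPEIV
```

The first ten residues produced this way match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.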