Gene Information |
Locus tag | RPB_0907 |
Symbol | |
ID | 3909087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1044169 |
End bp | 1045515 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882800 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_484529 |
Protein GI | 86748033 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
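The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon. The record does not state either convention explicitly. The helper names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1044169, 1045515)
print(length, protein_length(length))  # prints: 1347 448
```

Under these assumptions the computed values match the Gene Length (1347 bp) and Protein Length (448 aa) fields.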
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.78575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA ACGCCGCACC GCAGATCATT GGCCACGGTT CGCAGGGCGT CACGCCCGGC TACATGTCGG GCTTCGGCAA TTCGTTCGAA ACCGAAGCTC TCCCCGGCGC GCTGCCGATC GGCCGCAACT CGCCGCAGCG CGCGGCTTAC GGCCTCTATG CCGAGCAATT ATCAGGCTCG CCGTTCACCG CGCCGCGCGG CGCCAATGAG CGAAGCTGGC TGTATCGCAT CCGCCCCTCG GTGAAGCACT CCGGCCGCTT TACCAAGGCG GACATGGGCC TGTGGCGCTC GGCGCCGTGT CTCGAATACG ACATGCCGAT CGCGCAGCTG CGCTGGGACG CGCCGTCGAT GCCGCAGGAG GATCTGACGT TCCTGCAAGG CGTGCGAACG ATGACGACCG CCGGCGATGT GAATACGCAG GCCGGCATGG CGACGCATAT GTATCTGATC ACCCAATCGA TGGTCGATCA GCATTTCTAC AATGCCGACG GTGAATTGAT GTTCGTGCCG CAGCAGGGCA GCCTGCGGCT GGTCACGGAA TTCGGCGTCA TCAGCATCGA GCCCGCCGAA ATCGCGGTGA TCCCGCGCGG CGTCAAGTTT CGCGTCGAAC TGGTCGACGG CCCGGCGCGC GGCTATTTGT GTGAGAATTA CGGCGGCGCC TTCACGCTGC CGGAGCGCGG CCCGATCGGC GCCAATTGCC TGGCCAATTC GCGCGATTTC CTGACGCCGG TGGCGGCCTA TGAGGACAGG GACGTGCCGA CCGAATTGTT CGTGAAATGG GGCGGGGCGC TGTGGCAGAC CACGCTGCCG CATTCGCCGA TCGATGTGGT CGCGTGGCAT GGCAACTACG CGCCGTACAA ATACGATCTG CGCACCTTCT CGCCGGTCGG CGCGATCGGC TTCGATCATC CCGATCCGTC GATCTTCACC GTGCTGACGT CGCCGTCGGA AACCGCCGGC ACCGCCAATA TAGACTTCGT GATCTTCCCC GAGCGCTGGA TGGTGGCGGA AAACACCTTC CGGCCGCCCT GGTACCACAT GAATATCATG TCGGAGTTCA TGGGGTTGAT CTGCGGCGTC TACGACGCCA AGCCGCAGGG CTTCGTCCCC GGCGGCGCGT CGCTGCACAA CATGATGCTG CCGCACGGGC CGGATCGCGA GGCGTTCGAT CATGCCTCGA ACGGCGAGCT GAAGCCGGTG AAACTCACCG GCACGATGGC CTTCATGTTC GAGACCCGCT ATCCGCAGCG CGTCACCGAA TATGCCGCGA CCGCCGGCAC GCTGCAGGAC GACTACGCCG ATTGCTGGCG CGGCCTGGAG AAGCGCTTCG ACCCGAGCCG GCCATGA
|
Protein sequence | MNINAAPQII GHGSQGVTPG YMSGFGNSFE TEALPGALPI GRNSPQRAAY GLYAEQLSGS PFTAPRGANE RSWLYRIRPS VKHSGRFTKA DMGLWRSAPC LEYDMPIAQL RWDAPSMPQE DLTFLQGVRT MTTAGDVNTQ AGMATHMYLI TQSMVDQHFY NADGELMFVP QQGSLRLVTE FGVISIEPAE IAVIPRGVKF RVELVDGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF LTPVAAYEDR DVPTELFVKW GGALWQTTLP HSPIDVVAWH GNYAPYKYDL RTFSPVGAIG FDHPDPSIFT VLTSPSETAG TANIDFVIFP ERWMVAENTF RPPWYHMNIM SEFMGLICGV YDAKPQGFVP GGASLHNMML PHGPDREAFD HASNGELKPV KLTGTMAFMF ETRYPQRVTE YAATAGTLQD DYADCWRGLE KRFDPSRP
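As a rough illustration of how the Sequence fields relate to the GC content and Translation table entries, the following sketch strips the 10-character block spacing, computes a GC fraction, and translates codons. It is a minimal example, not the pipeline that produced this record. The codon table is built from the standard NCBI amino-acid string, whose assignments translation table 11 shares (table 11 differs only in which codons may act as starts). The function names and the 30-nt test prefix are choices made here for illustration.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-nt blocks of the gene sequence in this record.
prefix = "ATGAATATCA ACGCCGCACC GCAGATCATT"
print(translate(prefix))  # prints: MNINAAPQII
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would be the natural way to compare against the GC content and Protein Length fields.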