Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1175 |
Symbol | |
ID | 6408831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 1245750 |
End bp | 1247108 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711073 |
Product | Phthalate 4,5-dioxygenase |
Protein accession | YP_001990190 |
Protein GI | 192289585 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.769494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCC AGGAACAGAA CGATCTGATC ACCCGGGTCG GGCCGGGGAC GCCGTGCGGC AAGCTGATGC GCGCCTATTG GCAGCCGGCC GCGCTCGTCG ACGAGCTCGA AGGCGAGCGG CCGATCAAGC CGGTGCGACT GCTCGGCGAA GACCTCGTGC TGTTCAAGGA CGAGACCGGC CGCTACGGCC TGATCGATCG CGACTGTCCG CACCGCGGCG CCGATCTCGC GTTCGGGCGG CTGGAGAACG GCGGCCTGCG CTGCGCCTTC CACGGTTGGC TGTTCGACGT CGACGGCAAG TGCATCGACA CCCCCGCCGA GCCTGCCGGC TCGCCGCTGT GCAAGAACAT CAAGCAGCGC GCGTTTCCGG TCGTCGCCAA GGGCGGCATC CTGTGGGCCT ATCTCGGCGC GGGCGAACCG CCGGCGTTTC CGGAGATCGA TTGCTTCATC GCCCCCGACA CCCATGTGTT CGCGTTCAAG GGCCTGATGG AATGCAACTG GCTGCAGGCG CTCGAGGTCG GCATCGATCC GGCGCACGCC TCGTTCCTGC ACCGCTTCTT CGAGGATGAG GACACCTCGC AGGCCTACGG CAAGCAATTC CGCGGTGCCT CGGCCGGCAG CGATCTGCCG ATGACCAAGG TGCTGCGCGA ATACGATCGC CCGATCATCA ATGTCGAGCA CACCGAATAC GGCTTGCGGC TGATCGCGCT ACGCGAGATC GACGACGAAC GCACCCATGT TCGCGTCACC AATCAGCTGT TCCCGCACGG CTTCGTCATC CCGATGAGCA CAGAGATGAC GATCACGCAA TGGCACGTGC CGGTCGACGA CACCCACTGC TATTGGTATG CGATCTTCAC CAGCTACGCC GCGCCGGTCG ATAAGGTGAA GATGCGCGAC CAGCGCCTCG AGCTCTACGA GTTGCCGGAC TACAAGTCTC GCCGCAACAA GACCAACGAT TACGGCTTCG ATCCGCACGA GCAGGCGACC GCGACCTACA CCGGCATGGG GCTGGACATC AACGTCCACG ATCAGTGGGC GGTGGAGTCG ATGGGCGCGA TCCAGGACCG CACCCGCGAG CATCTCGGCC AGTCCGACAA GGCGATCATT CAGTATCGCC GGCTGCTGCG TCAGGAAATC GAGAAGGCCG CCTCCGGTGG CAAGCCGTTG CTGGCGCTCG ACGAGGCCGC GGCGCGCGCG ATCCAGGGAC CGGCCACGAT GGACGGCATC GGCCCGAGCC GCGGCTGGGA GACCTATTGG ATGGAGGTCG ACGTCAAGCG TCGCCGCGGT GCGCCCTGGG CGGCACCGGT GCCGTCCGAG ATCGCCGCCA AGGTCCCGCA TCTGACGGCC GCAGAATGA
|
Protein sequence | MMSQEQNDLI TRVGPGTPCG KLMRAYWQPA ALVDELEGER PIKPVRLLGE DLVLFKDETG RYGLIDRDCP HRGADLAFGR LENGGLRCAF HGWLFDVDGK CIDTPAEPAG SPLCKNIKQR AFPVVAKGGI LWAYLGAGEP PAFPEIDCFI APDTHVFAFK GLMECNWLQA LEVGIDPAHA SFLHRFFEDE DTSQAYGKQF RGASAGSDLP MTKVLREYDR PIINVEHTEY GLRLIALREI DDERTHVRVT NQLFPHGFVI PMSTEMTITQ WHVPVDDTHC YWYAIFTSYA APVDKVKMRD QRLELYELPD YKSRRNKTND YGFDPHEQAT ATYTGMGLDI NVHDQWAVES MGAIQDRTRE HLGQSDKAII QYRRLLRQEI EKAASGGKPL LALDEAAARA IQGPATMDGI GPSRGWETYW MEVDVKRRRG APWAAPVPSE IAAKVPHLTA AE
|
| |