Gene Information |
Locus tag | RPB_4009 |
Symbol | |
ID | 3911816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4574992 |
End bp | 4575903 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885913 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_487613 |
Protein GI | 86751117 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0408] Coproporphyrinogen III oxidase |
TIGRFAM ID | |
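The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the final codon of the CDS is a stop codon. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(4574992, 4575903)
print(length)                     # 912
print(protein_length_aa(length))  # 303
```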
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.586573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00557292 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACCATCG ACACCGACCT CGACCGCACG CCCGCTGTTT CCGATGACCC GGTGGAAGCC CGACGGACGC AGGCACGCGC CTGGTTCGAG GGCCTGCGCG ATCGGATCTG CGCCGAGATC GAAACGCTGG AACGCGAGGC GCCGGCCGCA TTGTTTCCCG GCGAGCCCGC GACCTTCAGC TACAAGCCGT GGCAGCGCAA AACCGGCGCC GGCGGCGGCG TCGGCGGTTT CCTCTCGGGC GGCCGGCTGT TCGAGAAGAT CGGCATCCAC ACCTCGTCGG CCAACGGCAC GCTGACGCCG GAGATGGCGA AGAATCTGCC CGGCGACGGC GTCACGCGCG ACTACGTCTC GACCAGCATC AGCCTGATCA TGCATCCGCG CAGTCCGCGG GTCCCCACCG TGCACATGAA CACCCGGTTC CTGTCGACGT CGCAGGGCTG GTTCGGCGGC GGCGCCGATC TGACGCCGAT GCTGCCCGAG CAGCGCAGCC AGGACGCCGA GGACGCGGTG ATGTTCCACG CCGCCATGAA GGCCGCCTGC GACGCCCACG ATCCGGCTTA CTACGCGAAA TTCAAGCCCT GGGCCGACAC TTACTTCTTC CTGCCGCATC GCGGCACCGC GCGCGGCGTC GGCGGCATCT TCTACGATCA CCTGAACAGC GGCGATTTCG AGCGCGACTT TGCCTTCACC CGCGACGTCG GCGCGGCATT GCTCGAGATC TATCCGAGGA TCGTGCGCAA GCGCATGGTC GAGCCGTGGA CCGAGCAGGA GCGGGCGCAG CAACTCGCCT GCCGCGGGCT CTATGTCGAG TTCAACCTGT TGTACGACAA GGGCACGATG TTCGGCCTGC AGACCGGCGG CAACACCGAG ACCATCATGA GCTCGATGCC GCCGCTGGTG AGCTGGAGCT GA
Protein sequence | MTIDTDLDRT PAVSDDPVEA RRTQARAWFE GLRDRICAEI ETLEREAPAA LFPGEPATFS YKPWQRKTGA GGGVGGFLSG GRLFEKIGIH TSSANGTLTP EMAKNLPGDG VTRDYVSTSI SLIMHPRSPR VPTVHMNTRF LSTSQGWFGG GADLTPMLPE QRSQDAEDAV MFHAAMKAAC DAHDPAYYAK FKPWADTYFF LPHRGTARGV GGIFYDHLNS GDFERDFAFT RDVGAALLEI YPRIVRKRMV EPWTEQERAQ QLACRGLYVE FNLLYDKGTM FGLQTGGNTE TIMSSMPPLV SWS
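The sequences above are printed in space-separated groups of ten. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. The helper names are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares; alternative start codons are not handled. The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-letter groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups of the gene sequence from the record.
first_groups = "ATGACCATCG ACACCGACCT CGACCGCACG"
print(translate(first_groups))  # MTIDTDLDRT
```

Passing the full gene sequence to `translate` and comparing the result with `clean_sequence` of the protein sequence gives a quick consistency check on the record.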