Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4512 |
Symbol | |
ID | 3912328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5100183 |
End bp | 5101169 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637886415 |
Product | molybdopterin dehydrogenase, FAD-binding |
Protein accession | YP_488106 |
Protein GI | 86751610 |
COG category | [C] Energy production and conversion |
COG ID | [COG1319] Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.195104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.684454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCTT TCGATTATTC CCGGGCCGGC GATGTCGCCG AAGCGCTGCG GGCCGGCGCC GGCGTGCAGA CCAAGTTTCT CGGCGGCGGC ACCAATCTGA TCGACCTGAT GCGGGAAACG ATCGAGCGCC CGGCGGCGCT GGTCGATATC ACCGGTCTGC CGGCCGAGAT CACCGCGCGC GACGACGGCG GCCTGCTGAT CGGCGCGGCG GTGCGCAACA CCGCGCTGGC CGAGCACCGC GCCGTGCGGA CGCGCTATCC GATGCTGTCG CGCGCCATTC TGGCGGGCGC CTCGGCGCAG ATCCGCAACA TGGCGACGGT CGGCGGCAAT CTGCTGCAGC GGACGCGCTG CGCTTACTTC TACGACGATG CCGGCTCGCG CTGTAACAAG CGCCAACCGG GGCAAGGCTG CGATGCCATC GACGGCTTCA ACCGCAACCA CGCCATCCTC GGCGCGTCGG AGTCGTGCGT GGCGACGCAT CCGTCGGACA TGTGCGTGGC GCTCGCCGCG CTCGATGCGG TGGTGCATCT CGCCGGCAGC GGCGGCCAGC GCACGTTGCC GTTCAACGAC GTCCATCGGC TGCCGGGCGA TCGGCCGGAC CTCGAAACCA TGCTGCGGCC CGGCGAACTG ATCACCGCGA TCGAACTGCC GGCGCAGCCG ATCGCGGCGC GCTCGACCTA TCGCAAGGTG CGCGACCGCT CGAGCTACGC GTTCGCGCTG GTCTCGGTCG CGGCCGCGGT GGAGGTCGAG CGCGGCAGCG TCAAGGATTT GCGGCTGGCG CTCGGCGGCG TCGCGCACAA GCCGTGGCGC GCGCACAAGG CCGAGCAGGC GCTGCGCGGC GGTCCCGCCA CGGTCGAGGC GTTTCGCGCC GCGGCCGAGG CCGAACTCGC CGACGCCGTG CCGCTTCGCG ACAACGGCTT CAAGATCGAA CTGGCAAAGC GCACCATTAT TGCCGTGCTC GGCGAACTGG CAGGAGTGGC CCGATGA
|
Protein sequence | MTPFDYSRAG DVAEALRAGA GVQTKFLGGG TNLIDLMRET IERPAALVDI TGLPAEITAR DDGGLLIGAA VRNTALAEHR AVRTRYPMLS RAILAGASAQ IRNMATVGGN LLQRTRCAYF YDDAGSRCNK RQPGQGCDAI DGFNRNHAIL GASESCVATH PSDMCVALAA LDAVVHLAGS GGQRTLPFND VHRLPGDRPD LETMLRPGEL ITAIELPAQP IAARSTYRKV RDRSSYAFAL VSVAAAVEVE RGSVKDLRLA LGGVAHKPWR AHKAEQALRG GPATVEAFRA AAEAELADAV PLRDNGFKIE LAKRTIIAVL GELAGVAR
|
| |