Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3112 |
Symbol | |
ID | 3836558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 3588701 |
End bp | 3590299 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827227 |
Product | 4-cresol dehydrogenase (hydroxylating) |
Protein accession | YP_428194 |
Protein GI | 83594442 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGAGG ATCGGGTCTT GACCCTTCAC GCCTTCCTTG AGGGCGCGCG CGCCCTTCTT CCCCCGGCCG CCATTCTTTT CGGCGAGGCG GTGACCCCTT TTCTTTCCTC GACCAGCGGC CATCGCCGGC GGGTGCCGCT GCTCGTGCGG CCGACAAACA CCGGCGAGGT TCAGGCCCTC GCCCGCCTTG CCACCACCTG CGCGGTGACG CTCTATCCGA TCAGCAAGGG GCGCAACTGG GGCCTGGGCT CGCGGCTGCC CGGCTGCGAG GATTGCGTGA TCCTCGATCT GGGCGGGCTC GACCGCATCA ATGCCATCGA TGAACGGTTC GGCATCGCCG TCATCGAACC CGGGGTGACG CAGGCCGCCC TGGCCGACGC TCTGGCGGCG CGGGGCTCGG CCTTCTTCCT TGATGTGACC GGATCGGGGC GCGAGACCAG CGTGCTTGGC AATACCCTGG AACGCGGGGT GGCCTATAAT TCGCTGCGCG CCGAATTGGT GCAGTCGCTC GAGGTGGTCC TGGCCGATGG CAGCTTGCTG CGCACCGGCT TCGCCCATTA TCCGACCAGC CGCCTGGGCG GGCTCAGCCG CTTCGCCCCC GGGCCCGATC TGAGCGGGCT GTTCGTTCAA TCCAACCTGG GCATCGTCGT TGGCGGAGCG GTGGCGCTGT TGCCAAGGCC CGAGCGCCAG ATGACCTTCA TGGTCTCGGT GAAGGACGAG GCGCGGCTTC CCGCGTTCTT CGACGCCCTG CGGGCCCTGC GCCGCGAGGG AACCCTGAGC AGCGTCGTTC ATGTCGGCAA CCGTCGGCGC AGCGAAATCA CCCTGACGCC GCTGGTCCAT GCCGAGATGG CGGCGCGCGG GCGCGATCCG ACGCGGGCCG AGGCCCAAAA GCTGACCGAT CGCTTCCTGA CCGGCCGCTG GAGCGCCATC GGCTCGGTGA TGGGGCCGGC CGCCCAGGTT CGGGTAGCGC GCCGCCGCAT CGCCCGGGCT TTGGGCGGCC TTGGCGCCGT GCGCTTCCTG TCGCCGGGCT TTCGCCGCTT CGCCAAGGCG CTCAGCGCCC GGATCCCGGG GCTGGGCGAC GTCAATTGTT TCCTCTGCGC CGTCGATCCC TTGCTTGACC TCACCAGCGG CCGGCCGACC AACGCGGCGC TGCACAGCAC CTATTGGCCC CATGCCGATC AATCCGAGGC GGCCGCCGAT CCCGATCGCG GGCCGGGCGG CATCGTCTTC GCCGCCCCCG TGGTGCCCTT GGACGGCGCG GCGGTGCGCG AGGCCGTCGA CCTGACCTAT GAGCTTTGCC GCGCCCACGG CTTCGAAGCG GCGATCACCC TTAATCTGAT GAACGACCGC ACCCTGGAAG GCGTGGTCAG CATCGATTTT CGCCGCGACG ACCCCGAGAA CTTGGCCGCC GCCCATCGCT GCCTGCGGGC GCTCAACCAG GGTTATGTGG AAAACGGCTT CACCCCCTAT CGGGTCGATA TCGATTCGAT GGATCTGGTC GTCGATCCGG CCGATCCCTT CTGGGCCACG GTGTCGCGCC TCAAGCAGGC CCTGGATCCG GCGGGGGTGG TGGCGCCGGG CCGCTATTGC CCGCCATGA
|
Protein sequence | MLEDRVLTLH AFLEGARALL PPAAILFGEA VTPFLSSTSG HRRRVPLLVR PTNTGEVQAL ARLATTCAVT LYPISKGRNW GLGSRLPGCE DCVILDLGGL DRINAIDERF GIAVIEPGVT QAALADALAA RGSAFFLDVT GSGRETSVLG NTLERGVAYN SLRAELVQSL EVVLADGSLL RTGFAHYPTS RLGGLSRFAP GPDLSGLFVQ SNLGIVVGGA VALLPRPERQ MTFMVSVKDE ARLPAFFDAL RALRREGTLS SVVHVGNRRR SEITLTPLVH AEMAARGRDP TRAEAQKLTD RFLTGRWSAI GSVMGPAAQV RVARRRIARA LGGLGAVRFL SPGFRRFAKA LSARIPGLGD VNCFLCAVDP LLDLTSGRPT NAALHSTYWP HADQSEAAAD PDRGPGGIVF AAPVVPLDGA AVREAVDLTY ELCRAHGFEA AITLNLMNDR TLEGVVSIDF RRDDPENLAA AHRCLRALNQ GYVENGFTPY RVDIDSMDLV VDPADPFWAT VSRLKQALDP AGVVAPGRYC PP
|
| |