Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3875 |
Symbol | |
ID | 5210857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4850074 |
End bp | 4851294 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640597470 |
Product | putative oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_001278178 |
Protein GI | 148657973 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATA ATCAACCCGG CGCAGACGTG CGCCACCTGT ACATCCATAT TCCGTTCTGT CATCGCCGCT GTGCGTACTG TGATTTTAAC ACGTATGCGA ATATGGAAGA CCGCATGGAG GCGTATGTGA CAGCGCTCTG CGCCGAACTG CGCTCACACG CGCCGCTGAG CCGCGCCGTC GCTCCGCCGC TTCCGATGAC TGCCGATCTG ACCCGCGCGA TGCTGCGTCC GACGATCTTC CTCGGCGGCG GCACACCCAG TATGCTGCCG GACGCATTGA TGGCGCGCAT ACTGTCCGCT GCCGACGCGA TTGTGCCGCT GGACAGCGCC GAGGTGACGG TCGAATGCAA TCCGGGAACG GTGCTGGCGC GCGATTATCT GCGCGCCCTG CGGGATCTGG GGGTGAACCG GATCAGCCTG GGGGTTCAAA GTCTCCACGA TCCGACGTTG CGTGTGCTGG GACGCATCCA TACTGCGGCT GAGGCGTATG CCTCGTTCAA CGATGCGCGC GCTGCCGGGT TCGAGAGTAT TAATCTGGAT TTCATCTTCG GATTGCCGGG GCAGACGGTG GAGCAGTGGG AGGAGACGCT GCGCGAGATC GTGACCTGGG GAGCCGATCA TTTCGCTCTC TATGCGCTGA TTCTGGAGGA GCGCACGCCA CTCTACGCGC AGGTGATCAG TGGGCGGGTC ACGGTTCCTG ACGATGATGT CACGGCGGTG ATGTACGAAT GCGCGCTTGA GCATTTTGCC GCTGCCGGAT ATGTGCAGTA CGAGATCAGC AACTGGGCGC GCACCGATGA TCCGTCGTCG CCTGTGCCGA CGCACGCCTG TCACCATAAT CTGGCATACT GGCTCAACGC CGATTATCTG GCAGCTGGCG CTGGCGCTCA CGGGCATCGC TACCCGCAGC GGTATGTCAA CGTGATGGGG ATCGACGACT ATATTGCGCG CGTGTCCGCC GGTGAGTCGC CGGTGGCGGA GATCACGGCG TTGACACCGC GCGATCTGGC GGCAGAGACG ATGTTCATGG GGTTGCGGTT GAATGTTGGG GTGAGCGCGA CCCATTTTCG TGATCGCTGC GGGGTGGAGA TGGATGCCGT GTTTGGGGCG GAACTGGCGG AACTGGCGGA ACTGGGGTTG ATCGAGCGCG ACGAACGCGG CGTGCGCCTG ACCAGCCGCG GACGGATGAT CGGCAATCGG GTATTTGAGC GGTTTGTGTA A
|
Protein sequence | MSNNQPGADV RHLYIHIPFC HRRCAYCDFN TYANMEDRME AYVTALCAEL RSHAPLSRAV APPLPMTADL TRAMLRPTIF LGGGTPSMLP DALMARILSA ADAIVPLDSA EVTVECNPGT VLARDYLRAL RDLGVNRISL GVQSLHDPTL RVLGRIHTAA EAYASFNDAR AAGFESINLD FIFGLPGQTV EQWEETLREI VTWGADHFAL YALILEERTP LYAQVISGRV TVPDDDVTAV MYECALEHFA AAGYVQYEIS NWARTDDPSS PVPTHACHHN LAYWLNADYL AAGAGAHGHR YPQRYVNVMG IDDYIARVSA GESPVAEITA LTPRDLAAET MFMGLRLNVG VSATHFRDRC GVEMDAVFGA ELAELAELGL IERDERGVRL TSRGRMIGNR VFERFV
|
| |