Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4891 |
Symbol | sorA |
ID | 4041753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1554681 |
End bp | 1555913 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637980312 |
Product | sulfite:cytochrome c oxidoreductase subunit A |
Protein accession | YP_587022 |
Protein GI | 94313813 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.935481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0507087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAATC GAGTCCGGCG ACGCCGGTTT CTCAAGCAGG CTGGCGCGTT GACGGCCAAT GCGGTCGTCG GCACCACCGC CCTGAACGCC CTCGCCGCCA GCGGCGACAA GCAGGTCCAG CTCCCCTTCG ACAACGGCGC GCGCGAGCTT GTGGCGTTTC CGCAGAAGCG GCCGCTGATC CTGCTGACCA GCCGGCCGCC GCAGTTGGAA ACCCCCTTTA GCGTGTTCAA CGAGGGGATT CTGACGCCTA ACGACGCGTT TTTCGTGCGT TATCACTGGA GCGGCATTCC GACCTCGGTC GACCCACAGA CCTATCGGCT CCGCATTGGC GGCAAGGTGA AAACGCCGCT CGAGCTATCA CTGGCCGACA TCAAGGCCAT CGCGCAGCCG ATGGAAGTCG TCGCCGTCAA CCAGTGCTCG GGCAACAGCC GTGGCTTTCT TGCTCCGCGC GTCAATGGCG GCCAGCTCGC CAACGGCGCC ATGGGCAACG CGCGCTGGAC CGGCGTGCCG CTGAAGGCCG TGCTCGAACG AGCCGGCGTG CAACCCGGCG CTGTCCAGGT CAGCTTCAAT GGCCTCGACC GGCCACCACT CGACAACAGC CCCGACTTCG TCAAATCGCT CGATATCGAC CACGCGCTCG ATGGCGAGGT CATGCTGGCG TGGTCCATGA ATGGCGAAGA CTTGCCGATG CTCAATGGCT ACCCGCTGCG GCTGGTGGTG CCGGGCCACT ACGGTACGTA CTGGGTGAAA CACCTGTCCG ATATCACCGT TCTCGACAAA CCGTTCGACG GTTTCTGGGT ACAGAGCGCC TATCGCATTC CCGACAACGC CGGGGCCAAT GTCGAACCCG GCTCGGCACC GGCACGCACA CGCCCGATCG GGCGCTTCAA CGTGCGCTCG TTCATCACCA GCCTGACCGA AGGCGCGACG GTCCCGTCCG GACGAGAGAC CGTGGTAAGA GGCATTGCGT TCGACGGCGG CTACGGTATC GCCGAAGTCG CATTCTCGGC AGACGGCGGC CGAAGCTGGA CGGAAGCCAC GCTCGGCCAG GATCTCGGCA AGTACTCGTT CCGCGAATGG CGCGCCGCAT TCCGCCCGCC ACGCAAGGGT GCCTATGACC TCCGGGTGCG CGCGGTCAAT CGTATCGGCC AGAGCCAGCC GATGACGGCG TTGTGGAATC CGGCCGGCTA CATGCGCAAC GTGGTGGAAA CCACGCGCGT CAAGGCAGTC TGA
|
Protein sequence | MENRVRRRRF LKQAGALTAN AVVGTTALNA LAASGDKQVQ LPFDNGAREL VAFPQKRPLI LLTSRPPQLE TPFSVFNEGI LTPNDAFFVR YHWSGIPTSV DPQTYRLRIG GKVKTPLELS LADIKAIAQP MEVVAVNQCS GNSRGFLAPR VNGGQLANGA MGNARWTGVP LKAVLERAGV QPGAVQVSFN GLDRPPLDNS PDFVKSLDID HALDGEVMLA WSMNGEDLPM LNGYPLRLVV PGHYGTYWVK HLSDITVLDK PFDGFWVQSA YRIPDNAGAN VEPGSAPART RPIGRFNVRS FITSLTEGAT VPSGRETVVR GIAFDGGYGI AEVAFSADGG RSWTEATLGQ DLGKYSFREW RAAFRPPRKG AYDLRVRAVN RIGQSQPMTA LWNPAGYMRN VVETTRVKAV
|
| |