Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4957 |
Symbol | |
ID | 6412649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5337024 |
End bp | 5338319 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714840 |
Product | oxidoreductase molybdopterin binding |
Protein accession | YP_001993921 |
Protein GI | 192293316 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA AGCCCACATC CGACGTCCTC AACCGCCGCC GGTTTCTCGG CGCGGCGGGC CTTGGAGTAG CCGGTCTCGC CGGTGCTGGG TCGATGCTGC CCTCGCTCGC GGCCAAAGCC AGTGAGGCGG CCAAGCCCGA TCCGGCGATC ACCGAGATCA AGGATTGGAA TCGCTATCTC GGCGACGGCG TCGACAAGCG TCCCTATGGC GTGCCCTCGA AATTCGAGAA GGACGTGATC CGCCGCGACG TGGCGTGGCT CACCGCGTCG CCGGAGTCCT CGGTCAACTT CACACCGCTG CACGCGCTCG ATGGCATCAT CACCCCGTCC GGCCTGTGCT TCGAACGGCA TCACGGCGGC GTTGCCGAGA TCGATCCGGC GCAGCACCGG CTGATGATCC ATGGCCTGGT TGACACCCCG CTGGTGTTCA CTATGGACGA CATCAAGCGG ATGCCGCGCG TCAACAAGAT CTACTTCCTG GAATGCGCGG CGAACTCCGG CATGGAGTGG CGCGGCGCGC AGCTCAACGG CTGCCAGTTC ACCCACGGCA TGATCCACAA CGTGATGTAC ACCGGCGTCA CGCTGAAGAC GCTGCTCGAG CAGGCCGGCG TGAAGTCCAA CGCCAAATGG TTGCTGCTCG AAGGCGCTGA CTCTGCCGGG ATGGATCGGT CGCTGCCGCT GGAGAAGGCG CTCGACGACG TCATGATCGC CTATGCGATG AACGGCGAGG CGCTGCGTCC GGAGAACGGC TATCCGCTGC GCGCCGTGAT CCCCGGTTGG CAGGGCAATC TGTGGGTGAA GTGGCTGCGC CGGATCGAAG TCGGCGATAT GCCGTGGCAG ACCCGCGAAG AGACCTCGAA GTACACCGAC CTGATGCCGG ACGGCCGCGC GCGCAAGCAT ACGTTCGTGA TGGACGCCAA GAGCGTGATC ACCAGCCCGT CGCCGCAGAT GCCGCTGAAG TTCAAGGGCC GCAACGTGCT CACCGGCATT GCCTGGTCCG GGCGCGGCAC CGTCAAGCGC GTCGACGTCT CGATGGACGG CGGACGCAAC TGGTACGAAG CGCGGATCGA CGGCCCGGTG CTGAACAAAT CGATCGTGCG GTTCTACGTC GACTTCGACT GGAACGGCGA AGAGCTGATG CTGCAATCGC GCGCGATTGA CGAGACAGGC TACGTGCAGC CGACCAAGGC GGAGCTGCGC AAGATCCGCG GCGTCAATTC CGTGTACCAC AACAACGGCA TCCAGACCTG GCTCGTGCAT CCCGACGGAG TGACCGAAAA TGTCGAAATC GCTTAA
|
Protein sequence | MSEKPTSDVL NRRRFLGAAG LGVAGLAGAG SMLPSLAAKA SEAAKPDPAI TEIKDWNRYL GDGVDKRPYG VPSKFEKDVI RRDVAWLTAS PESSVNFTPL HALDGIITPS GLCFERHHGG VAEIDPAQHR LMIHGLVDTP LVFTMDDIKR MPRVNKIYFL ECAANSGMEW RGAQLNGCQF THGMIHNVMY TGVTLKTLLE QAGVKSNAKW LLLEGADSAG MDRSLPLEKA LDDVMIAYAM NGEALRPENG YPLRAVIPGW QGNLWVKWLR RIEVGDMPWQ TREETSKYTD LMPDGRARKH TFVMDAKSVI TSPSPQMPLK FKGRNVLTGI AWSGRGTVKR VDVSMDGGRN WYEARIDGPV LNKSIVRFYV DFDWNGEELM LQSRAIDETG YVQPTKAELR KIRGVNSVYH NNGIQTWLVH PDGVTENVEI A
|
| |