Gene Information |
Locus tag | RPC_4056 |
Symbol | |
ID | 3969305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4506287 |
End bp | 4507594 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927160 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_533901 |
Protein GI | 90425531 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
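The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the numbers in this record. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count implied by a CDS length, optionally excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - (1 if has_stop_codon else 0)


# Values taken from the record for RPC_4056.
length_bp = gene_length(4506287, 4507594)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Both results match the Gene Length (1308 bp) and Protein Length (435 aa) fields.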
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.326439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAA AGCTTCATCC CGATACGCTC GCGCTTCACG CCGGCTGGCG CGCCGACCCG GCCACCGGCT CAGTCGCGGT GCCGATCTTT CAGACCACCT CGTACCAGTT CCACAACACC GAGCACGCCG CCAACTTGTT CGCGCTGAAG GAACTCGGCA ACATCTACAC GCGGATCGGC AACCCGACCA ATGACGTGCT GGAGCAGCGC GTGGCGGCGC TTGAAGGCGG CGTCGCGGCG CTCGCGGTGT CGTCGGGCCA AGCCGCTTCG GCGTTCTCGC TGCAGAATCT TGCCCGGGTC GGCGACAACG TCGTCAGTTC CACCGACCTC TATGGCGGCA CCTGGAATCT GTTCGCCAAC ACGCTGAAGG ACCAGGGCAT CGAAGTGCGC TTCGTCGACC CGGCGGATCC CGAAGCCTTC GCCCGCGCCA CCGACGATCG CACCCGCGCC TACTACGCCG AAACCCTGCC GAACCCGAAG CTGGCGGTGT TTCCGATCGC CGAAGTCGCG GCGATCGGCC GCAAGTTCGG CATTCCGCTG ATCGTCGACA ACACCGCCGC CCCGTTGCTG GTGCGTCCGT TCGATCATGG CGCGGCGGTC GTGGTGTATT CGGCCACCAA ATATCTCAGC GGCCACGGCA CCTCGATCGG CGGCCTGATC GTCGACGGCG GCAATTTCGA CTGGGAGAAA TTCCCGGAAC GCCAGCCGGC GCTGAATACG CCCGATCCGA GCTATCACGG CGCGGTCTGG GTCGAGGCGG TCAAGCCGAT CGGCCCGGTC GCCTACATCA TCAAGGCCCG CACCACGCTG TTGCGCGACA TCGGCTCGGC GCTGTCGCCG TTCAACGCGT TCCAGATCAT TCAGGGCCTT GAAACCCTGC CCTTGCGCAT CGAGCGCCAC GTGCAGAACG CGCAAGCCGT CGCCGACTTC CTGGAGAAGC GCCCCGAGGT CACCAAGGTG ATCCATCCCT CCAAGTTGAC CGGGGTCGCC CGCGAGCGCG CCGACAAATA TCTCAACGGC AAGTTCGGCG GCCTGGTCGG CTTTGAACTC GCCGGCGGCA AGGAGGCCGG GCGCAAATTC ATCGACGCGC TGCAGCTGCT GTACCACGTC GCCAATATCG GCGATGCTCG CAGTCTGGCG ATCCATCCGG CCTCGACCAC GCATTCGCAG CTCTCGGTCG AGGACCAACT CGCCACCGGC GTGTCGGACG GCTACGTGCG GCTGTCGGTC GGCCTCGAGC ACATCGACGA CATCATCGCC GATCTGGAAA CCGGCCTTGC CGCGGGACGT CTGGCCGCCG CCGCGTAA
|
Protein sequence | MTRKLHPDTL ALHAGWRADP ATGSVAVPIF QTTSYQFHNT EHAANLFALK ELGNIYTRIG NPTNDVLEQR VAALEGGVAA LAVSSGQAAS AFSLQNLARV GDNVVSSTDL YGGTWNLFAN TLKDQGIEVR FVDPADPEAF ARATDDRTRA YYAETLPNPK LAVFPIAEVA AIGRKFGIPL IVDNTAAPLL VRPFDHGAAV VVYSATKYLS GHGTSIGGLI VDGGNFDWEK FPERQPALNT PDPSYHGAVW VEAVKPIGPV AYIIKARTTL LRDIGSALSP FNAFQIIQGL ETLPLRIERH VQNAQAVADF LEKRPEVTKV IHPSKLTGVA RERADKYLNG KFGGLVGFEL AGGKEAGRKF IDALQLLYHV ANIGDARSLA IHPASTTHSQ LSVEDQLATG VSDGYVRLSV GLEHIDDIIA DLETGLAAGR LAAAA
|
| |
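The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the method the source database used. It relies on the fact that NCBI translation table 11 assigns the same amino acid to every codon as the standard code (the two tables differ only in which start codons are permitted), and since this gene starts with ATG no special start-codon handling is included. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGACCAGAA AGCTTCATCC CGATACGCTC"
print(translate(prefix))  # MTRKLHPDTL
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.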