Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5071 |
Symbol | |
ID | 6412765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5457734 |
End bp | 5459038 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714956 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_001994035 |
Protein GI | 192293430 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.177033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAACG AGACGCTTGC CATCCACGCC GGCTACGAGC CCGATCCGAC CACCCATGCG GTTGCGGTGC CGATCTATCA GACTGCGTCC TACGCATTCG ACAGCGCCGA CCACGGCGCG GCGTTGTTCA ATCTCGAGAC CGAAGGCTAT CGCTATTCTC GGATCGCCAA TCCGACCACA AGCGTGCTGG AAAAGCGCGT TGCTGAGCTG GAAGGCGGCG TCGGCGCCCT GGCGGTGGCG AGCGGGCAGG CGGCGCTGCA TTTCGCCTTC GTCAATCTCG CCGATCACGG CGGCAACATC GTCTCGGTGC CGCAGCTCTA TGGCACCACG CATACGCTGC TGTCGCACAT CCTGCCGCGA CAGGGCATCA CTGGCCGCTT CGCTGCCAGT GACAAGCCAG ACGACATCGC CAAGCTTATC GATGAGGGCA CCCGTGCGGT GTTCTGCGAA ACCATCGGCA ATCCGGCCGG CAATGTCTGC GACATCGAAG CGATCGCCGA CGTGGCGCAT CGCGCCGGCG TGCCGCTGAT CGTCGACAAT ACGGTAGCGA CGCCGATCCT GTTCAAGCCG ATCGCGTATG GTGCCGATGT CGTGGTGCAC TCGCTCACCA AGTTCCTCGG CGGCCACGGT ACCACGCTCG GCGGCGCCAT CGTCGACAGC GGACGATTCG ACTGGGCCAA GCACCCCGAG CGGTTTCCGG CATTCAACCA GCCGGACCAC TCCTATCACG GCATGGTCTA TGCGGAGCGG TTTGGGCCGA CAGCTTACGT TGAGCGCGCG CGGTCGATCT ATCAGCGCAC CATGGGATCC GTGTTGTCGC CGTTCAACGC CTTTCTGCTG CTGCAGGGCA TCGAGACAGT AGCGCTGCGG ATGGAGCGCC ACGTCGAGAA CGCCCGCAAA GTCGCCGAAT TCCTGCGCGA CGATCCGCGC GTTGCCTGGG TCAATTACAC CGGCTTCCCG GACAGCCCGT ATTATCCGCT GGTGCAGAAG TATCTCGACG GCCGCGCGTC GTCGCTGTTC ACCTTCGGCA TCAAGGGTGG CATGGAAGCC GGCAAGGCGT TCTACGATTC GCTCAAGCTG ATCACCCGGC TGGTGAACAT CGGTGACGCC AAGTCGCTCG CGTGCCACCC GGCGTCGACC ACCCATCGCC AGATGTCGGC CGAGCAGCAG CGTCAGGCCG GAGTTTTGCC GGAGACGATC CGGCTGTCGA TCGGCATTGA ACACATCGCC GACATCATCG AGGATCTCGA TCAGGCGCTC GCGCAAGCCT GCGGTTCGCA GCCGCGTCTG GCGGCGGCCG AATAG
|
Protein sequence | MRNETLAIHA GYEPDPTTHA VAVPIYQTAS YAFDSADHGA ALFNLETEGY RYSRIANPTT SVLEKRVAEL EGGVGALAVA SGQAALHFAF VNLADHGGNI VSVPQLYGTT HTLLSHILPR QGITGRFAAS DKPDDIAKLI DEGTRAVFCE TIGNPAGNVC DIEAIADVAH RAGVPLIVDN TVATPILFKP IAYGADVVVH SLTKFLGGHG TTLGGAIVDS GRFDWAKHPE RFPAFNQPDH SYHGMVYAER FGPTAYVERA RSIYQRTMGS VLSPFNAFLL LQGIETVALR MERHVENARK VAEFLRDDPR VAWVNYTGFP DSPYYPLVQK YLDGRASSLF TFGIKGGMEA GKAFYDSLKL ITRLVNIGDA KSLACHPAST THRQMSAEQQ RQAGVLPETI RLSIGIEHIA DIIEDLDQAL AQACGSQPRL AAAE
|
| |