Gene Rpal_5071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5071 
Symbol 
ID6412765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5457734 
End bp5459038 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content64% 
IMG OID642714956 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001994035 
Protein GI192293430 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.177033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAACG AGACGCTTGC CATCCACGCC GGCTACGAGC CCGATCCGAC CACCCATGCG 
GTTGCGGTGC CGATCTATCA GACTGCGTCC TACGCATTCG ACAGCGCCGA CCACGGCGCG
GCGTTGTTCA ATCTCGAGAC CGAAGGCTAT CGCTATTCTC GGATCGCCAA TCCGACCACA
AGCGTGCTGG AAAAGCGCGT TGCTGAGCTG GAAGGCGGCG TCGGCGCCCT GGCGGTGGCG
AGCGGGCAGG CGGCGCTGCA TTTCGCCTTC GTCAATCTCG CCGATCACGG CGGCAACATC
GTCTCGGTGC CGCAGCTCTA TGGCACCACG CATACGCTGC TGTCGCACAT CCTGCCGCGA
CAGGGCATCA CTGGCCGCTT CGCTGCCAGT GACAAGCCAG ACGACATCGC CAAGCTTATC
GATGAGGGCA CCCGTGCGGT GTTCTGCGAA ACCATCGGCA ATCCGGCCGG CAATGTCTGC
GACATCGAAG CGATCGCCGA CGTGGCGCAT CGCGCCGGCG TGCCGCTGAT CGTCGACAAT
ACGGTAGCGA CGCCGATCCT GTTCAAGCCG ATCGCGTATG GTGCCGATGT CGTGGTGCAC
TCGCTCACCA AGTTCCTCGG CGGCCACGGT ACCACGCTCG GCGGCGCCAT CGTCGACAGC
GGACGATTCG ACTGGGCCAA GCACCCCGAG CGGTTTCCGG CATTCAACCA GCCGGACCAC
TCCTATCACG GCATGGTCTA TGCGGAGCGG TTTGGGCCGA CAGCTTACGT TGAGCGCGCG
CGGTCGATCT ATCAGCGCAC CATGGGATCC GTGTTGTCGC CGTTCAACGC CTTTCTGCTG
CTGCAGGGCA TCGAGACAGT AGCGCTGCGG ATGGAGCGCC ACGTCGAGAA CGCCCGCAAA
GTCGCCGAAT TCCTGCGCGA CGATCCGCGC GTTGCCTGGG TCAATTACAC CGGCTTCCCG
GACAGCCCGT ATTATCCGCT GGTGCAGAAG TATCTCGACG GCCGCGCGTC GTCGCTGTTC
ACCTTCGGCA TCAAGGGTGG CATGGAAGCC GGCAAGGCGT TCTACGATTC GCTCAAGCTG
ATCACCCGGC TGGTGAACAT CGGTGACGCC AAGTCGCTCG CGTGCCACCC GGCGTCGACC
ACCCATCGCC AGATGTCGGC CGAGCAGCAG CGTCAGGCCG GAGTTTTGCC GGAGACGATC
CGGCTGTCGA TCGGCATTGA ACACATCGCC GACATCATCG AGGATCTCGA TCAGGCGCTC
GCGCAAGCCT GCGGTTCGCA GCCGCGTCTG GCGGCGGCCG AATAG
 
Protein sequence
MRNETLAIHA GYEPDPTTHA VAVPIYQTAS YAFDSADHGA ALFNLETEGY RYSRIANPTT 
SVLEKRVAEL EGGVGALAVA SGQAALHFAF VNLADHGGNI VSVPQLYGTT HTLLSHILPR
QGITGRFAAS DKPDDIAKLI DEGTRAVFCE TIGNPAGNVC DIEAIADVAH RAGVPLIVDN
TVATPILFKP IAYGADVVVH SLTKFLGGHG TTLGGAIVDS GRFDWAKHPE RFPAFNQPDH
SYHGMVYAER FGPTAYVERA RSIYQRTMGS VLSPFNAFLL LQGIETVALR MERHVENARK
VAEFLRDDPR VAWVNYTGFP DSPYYPLVQK YLDGRASSLF TFGIKGGMEA GKAFYDSLKL
ITRLVNIGDA KSLACHPAST THRQMSAEQQ RQAGVLPETI RLSIGIEHIA DIIEDLDQAL
AQACGSQPRL AAAE