Gene RoseRS_1758 details

Gene Information

Locus tag: RoseRS_1758
Symbol:
ID: 5208715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2174373
End bp: 2175659
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 60%
IMG OID: 640595364
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_001276098
Protein GI: 148655893
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
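
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2174373, 2175659)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both values match the Gene Length and Protein Length fields reported for this locus.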


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00831826
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.279429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACG AAATCCGTTT CACCGGCTTC GAGACCCTGG CGCTCCACGC GGGGCAGCAA 
CCCGACCCGA CCACCGGCGC ACGCGCCGTT CCGATCTATC AGACGACATC GTATCAGTTC
AAGGATACCG ATCACGCTGC GCGGTTGTTT GGGTTGCAGG AGTTCGGCAA TATCTATACC
CGGATTATGA ACCCGACGAC CGATGTGCTT GAACAGCGCA TCGCCGCACT CGAAGGCGGC
GTCGGCGCTC TGGCGCTGGC ATCGGGGCAG GCGGCGGAAA CGCTGGGCAT CCTGAATGTA
GCAGGCGCCG GTGATAATAT CGTCTCATCG AGCGACCTGT ACGGCGGCAC CTACAATCTG
TTCCGCCATA CGTTCCCGAA ACTCGGCATT ACGACTCGCT TCGTCGATGC CCGCGATCAC
GAAGGGTTTC GCCGGGCAAT CGATGATCGC ACGAAACTGG TCTTCCTCGA ACTGGTCGGC
AACCCGCGTC TGGATATTGT CGATCTGCAA ACCATCGCCA ATATCGCCCA CGAACACGGC
GTGGCGGTGA TGGTCGACTC GACCACTGCA ACCCCATATC TGTGCCGCCC GTTCGAGTGG
GGCGCCGATA TTGTCGTCCA CTCTGCGACC AAATACCTCG GCGGGCACGG GACCAGCATC
GCCGGTCTGC TGGTCGATAG CGGGAAGTTC GACTGGACCA ATGGGCGCTA CCCTGAGTTT
ACGACCCCCG ATCCTTCCTA TCACGGTCTG GTGTATACAC AGGCGTTCGG CAACCTGGCA
TACATCTTGA AGGTGCGGGT GCAGTTGCTG CGCGATATTG GCGCATGCCT CAGCCCGTTC
AACTCCTTCC TGCTGCTCCA GGGCATCGAG ACACTGGGAT TGCGGATGGA GCGTCACAGT
CAGAATGCGC TGGCAGTGGC CCAGTTCCTC AAAGAGCACA GCAAGGTGGA ATGGGTGCTG
TACCCCGGAC TTCCGGAGCA TCCGAGTTAT GCGCTGGCGC AGAAGTATAT GCCCAGGGGA
CAGAGCGGTA TTGTCGGGTT CGGGATCAAA GGCGGGCGCG CAGCCGGTGC AAAGTTCATC
AACAGCCTGC GCCTGTTCTC GCACCTGGCG AATATCGGCG ATGCCAGGAG TCTTGCGATT
CATCCAGCCA GCACGACCCA CAGCCAGTTG ACGCCCGATG AGCAGCGGCT CACCGGCGTC
ACCGACGATT TTGTGCGTCT GTCGGTCGGG ATTGAGACCA TCGACGATAT TATCGCCGAC
CTGGATCAGG CGCTGGCAAA AGTGTAA
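
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation. The following is a minimal sketch of how the blocks could be cleaned up and how a GC fraction could be computed for comparison with the GC content field (reported as 60% in this record). The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage: paste the full gene sequence block in place of this short excerpt.
excerpt = clean_sequence("ATGACAGACG AAATCCGTTT\nCACC")
print(excerpt)       # ATGACAGACGAAATCCGTTTCACC
print(len(excerpt))  # 24
```

Applied to the full sequence, `len(clean_sequence(...))` should agree with the Gene Length field, and `round(100 * gc_fraction(...))` can be compared with the reported GC content.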
 
Protein sequence
MTDEIRFTGF ETLALHAGQQ PDPTTGARAV PIYQTTSYQF KDTDHAARLF GLQEFGNIYT 
RIMNPTTDVL EQRIAALEGG VGALALASGQ AAETLGILNV AGAGDNIVSS SDLYGGTYNL
FRHTFPKLGI TTRFVDARDH EGFRRAIDDR TKLVFLELVG NPRLDIVDLQ TIANIAHEHG
VAVMVDSTTA TPYLCRPFEW GADIVVHSAT KYLGGHGTSI AGLLVDSGKF DWTNGRYPEF
TTPDPSYHGL VYTQAFGNLA YILKVRVQLL RDIGACLSPF NSFLLLQGIE TLGLRMERHS
QNALAVAQFL KEHSKVEWVL YPGLPEHPSY ALAQKYMPRG QSGIVGFGIK GGRAAGAKFI
NSLRLFSHLA NIGDARSLAI HPASTTHSQL TPDEQRLTGV TDDFVRLSVG IETIDDIIAD
LDQALAKV
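
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it allows; because this gene begins with ATG, the sketch does not handle alternative start codons. The `translate` function is illustrative and is not the tool used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases (e.g. N) raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACAGACG AAATCCGTTT CACCGGCTTC"))  # MTDEIRFTGF
# The last 9 bases end in the TAA stop codon.
print(translate("AAAGTGTAA"))  # KV
```

Both fragments agree with the start (MTDEIRFTGF) and end (KV) of the protein sequence listed above.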