Gene Rcas_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2040 
Symbol 
ID5539518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2613951 
End bp2615237 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID640894175 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001432146 
Protein GI156742017 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0425815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG ACATCCGTTT CACCGGCTTC GAGACGCTGG CGCTCCACGC AGGACAGCAA 
CCCGACCCTG CCACCGGCGC ACGCGCGGTG CCGATCTATC AGACGACCTC GTATCAGTTC
AAGGATACCG ATCACGCGGC GCGGTTGTTC GGATTGCAGG AATTTGGGAA CATCTACACC
CGCATCATGA ACCCGACGAC CGACGTGCTT GAGCAGCGCA TTGCGGCGCT CGAAGGCGGC
GTCGGCGCGC TGGCGCTCTC ATCGGGGCAG GCGGCGGAGA CGCTGGGTAT CCTGAATGTG
GCAAGCGCCG GCGATAACAT CGTATCGTCG AGCGACCTGT ATGGCGGCAC TTACAATCTC
TTCCGCCATA CATTGCCCAA ACTCGGCATT ACGACGCGCT TCGTCGATGC GCGCGATCAC
GAGGGCTTCC GCAAAGCCAT CGATGATCGC ACCAAACTGG TCTTTCTCGA ACTGGTCGGC
AATCCACGCC TGGATATTGT CGATCTCCAA ACGATTGCCA CTATTGCGCA CGAGCGTGGC
GTGGCGGTGA TGGTCGACTC GACGACGGCA ACCCCCTATT TGTGCCGTCC GTTCGAGTGG
GGCGCCGACA TTGTCATTCA TTCCGGCACG AAGTACCTGG GCGGGCATGG CACGAGCATT
GCCGGTCTGC TGGTCGATAG CGGCAGATTC GACTGGACGA ACGGGCGCTA CCCGGAGTTC
ACCACGCCGG ATCCGTCTTA CCACGGGCTG GTCTACACAC AGGCGTTCGG CAACCTGGCG
TATATTCTGA AAGTGCGGGT GCAGTTGCTG CGCGACATCG GCGCATGCCT CAGCCCGTTC
AATTCTTTCC TGCTCCTTCA GGGTATTGAA ACACTGGGGC TGCGCATGGA GCGCCATAGC
CAGAATGCGC TGGCAGTGGC GCAGTTTCTC AAAGAGCACA GCAAGGTGGA GTGGGTGCTG
TACCCTGGTC TGCCGGACCA CCCCAGTTAT GCCCTGGCGC AGAAATATAT GCCGAAAGGT
CAGAGCGGCA TCCTCGGCTT TGGGATTCGT GGCGGGCGCG CGGCCGGCGC GACGTTTATC
AATAGTCTGC GCCTCTTCTC GCACCTGGCG AATATCGGCG ATGCCAAGAG CCTTGCCATC
CATCCCGCCA GCACGACTCA CAGCCAGTTG ACACCCGAAG AGCAGCGGCT TACCGGCGTC
ACCGACGATT TTGTGCGCCT GTCGGTGGGC ATCGAAACGA TTGACGACAT CATCGCCGAC
CTGGATCAGG CGCTGGCGAA GGTGTAG
 
Protein sequence
MSDDIRFTGF ETLALHAGQQ PDPATGARAV PIYQTTSYQF KDTDHAARLF GLQEFGNIYT 
RIMNPTTDVL EQRIAALEGG VGALALSSGQ AAETLGILNV ASAGDNIVSS SDLYGGTYNL
FRHTLPKLGI TTRFVDARDH EGFRKAIDDR TKLVFLELVG NPRLDIVDLQ TIATIAHERG
VAVMVDSTTA TPYLCRPFEW GADIVIHSGT KYLGGHGTSI AGLLVDSGRF DWTNGRYPEF
TTPDPSYHGL VYTQAFGNLA YILKVRVQLL RDIGACLSPF NSFLLLQGIE TLGLRMERHS
QNALAVAQFL KEHSKVEWVL YPGLPDHPSY ALAQKYMPKG QSGILGFGIR GGRAAGATFI
NSLRLFSHLA NIGDAKSLAI HPASTTHSQL TPEEQRLTGV TDDFVRLSVG IETIDDIIAD
LDQALAKV