Gene Rcas_3473 details

Gene Information

Locus tag: Rcas_3473
Symbol:
ID: 5540972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4533113
End bp: 4534231
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 64%
IMG OID: 640895591
Product: cystathionine gamma-synthase
Protein accession: YP_001433541
Protein GI: 156743412
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
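
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4533113, 4534231)
print(length_bp, protein_length(length_bp))
```

With the values listed for Rcas_3473 this gives 1119 bp and 372 aa, which agrees with the Gene Length and Protein Length fields.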


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.400432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.010715
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCCCG AAACTGCATC CATTCACGCC GGTCAGGAAA TCGACCCAAC GACGGGCGCG 
GTCATTCCGC CGATCTACCT GACGACAACC TTCGAGCGCG CGTCGGACGG GAGTTTTCCG
CGCGGGTACA TCTACACCCG CAATGGCAAC CCTAACCGCG CAATGCTCGA AACTTGCCTC
GCGGCGCTTG AGGGCGGCGC AACATGTGTG GCGTTCAGTT CCGGTCTCGC GGCAGCCATG
AGCGTCTTTC AGGCGCTGCG TCCCGGTGAT CACGTGATCG CTCCCGACGA CGCCTACCAC
GGAATCACGC GGCTGCTGCG CGAGATTATG GCGCCATGGG GACTGGAATA CAGCCGCGTC
GATATGCGCG ACCCGCAGAA CGTCGCAGCG GCGTTGCGCC CGAATACGCG CCTGGTGTGG
ATTGAGACTC CGTCGAATCC GCTGCTCAAA ATCGCTGATA TTGCCGCAAT TGCGGAGATT
GCCCGTCAGG CTGGCGCACT CTGCGCGGTG GACAACACAT GGGCGACGCC GGTGCAGCAG
CGCCCGCTGG AATTGGGCGC CGACATCGTG ATGCACGCGA CAACGAAGTA CATCGGCGGG
CATAGCGACG TGCTCGGCGG GGCTGTAGTG TTTGGCGCGA ACGATGCGTT TGCCGAACGG
GTGCGTTTCC TTCAGATCAA TGGCGGCGCC GTTCCGTCGC CGTTCGACTG CTGGCTGGCG
CTGCGCGGCA TTCAGACGCT CCCCTATCGC GTGCGCGCGC ATGCCGCAAA TGCGATACAG
GTCGCCCGTT TCCTGGCGGA GCACCCGCGC ATCGAGCGCG TTCACTATCC CGGACTGGAA
ACGCACCCCG GTCACGCGGT TGCTGCGCGG CAGATGCGCG GCTTTGGCGG CATGCTGTCG
ATTGAAGTCG AAGGCGGCGA GGCGGAAGCC ATGGCAGTCG CTGCGAAGGT CAAGATCATT
ACCCGCGCCA CCAGTCTCGG CGGCGTCGAG AGCCTGATCG AACATCGCGC ATCGGTAGAG
GGACCGGAGA GCACAACGCC GCCCAACCTG CTGCGCATTT CGGTCGGACT CGAACATCCC
GATGACCTGA TCGCCGATCT GGCGCAGGCG CTGACATAG
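
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field might be recomputed from such a listing; `clean_sequence` and `gc_percent` are hypothetical helper names, and the exact rounding the database applies is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence listing into one uppercase string."""
    # str.split() with no argument drops spaces, tabs and newlines alike.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene.
example = clean_sequence("ATGC atcc\nGGTT ")
print(example, round(gc_percent(example)))
```

Pasting the full listing above into `clean_sequence` and rounding the result of `gc_percent` to a whole number is one way to compare against the reported GC content.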
 
Protein sequence
MHPETASIHA GQEIDPTTGA VIPPIYLTTT FERASDGSFP RGYIYTRNGN PNRAMLETCL 
AALEGGATCV AFSSGLAAAM SVFQALRPGD HVIAPDDAYH GITRLLREIM APWGLEYSRV
DMRDPQNVAA ALRPNTRLVW IETPSNPLLK IADIAAIAEI ARQAGALCAV DNTWATPVQQ
RPLELGADIV MHATTKYIGG HSDVLGGAVV FGANDAFAER VRFLQINGGA VPSPFDCWLA
LRGIQTLPYR VRAHAANAIQ VARFLAEHPR IERVHYPGLE THPGHAVAAR QMRGFGGMLS
IEVEGGEAEA MAVAAKVKII TRATSLGGVE SLIEHRASVE GPESTTPPNL LRISVGLEHP
DDLIADLAQA LT
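
The protein sequence follows from the gene sequence by codon-wise translation. Below is a hedged sketch of that step: translation table 11 assigns the same amino acids to codons as the standard code (it differs in the set of permitted start codons), so the usual 64-codon table is used. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order, third position varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last nine bases of the gene sequence listed above.
print(translate("ATGCATCCCG AAACTGCATC CATTCACGCC"))
print(translate("CTGACATAG"))
```

The first call yields MHPETASIHA and the second yields LT, matching the start and end of the protein sequence shown above.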