Gene Sare_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0855 
Symbol 
ID5705958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp955614 
End bp956984 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID641270374 
Productcystathionine beta-synthase 
Protein accessionYP_001535764 
Protein GI159036511 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01137] cystathionine beta-synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0877269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGTACT ACGACAACGT CGTCGAGTTG ATCGGCAACA CCCCGCTGGT ACGCCTGCGC 
AACGTGACCG AGGGTATCCA GGCCACCGTG CTGGCGAAGG TGGAGTACCT GAATCCAGGT
GGGTCGGTCA AGGACCGGAT CGCCCTGCGC ATGGTGGAGG ACGCCGAGCA GGCGGGGATC
CTGCGGCCGG GCGGCACGAT CGTCGAGCCG ACCAGCGGCA ACACCGGCGT GGGGCTGGCT
CTGGTGGCGC AGCTCAAGGG CTACCGGTGC GTGTTCGTCT GCCCGGACAA GGTCAGTCAG
GACAAGCAGG ACGTGCTGCG TGCCTACGGT GCCGAGGTGG TGGTCTGCCC GACCGCTGTC
GCGCCCGCGG ACCCACGGTC CTACTACAAC GTCTCTGACC GCCTTGCCCG GGAGATCCCC
GGCGCCTGGA AGCCCAACCA GTACGCGCAC CCGGCGAACC CCCGCTCCCA CTACGAGACC
ACCGGGCCGG AGCTGTGGGC GCAGACCGAG GGCCGGATCA CCCATTTCGT CGCCGGTGTC
GGCACCGGTG GCACGATCTC CGGCATCGGC CGTTACCTGA AGGAGGTGTC CGGGGGGCAG
GTCAAGGTGA TCGGCGCTGA CCCGGAGGGG TCGGTCTACT CCGGTGGCAC CGGGCGGCCG
TACCTGGTCG AGGGCGTCGG CGAGGACTTC TGGCCGGAAA CCTACGACCG GGGGGTCGCC
GACGGGATCG TCGAGGTCTC CGACAAGGCG TCGTTCGAGA TGACCCGCCG CCTGGCCCGC
ACCGAGGGCC TGCTGGTCGG TGGCTCCTGC GGGATGGCGG TCGTCGCGGC GTTGGAGGTG
GCCCGTGCGG CTGACCCGGA CGACGTGGTC GTGGTACTCC TGCCGGACGG TGGTCGCGGA
TACCTCTCCA AGATCTTCAA CGACTCGTGG ATGGCCCGGT ACGGTTTCGT GGACAACTCT
GGCAGTGAGC CGACCATCGC CGAGACGCTC GCCGGCAAGC CAGGTGGGCT GCCCGAACTG
GTGCACGTAC ACCCCACCGA GACGGTCCGT GACGCGATCG ACTACCTGCG CGAGTACGGT
GTCTCCCAGC TGCCGGTGCT GAAGGCCGAA CCGCCGGTGG TTACCGGCGA GGTGGCCGGA
TCGGTCGCGG AGCGAGACCT GCTCGACGCG CTCTTCACCG GCCAGGCGCA GCTACACGAC
ACCATCGAGC GGCACATGGC CGCGCCGCTG CCGATGATCG GCGGTGGGCA GCCGGTCAGC
GAGGCGGTCG CCCTGCTGGA GAAGTCCGAC GCCGCGCTGG TGCTGATCGA TGGCAAGCCG
AAGGGCGTGC TCACCCGGCA GGACCTGCTC GCGCACCTCG GTTCCCGCTG A
 
Protein sequence
MQYYDNVVEL IGNTPLVRLR NVTEGIQATV LAKVEYLNPG GSVKDRIALR MVEDAEQAGI 
LRPGGTIVEP TSGNTGVGLA LVAQLKGYRC VFVCPDKVSQ DKQDVLRAYG AEVVVCPTAV
APADPRSYYN VSDRLAREIP GAWKPNQYAH PANPRSHYET TGPELWAQTE GRITHFVAGV
GTGGTISGIG RYLKEVSGGQ VKVIGADPEG SVYSGGTGRP YLVEGVGEDF WPETYDRGVA
DGIVEVSDKA SFEMTRRLAR TEGLLVGGSC GMAVVAALEV ARAADPDDVV VVLLPDGGRG
YLSKIFNDSW MARYGFVDNS GSEPTIAETL AGKPGGLPEL VHVHPTETVR DAIDYLREYG
VSQLPVLKAE PPVVTGEVAG SVAERDLLDA LFTGQAQLHD TIERHMAAPL PMIGGGQPVS
EAVALLEKSD AALVLIDGKP KGVLTRQDLL AHLGSR