Gene Sare_2956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2956 
Symbol 
ID5707810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3353374 
End bp3354771 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content72% 
IMG OID641272405 
Productcystathionine beta-synthase 
Protein accessionYP_001537773 
Protein GI159038520 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01137] cystathionine beta-synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.192465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0470778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGC ACGAGTCGTT GACGGAACTG ATCGGCGACA CCCCGCTGGT CCGCCTCCGC 
AGGGTCGTCG CCCCGAGGTC CGCCCCGCTG TACGCCAAGG TCGAGTACCT CAACCCGGGC
GGGTCGGTGA AGGACCGGAT CGCGTTGCGC ATGGTGGAGG CGGCCGAGGC CGACGGCCTG
CTGCGGCCGG GCGGAACCCT GGTCGAGCCG ACGTCCGGCA ACACCGGTGT CGGACTGGCC
ATCGTCGCTC AGCAGAAGGG CTACCGCTGC GTCTTCACCT GCCCGGACAA GGTCTCCCAG
GACAAGATCG ACGTGCTGCG GGCATACGGC GCCGAGGTGA TCGTCTGTCC CGCCGCGGTC
CGGCCGGAGC ACCCGGCGTC CTACCGCAGC CAGGCGACCC GGGTCGCCGC AGAGCGTCCC
GGGGCGGTTC AGCTGAACCA GTACGCCAAC GGCAACAACC CGGACTCCCA CTACCGGGGC
ACCGGGCCGG AACTGTGGCG GCAGACGGAG GGTCGGCTGA CCCACTTCGT TACCGGCATC
GGCACCGGCG GCACGATCTC TGGTACCGGT CGCTACCTCA AGGAGGTCTC CGGCGGCGCG
GTGCAGATCG TCGGTGTCGA CCCGGACGGC TCGATCTACT CGGGCGGAGA GGGCCGGCCC
TACCTGCTGG AGGGGGTCGG CCAGCCCAGC TTTCCGGCAT CGTACGACTC CACGGTGGTC
GACGAGGTCA TCGCCGTGAC CGACGCCGAG GCGATCCTGA TGACCCGTCG CCTGGCCCGG
GAGGAGGCGC TGCTGACCGG CGGCTCCGGG GGCATGGCCG CCGAGGCCGC GGTGCGGGTC
GCCGCCGGGC TCGGCCCAGG GACGCTTGCG GTTGTTCTGA TCCCCGACTC CGGCCGCGGC
TACCTGTCCA AGATCTTCAA CGACAGCTGG TTGGCCGGGC TCGGCCTGTA CGACCCGCCG
ACCACCCGGC CCCGGGTCCG TGACGTCGCG CCGGCCGGTC CGCTTCCGCT GATCCGACCG
GACGAGACCC TCGCCAACGC CGTCAAGACC CTGCGCGCCA CGGAGGGGGG CCACCTGCTG
GTAAGCGCCG CGCCGCCGCC GGTCCGGCTG GCCGAGATCC TGGGCACCGT GACCGACGCC
GTGCTGGTCG CCGCTCTCGC CGGTGGTGCG GACCTCGGTG ACCCGATCGG GGCGCACCTG
AACCCGCCGC CACCGCAGGT CGGCGCCGGG CAACCGCTGT CGGCGGTGAC CGAGATGCTG
TCGGATCCGT CGTCCGCCGT CCTGGTGGTG GACGGCGGAC TCCCATGTTG CCTGCTGACC
GCCGCGGACC TCGTCGCGTT GCTGGCCACG CGACGACCGG ACGCGGCAGC CCGATCGCCT
GTGCTGGGGG GAACGTGA
 
Protein sequence
MSMHESLTEL IGDTPLVRLR RVVAPRSAPL YAKVEYLNPG GSVKDRIALR MVEAAEADGL 
LRPGGTLVEP TSGNTGVGLA IVAQQKGYRC VFTCPDKVSQ DKIDVLRAYG AEVIVCPAAV
RPEHPASYRS QATRVAAERP GAVQLNQYAN GNNPDSHYRG TGPELWRQTE GRLTHFVTGI
GTGGTISGTG RYLKEVSGGA VQIVGVDPDG SIYSGGEGRP YLLEGVGQPS FPASYDSTVV
DEVIAVTDAE AILMTRRLAR EEALLTGGSG GMAAEAAVRV AAGLGPGTLA VVLIPDSGRG
YLSKIFNDSW LAGLGLYDPP TTRPRVRDVA PAGPLPLIRP DETLANAVKT LRATEGGHLL
VSAAPPPVRL AEILGTVTDA VLVAALAGGA DLGDPIGAHL NPPPPQVGAG QPLSAVTEML
SDPSSAVLVV DGGLPCCLLT AADLVALLAT RRPDAAARSP VLGGT