Gene Sare_4509 details


Gene Information

Locus tag: Sare_4509
Symbol:
ID: 5707030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5097782
End bp: 5098873
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 69%
IMG OID: 641273923
Product: pyridoxal-5'-phosphate-dependent protein beta subunit
Protein accession: YP_001539272
Protein GI: 159040019
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
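
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5097782, 5098873)
print(length, protein_length_aa(length))  # prints: 1092 363
```

Both results match the Gene Length and Protein Length fields of the record.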


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.263951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00963798
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTCAAC TGGATCGGTG CGACGACGCT GTTCGGGGGT GGGTGACCGA GGCGATCGCC 
GCCGTGGAGG CGGACGCCAA CCGGTCCGCC GACACCCACC TGCTGCCGTT TCCACTACCT
CGGCGGTGGG GAATCGACCT CTACCTCAAG GACGAGTCGG TGCACCCGAC TGGTTCCCTG
AAGCACCGGC TGGCCCGGTC GCTGTTCTTC TACGGGCTCT GCAACGGTTG GATCGGTCCG
GACACCACGA TCGTGGAGGC CTCCTCAGGG TCGACGGCGG TGTCCGAGGC ATACTTCGCC
CGGATGCTCG GGTTGCCGTT CATCGCGGTG ATGCCGGCGT CCACCTCGGC GGAAAAGATC
GCCAAAATCG AGTTCCAGGA GGGGCGCTGT CACCTGGTGG ACGACCCGGC CAAGGTGGTC
GTCGAGGCAC GCTGGCTGGC CGAGGACACC GGCGGCCACT TCATGGACCA GTTCACCTAT
GCCGAGCGGG CCACCGACTG GCGGGGCAAC AACAACATCG CCGAGTCGAT CTTCTCGCAG
CTCGCTCTGG AACGGCACCC GGTGCCGGCG TGGATCGTGG TCGGTGCCGG CACCGGCGGC
ACCAGCGCCA CCATCGGCCG GTACGCGCGC TACCGTCGGC TGCCCACCAA ACTCTGCGTG
GTGGACCCGG AGAACTCGGC GTTCTACCCG GCCTGGCAGA CATCCGACTG GTCGGTGCGC
ACCGGCCGCG GCTCCCGAAT CGAGGGCATC GGGCGACCGA CCGTGGAAGC ATCCTTCCTG
CCGGCGGTGG TCGACCGGAT GGTGCAGGTG CCCGACGCCG CCTCGTTGGC CGCGATGCGG
GCTGGCTCCG ACGTGCTGGG CCGTCGGGTC GGTGGCTCGA CCGGCACCAA CCTCTGGGGC
GCCTTCGGGC TGATCGCCGG ACTGCTCGCG GCCGGGCAGT CCGGCTCGGT GGTCACCCTG
ATCTGCGACG CCGGGGACCG CTACGCCGAC ACCTACTACG CCGACGACTG GGTCAGCGCC
CAGGGGCTCG ACCTGTCGCC GCACCTGGCG ACCATCGAGC GTTTCCTGGA CACCGGCGCC
TGGCCGGCCT GA
 
Protein sequence
MTQLDRCDDA VRGWVTEAIA AVEADANRSA DTHLLPFPLP RRWGIDLYLK DESVHPTGSL 
KHRLARSLFF YGLCNGWIGP DTTIVEASSG STAVSEAYFA RMLGLPFIAV MPASTSAEKI
AKIEFQEGRC HLVDDPAKVV VEARWLAEDT GGHFMDQFTY AERATDWRGN NNIAESIFSQ
LALERHPVPA WIVVGAGTGG TSATIGRYAR YRRLPTKLCV VDPENSAFYP AWQTSDWSVR
TGRGSRIEGI GRPTVEASFL PAVVDRMVQV PDAASLAAMR AGSDVLGRRV GGSTGTNLWG
AFGLIAGLLA AGQSGSVVTL ICDAGDRYAD TYYADDWVSA QGLDLSPHLA TIERFLDTGA
WPA
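
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be rejoined and how a GC fraction (the basis of the GC content field) could be computed from it, the helper names below are hypothetical and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as it appears in the record.
first_line = "GTGACTCAAC TGGATCGGTG CGACGACGCT GTTCGGGGGT GGGTGACCGA GGCGATCGCC"
seq = clean_sequence(first_line)
print(len(seq))                    # prints: 60
print(round(gc_fraction(seq), 3))  # GC fraction of this 60-base sample only
```

Passing the full gene sequence through the same two functions would give a value to compare against the reported GC content.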