Gene Sare_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2649 
Symbol 
ID5703594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3018412 
End bp3019362 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content71% 
IMG OID641272107 
Productalpha/beta hydrolase fold 
Protein accessionYP_001537477 
Protein GI159038224 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0585324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00353536 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATCCGAG AAGGGGAGCC GGGGACCGAA TTCGCCTATC GAGCGGACAC TAAACGGTTA 
GCGTACGAGG TCTCGGGTGC ACCCGACGGG CACCCGGTCT TCCTCATGCA CGGCACCCCG
GGGAGCCGCA AGGGGCCAAA GCCGCGAGGA ATCGTCCTCT ATCGATTAGG CGTAAAACTG
ATCACCTACG ACCGGCCTGG CTACGGCGAC TCGGACCGGT TCGAAGGGCG CGACGTGGCC
GACGCGGCAC GCGACGTGGA GGCCATCGCG GAGCACCTGG GGCTGGCCCG CTTCGCCGTC
GTCGGCAGAT CCGGCGGCGG ACCGCACGCC CTCGCCTGCG CCGCCGACCC CACGCTGCGC
CACCGGGTGA CCCGGGTGGC GGTGCTGGTC GGCTTCGCGC CCGCCAACGC GCCGGAGCTG
GACTGGTTCG CCGGGATGAA CACCGACAAC GTCCAGGGCT TCGGCGCCGG CCGGTCCGAC
ACCCCCGCCA TAGTGGAGGA GATCCGCCGC CGGGCGCAGC GGGCCAGCGA AGATCCACGG
CTGCTGCTGG ACGAACTGAC AACACAGATG ACCGCGGCGG ACCGACGGGT CATCCGCGAT
CCAGCACTGC GGCGGATGCT CACCGACACG TTCGCCGACG CGCTGCGCGC CGGCCCGTAC
GGGTGGATCG ACGACGTCCT CGCGCTGCGC CGGGACTGGA AGTTCGACCT CGGCCTGATC
GACTCCTCGG CGACGAAGGT GCGGCTCTGG CACGGCGCCG AGGACACCTT CGCCCCGGTC
GGCCACACCC GGTGGCTCGC CTCCCGCATT CCCGGCGCGG AGCTCGAGGT GCAGGCCGGC
GCGGCGCACT TCGACGCGGT GGAGGAACTG CCACGCATCC TGAGCTGGCT CACCACCGAC
GACGCGGCGG TGCCCCAGGA CCTCCTGATC GGCGCCCGGT TCGGTCAGTA G
 
Protein sequence
MIREGEPGTE FAYRADTKRL AYEVSGAPDG HPVFLMHGTP GSRKGPKPRG IVLYRLGVKL 
ITYDRPGYGD SDRFEGRDVA DAARDVEAIA EHLGLARFAV VGRSGGGPHA LACAADPTLR
HRVTRVAVLV GFAPANAPEL DWFAGMNTDN VQGFGAGRSD TPAIVEEIRR RAQRASEDPR
LLLDELTTQM TAADRRVIRD PALRRMLTDT FADALRAGPY GWIDDVLALR RDWKFDLGLI
DSSATKVRLW HGAEDTFAPV GHTRWLASRI PGAELEVQAG AAHFDAVEEL PRILSWLTTD
DAAVPQDLLI GARFGQ