Gene Sare_0858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0858 
Symbol 
ID5705123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp959000 
End bp960193 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content70% 
IMG OID641270377 
Productpeptidase M42 family protein 
Protein accessionYP_001535767 
Protein GI159036514 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID[TIGR03106] hydrolase, peptidase M42 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.967973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0317113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGA ACAAGCCCGC ACCGCTGCCG CTCGACCTCG ACTACCTACG CCAAGTGCTG 
GTTGAGCTGC TGGAGATCCC GAGCCCGTCC GGCCGCACCG ATCACGTACA GCAGTACGTG
GGCGAGCGGC TGGCGGCGCT CGGGATCCCG TCGACGCTGA CCCGGCGGGG TGCCCTCAGC
GCCTGCCTCC CGGGACCGCG TACCACCGGT GCGGACCGGG CGATCGTGGT GCACACCGAC
GTCATCGGCG GAATGGTCAA ACGGCTCAAG GAGAACGGCC GGCTGGAGCT CAAACCGATC
GGGACACACA GTGCGCGCTT CGCGGAGGGC GCCCACGTAC GCGTCTTCAC CGACCACCTG
GATCAGGTGA TCACCGGCCA GGTGCTACCG CTCAAGGCCA GTGGCCACCG CTACCACGAG
GCGGTGGACT CCCAGGGCAT CGGCTGGGAG TTGGTCGAGG TCCGGGTGGA CGAGCCGGTG
GACGACATCG CCGGCCTGCG CGCACTGGGG ATCGACGCGG GCGACTTCGT GGCGCTCCTG
CCCAACCCGC AGGTCACCCC CTCCGGGTAT GTCAAATCCC GCCACCTGGA CGACAAGGCG
GGCGTGGCGG CGGTGCTGAC CGCCTGCAAG GCGTTGGTCG ACGCGGGTGT CACCCCGGCG
GTCAGCGCAC ACTTGTTGAT CACTGTCACG GAGGAGATCG GCCACGGCGC CTCGCACGGG
CTGGATCCGG ATGTGGCCGA GATCGTCTCG GTGGACGCGG CCGTGGTGGC CCCCGGGCAG
CAGTCCCGGG AGGATGCGGC AACCCTGGCG ATGGGCGACG GGGTCGGCCC GTTCGACTAC
CACCTGACCC GCAACCTGGC GGCGATCGCC CGTGAGCACG ACGTCGACCT GGTCCGCGAT
GTCTTCGACT ACTACCGTTC GGACGTCGCG GCGGCGGTCG AGGCCGGTGC GCACGCCCGG
GTGGCGCTGC TCGGGTTCGG GGTGGACGCC ACCCACGGCC ATGAACGCAC CCACCTGGAC
GGGCTGCGTC ACCTGACCCA ACTGCTGTGC CTCTACCTCC AGAGCGAGTT GGTCTTCCCG
GAGTGGGACG CGGAACCGGC GGGCGAACTC GCCGACTTCC CGTCGCTGGC CGTTCAACCC
GCCCAGGAGG ACGGGCCGCG GGACGGTCCG ATCGGCATCA CCGCCGTTTC GTGA
 
Protein sequence
MTPNKPAPLP LDLDYLRQVL VELLEIPSPS GRTDHVQQYV GERLAALGIP STLTRRGALS 
ACLPGPRTTG ADRAIVVHTD VIGGMVKRLK ENGRLELKPI GTHSARFAEG AHVRVFTDHL
DQVITGQVLP LKASGHRYHE AVDSQGIGWE LVEVRVDEPV DDIAGLRALG IDAGDFVALL
PNPQVTPSGY VKSRHLDDKA GVAAVLTACK ALVDAGVTPA VSAHLLITVT EEIGHGASHG
LDPDVAEIVS VDAAVVAPGQ QSREDAATLA MGDGVGPFDY HLTRNLAAIA REHDVDLVRD
VFDYYRSDVA AAVEAGAHAR VALLGFGVDA THGHERTHLD GLRHLTQLLC LYLQSELVFP
EWDAEPAGEL ADFPSLAVQP AQEDGPRDGP IGITAVS