Gene Sare_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0049 
Symbol 
ID5707250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp56756 
End bp58192 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content71% 
IMG OID641269575 
Productprotein serine/threonine phosphatase 
Protein accessionYP_001534976 
Protein GI159035723 
COG category[T] Signal transduction mechanisms 
COG ID[COG0631] Serine/threonine protein phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.24969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTGA CCCTGCGCTA CGCGGCCCAC AGTGACCGCG GTCTGATCCG CGACGGTAAC 
CAGGACTCCG TCTATGCCGG GCCACGGCTG CTCGCCGTAG CTGACGGTAT GGGTGGTATG
GCCGCCGGTG ACGTCGCCAG CAATATCGTC ATCGGCGCAA TGGCGCCGCT GGACGAGGAC
GTCCCGGGCA ATGCCCTCGT GGACGCGCTC CGTTCGGCCG TCACCAACGC CACCCAGCAA
CTGCGCGAGA CGGTTGACGC CAACCCGCAG TTGGAGGGGA TGGGGACCAC GCTCACCGCG
ATCCTCTTCT CCGGCAGCAA GCTGGGCATG GTGCACATCG GCGACTCCCG TGCCTACCTG
CTGCGTGCCG GCGAGTTCGC CCAGATCACC AAGGACGACA CCTACGTCCA GATGCTGGTC
GACGAGGGCC GGATCAGTGC CGAGGAGGCG AGTAGCCACC CCCAACGGTC ACTGCTGACC
CGGGCCCTCG ACGGCCGGGA CATCGACGCG GAGTACTCGG TGCGCCAGGT CCTCACCGGT
GACCGGTACC TGATCTGCAG CGACGGCCTC TCGGGCGTGG TGAGCGCGGA TACGATCGCC
GACACGATGC GGGAGTACGC CGACCCGCAG CAGTGCGTGG AGCGGCTGGT GCAGCTCGCG
TTGCGCGGTG GCGGCCCGGA CAACATCACC GTGATCATCG CTGACGCGAC GGACCAGGAC
ATCGTCGAGG CGACCCCGAT CGTCGGCGGC GCCGCCTCCC GGGACCGGGG CATGGCGACC
TCCGCTGACG ACTCCACCCC CGCGGCCCGT GCCTCCGCGC TCTCCGCGTC ACCGCCGGCC
CCGCCGGACG AGCCGACGGC CGGGGCCGAC GACGAGCCGG AGCGCCGTCG CCGCCGGCCG
ATCCGTGTCG CGGCGGTGAC CCTCGCCCTG CTCGTCCTTG TGGGTGGTGC GGTCTACGGG
GGATGGAGCT ACACCCAGCG GCAGTACTAC GTCGGAGCCA CCGAGGGCGG CCAGGTCGCC
GTGTTCCGCG GCATCGAAGG TCAGGTCGCC GGTGTGGATC TCTCCACCGT GCATTCGACC
AGCTCCGCCG AACTGGATGA CCTCACGCTC GCCGCGCAGG AGCGGGTGAA ACAGGGCATC
CCGGCCAAAA GCGAATCGGA TGCGGCGCGC CGACTCGCCG AGCTGACCGC TGACGTGCCG
ACCAACCCCA ACCTGAAGCC GACCTGTCCC CCCAGCCCGA GCCCCAGCCC GTCAGCGGTG
TCCGTCACGC CGACGCCGAC CCCTGTTGCG GGTACGCCTT CCCCCAGCCC CAGCGCCGCG
ATCCCGAGCG CAGCGGCGAC CGACCAGCCG ACCGAACCGA CCACCTCGCC CGACGCCTTG
CCCGCCGATC CGGCTCCGCC GGCCGTCGAT CCGGCCGCCT GCCGGTCGCC CGAGTGA
 
Protein sequence
MTLTLRYAAH SDRGLIRDGN QDSVYAGPRL LAVADGMGGM AAGDVASNIV IGAMAPLDED 
VPGNALVDAL RSAVTNATQQ LRETVDANPQ LEGMGTTLTA ILFSGSKLGM VHIGDSRAYL
LRAGEFAQIT KDDTYVQMLV DEGRISAEEA SSHPQRSLLT RALDGRDIDA EYSVRQVLTG
DRYLICSDGL SGVVSADTIA DTMREYADPQ QCVERLVQLA LRGGGPDNIT VIIADATDQD
IVEATPIVGG AASRDRGMAT SADDSTPAAR ASALSASPPA PPDEPTAGAD DEPERRRRRP
IRVAAVTLAL LVLVGGAVYG GWSYTQRQYY VGATEGGQVA VFRGIEGQVA GVDLSTVHST
SSAELDDLTL AAQERVKQGI PAKSESDAAR RLAELTADVP TNPNLKPTCP PSPSPSPSAV
SVTPTPTPVA GTPSPSPSAA IPSAAATDQP TEPTTSPDAL PADPAPPAVD PAACRSPE