Gene Sare_2283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2283 
Symbol 
ID5706042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2621371 
End bp2623452 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content73% 
IMG OID641271761 
Productsulfatase 
Protein accessionYP_001537132 
Protein GI159037879 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00157811 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGCTGATC AGCCCGCCAC GCCCGAACCG GCACCACCCG AGGACGGCGG TACCCAGGTG 
GGTCGATCGG CCGGACCAGT GGCTCCCGAC GGTGGGTCAC GGCGTGGCTG GCGAGCGGAG
GGCGGCCGGC TGCTGGAGAT CACCGCACTG CTCGGGCTCG CGGTCACTCA GCCGTTGCTG
GACGTGCTCG GCCGCAGTCC GGACTTCTTC CTGTTCCATC GTGCCGGCCG GGGTGAGATT
CTGCAGTTGG TCGCACTGGT GGCGATCGTG CCGACCGTCG CGGTCGGTCT GGTCGCGGCG
GCATCGCGGT TGGCCGGCCG CACCGCCCGG AAACTGACCC ACGCACTGCT CGTGGGTCTC
CTGCTCACCG CACTGGCCGT GCAGGTCGGT CGACACACGA CACCAGTGCG GGGCCTACCG
CTGTTGGTGC TGGCGGTTGT CGTCGCGGCG GCCGGGGTGG CCGCCTACCG GCGTTGGCGC
GCCCTGGGGC GGGTGCTGCG GGTCGCGGCG GTCGGGCCGG CGGTCTTCGT CGTGTTGTTC
CTGGTTGCCT CCCCGACCTC GACCGTGGTG TTGCCGCGCG GGGACGGTGG TGCCGCCGGG
TTGGCCCGCG CCGGCGGTCA CCCACCAGTG GTCCTGCTGG TCCTCGACGA GCTGCCCCTG
GTTTCCCTGC TGGCCCCGAA CGGTCGGATC GACGCAGCTC GGTTCCCGCA CTTCGCGGAG
CTGGCCGCCG GCTCGACCTG GTACCGCAAC GCGACCGGGG TCAGCGGCTG GACACCGTAC
GCGCTGCCGG CAATGCTGAC CGGCCGCTAT CCGGCCACCG GGGCGGCCCC ACACTACTCG
CAGCACCCGG ACAACCTGTT CACCGCGTTC GGCGGCCTGT ACGACATTCG TGCCGAGGAG
AGCATCACCC GCCTCTGCCC GCCCAGCCGC TGCGACACAC CGCCGGACCG GGAGCAGGGG
ATGGGGGTGC TGGTACGGGA GAGCACGAAA CTGCTGGCCC GGCTCTCCGC GCCGGCGGAC
AGCCGGGTCG ATCCCGCCGA CTCGTACCGG GAGCGGACCG CCGCCGAGGC GGGCATCGAC
GCCGCCGAGC CCATTCCGGA CGATCCGAAG TTCCGCTGGG ACCGGTTGAA CGCCAACCAG
CCGGCCCGGT TCAGCAGTTT CCTCGCCGGG CTCCGGCCGT CTGACCGCCC AACGCTGCAC
TTCCTGCACC TGCTGATGCC GCATTCGCCG TGGGCGTACC TGCCCTCGGG CGTGCGCTAC
GAGGCACCCG AGGACTTCCC GAACGAGGGG GAGGGCTGGG TGGAGTTGGC CCGCCAGCGG
CACCTGGCCC AACTCGGGTA CACCGACCGG CTGATCGGCG AAACTCTGCG TACGCTGCGC
GCCACCGGAC TGTACGACGA TGCCCTGCTG GCGGTCACCG CCGACCACGG GGTGAGCTTC
ACCAAGGGGG CGCAGGGGCG GGGGATGGGC GCCATCGAGG CCGCCGCCGA CGAGGTGGCC
TGGGTGCCGC TGTTTGTCAA GTACCCCGGG CAGCGTACCG GCCGGCTCGA CGACCGGAAC
TGGCAGCATG TCGACCTGCT GCCCACCCTT GCCGACGAGG CGGCGATCCG GCTGCCCTGG
TCGGTCGACG GCCAGTCGGC GCGGGAGGCG CCCCGGGCCG AGGCGGGCAA GGTCTTCTAT
GACCGGCCCG CCCAGCCGAC TCCGATCAAC GGTGGGGTTC CCGCCGCGAT ACCGCCCGCC
GCGCCGCATC CGCTGGTCGG TACCACCGTG CCGGACCAGC CGGTGGCAGG CTCGGCCCGG
GTCGGGAACC TGGCCGCCTT TCGCGAGGTG GACCCGGACC GCGGCTCGCT GCCCGCGTTG
GTCTGGGGTG ATCTGCCCGA CGACATCCCC GACGGCACCC CGCTGGCGGT CGCCGTCAAC
GACCGGGTCG CCGTTGTGGT GCCGGTGGTT CCCCGGGACG AGGGCGGGCG CCGGTTCGCG
GCCCTGATTG CCGACGACCG ACTCTTCCGG TCCGGGGTCA ACCGCCTCGG CCTGTTCCTC
GTCTCCGCCG ATGGCACGCT GAACCGGCTC GCGCTCTCCT GA
 
Protein sequence
MADQPATPEP APPEDGGTQV GRSAGPVAPD GGSRRGWRAE GGRLLEITAL LGLAVTQPLL 
DVLGRSPDFF LFHRAGRGEI LQLVALVAIV PTVAVGLVAA ASRLAGRTAR KLTHALLVGL
LLTALAVQVG RHTTPVRGLP LLVLAVVVAA AGVAAYRRWR ALGRVLRVAA VGPAVFVVLF
LVASPTSTVV LPRGDGGAAG LARAGGHPPV VLLVLDELPL VSLLAPNGRI DAARFPHFAE
LAAGSTWYRN ATGVSGWTPY ALPAMLTGRY PATGAAPHYS QHPDNLFTAF GGLYDIRAEE
SITRLCPPSR CDTPPDREQG MGVLVRESTK LLARLSAPAD SRVDPADSYR ERTAAEAGID
AAEPIPDDPK FRWDRLNANQ PARFSSFLAG LRPSDRPTLH FLHLLMPHSP WAYLPSGVRY
EAPEDFPNEG EGWVELARQR HLAQLGYTDR LIGETLRTLR ATGLYDDALL AVTADHGVSF
TKGAQGRGMG AIEAAADEVA WVPLFVKYPG QRTGRLDDRN WQHVDLLPTL ADEAAIRLPW
SVDGQSAREA PRAEAGKVFY DRPAQPTPIN GGVPAAIPPA APHPLVGTTV PDQPVAGSAR
VGNLAAFREV DPDRGSLPAL VWGDLPDDIP DGTPLAVAVN DRVAVVVPVV PRDEGGRRFA
ALIADDRLFR SGVNRLGLFL VSADGTLNRL ALS