Gene Strop_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2135 
Symbol 
ID5058598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2414840 
End bp2416924 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content72% 
IMG OID640474398 
Productsulfatase 
Protein accessionYP_001158964 
Protein GI145594667 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.142592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0273474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGACC TGGGCGCCGT GCCCACATCG GACCACACCG AGGGCGTTGA CGGCGCCTCT 
GATCGACGTG CTGGCGCAGC GAGGGATCCG GACGGCGCGC CGCGGCGTGG CTGGAGAGCG
GAGGGTCGTC GGCTGCTGGA GCTCTCCGCT CTGCTCGGGC TCGCGGTTAC TCAGCCGTTG
TTGGATGTGC TCGGCCGGAG TCCGGACTTC TTCCTTTTCC ACCACGCCAG CCGCGGCGAG
ATCGTGCAGC TGGTCGTGCT GGTCGCGATC GTGCCGACCG TCGCGGTCGG AACGATCGCG
GCGACGTCGC GGCTGGCCGG CCGTACCGCT CGGGAGGTGA CCCACACGCT GCTCGTGGGT
CTCCTGCTTA CCGCGCTGGC CGTACAGATC GGACGGCACA CGACACCGTT GCGGGGCGTA
CCGCTGCTGG TGCTGGCGGC TGTCGTCGCG GTGGCCGCGA CGGTGGCGTA CCGGCGGTGG
CGTGCCCCGG CCCGCGTGCT GCGGGTCGCG GCGCTTGGGC CCGCGGTCTT CGTCGCGCTA
TTCCTGCTCG TCTCGCCCAC CTCGACGGTG GTGTTGCCGC GCGGCGACGG CGGTGCCGTC
GGGCTCGCCG GCGCCGGTGG CCATCCCCCG GTGGTCGTGC TGGTCCTCGA CGAACTGCCC
CTGCTCTCGC TGCTCGACCC GGACGGGCAG GTTGACGCGG ACCGGTTCCC GCACTTCGCG
GAGCTGGCCG CCGGCTCGAC GTGGTACCGC AACGCGACCG GGGTCAGCGG TTGGACACCG
TACGCGCTGC CGGCGATGTT GACCGGCCGC TATCCGGCCA CCGGGGCGGC TCCCCACTAC
TCGCAGCATC CGGACAACCT GTTCACCGCG TTCGGTGGCC TGTACGACAT TCGCGCCGAG
GAGAGCATCA CCCGCCTCTG CCCACCCAGC CGCTGTGAGA CACCGGTGGG CCGGGAACAG
GGGATGGGGG TACTGGTCCG GGAGAGTGCG AAGCTGCTGG CCCGGATGTC GGCGCCAACG
GAGAGTCGGG TTGACCCGGC GGACTCCTAC CGGGAGCAGA CCGCCGCTGA GGCCGGCATC
GACGCCGCCG AGCCGATCCC GGAGGATCCG AAGTTTCGCT GGGACCGGCT GAACGACAAC
CAGCCGGCCC GGTTCAGTAG CTTCCTCGCC GGACTTCGGC CGTCGGACCG GCCCACCCTG
CACTTCCTGC ACCTGCTGAT GCCGCATTCG CCATGGGCGT ACCTGCCCTC GGGAGTGCGC
TATGAGGCGC CAGAGGACTT CCCGAACGAG GGGGAGGGCT GGGTGGAGTT GGCCCGCCAG
CGGCATCTGG CCCAGCTCGG CTACACCGAC CGGCTGATCG GCGAGACCCT GCGTACGCTG
CGCGCGACCG GGCTCTACGA CGACGCCCTG CTGGTGGTCA CCGCCGACCA CGGGGTGAGC
TTCACCAAGG GGGCACAGGG GCGGGGGATG GGTGCGATCG AGGTCGCCGC CGACGAGGTG
GCCTGGGTGC CGTTGTTTGT CAAGTACCCG GGGCAGCGCA CCGGTCGGCT CGATGACCGG
AACTGGCAGC ACGTTGACCT GCTGCCCACC CTCGCCGACG AGGCGGCGAT CCGGCTGCCC
TGGGCGGTGG ACGGCCGGTC GGCGCGGGAA GCGCCCCGGG CGCAGGCGGA CAAGGTCTTC
TATGACCGGC CCGCCCAGCC GACCCCGATC GGCGGTGGCG TACCCACCCC GGTGCCGCCC
GCCGCGCCGC ACCCGCTGAT CGGCACCACG GTGCCGGATC AGCCGACGGC GGGGGTGGCC
CGTGTCGAGA ACCTGGCTGC CTTCCGGGAG GTGGACCCGG ACCAGGGCAT GCTGCCCGCG
CTGGTCTGGG GTGACCTGCC CGACGAGATC CCTGACGGCA CCCCGCTGGC GGTGGCCATC
AACGGCCGGG TCGCCGTCGT GGTTCCGGTG GTTGGCCGGG ACGAGGGCGG GCGTCGGTTC
GCCGCCCTGA TCAGCGATGA CCAGCTCTTT CGGTCGGGAA CGAACCAGCT CGGTCTCTTC
CTCGTCGCCG CCGACGGCAC GCTGACCCGG CTCGCGCTCT CCTGA
 
Protein sequence
MADLGAVPTS DHTEGVDGAS DRRAGAARDP DGAPRRGWRA EGRRLLELSA LLGLAVTQPL 
LDVLGRSPDF FLFHHASRGE IVQLVVLVAI VPTVAVGTIA ATSRLAGRTA REVTHTLLVG
LLLTALAVQI GRHTTPLRGV PLLVLAAVVA VAATVAYRRW RAPARVLRVA ALGPAVFVAL
FLLVSPTSTV VLPRGDGGAV GLAGAGGHPP VVVLVLDELP LLSLLDPDGQ VDADRFPHFA
ELAAGSTWYR NATGVSGWTP YALPAMLTGR YPATGAAPHY SQHPDNLFTA FGGLYDIRAE
ESITRLCPPS RCETPVGREQ GMGVLVRESA KLLARMSAPT ESRVDPADSY REQTAAEAGI
DAAEPIPEDP KFRWDRLNDN QPARFSSFLA GLRPSDRPTL HFLHLLMPHS PWAYLPSGVR
YEAPEDFPNE GEGWVELARQ RHLAQLGYTD RLIGETLRTL RATGLYDDAL LVVTADHGVS
FTKGAQGRGM GAIEVAADEV AWVPLFVKYP GQRTGRLDDR NWQHVDLLPT LADEAAIRLP
WAVDGRSARE APRAQADKVF YDRPAQPTPI GGGVPTPVPP AAPHPLIGTT VPDQPTAGVA
RVENLAAFRE VDPDQGMLPA LVWGDLPDEI PDGTPLAVAI NGRVAVVVPV VGRDEGGRRF
AALISDDQLF RSGTNQLGLF LVAADGTLTR LALS