Gene Snas_1580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1580 
Symbol 
ID8882769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1656122 
End bp1657501 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content69% 
IMG OID 
Productsulfatase 
Protein accessionYP_003510376 
Protein GI291299098 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.37197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCTC GGCCGAACGT CATCGTGGTC TTCACCGACC AGCAGCGCTG GGACACCACG 
GGGGCGCAGG GAAATCCGCT GGGACTGACG CCGAACTTCG ACCGAATGGC CGACACCGGC
ACCCACGCGC GGCTGGCGGT GACCCCCAAT CCGGTGTGCG GCCCGGCGCG GGCGGCGTTG
CAGACCGGCC GGTATCCGAC GGCCAACGGG TGCTACCGCA ACGCGATTCC GCTGCCGGAG
TCGGAGCGCA CGCTGGCGCA CCACTTCGCC GACGCCGGGT ACGACACCGG TTACGTGGGC
AAATGGCACC TGGCCGACGC CGATCCGGTG CCGGAGTCGC AGCGCGGCGG CTACGGGGAA
TGGCTGGCGG CCAACACGCT CGAGTTCACC TCCGACGCGT ATCGGACCAT TGTGTACGGG
CAGGACGGCG AGCCGGTGCT GCTGCCGGGA TATCGCTCCG ACGCGTGTTT CGACGCCGCG
ATCCGGTTCG TCACCGACCA CCACGATCGT CCGTTCTACC TTTTCTTGTC GCTGTTGGAG
CCGCACCACC AAAACGAGGT GGACGACTAT CCGGCGCCCG ACGGGTACGA GCAGCGGTAC
CAGGGGCGTT GGATGCCACC GGATCTAGCG GCGCTGTCGG CGAACGGCGG CACCGCGCAC
CGGCACATGG GCGGGTATCT GGGTCAGATC GCCCGTGTCG ACGAGGGTCT GGGGCGGTTG
CACGACGCGT TGCGCAGTCT GGGGCTGGCC GAGGACACGA TCGTCGCGTA CACCTCGGAT
CACGGTTGTC ACTTCAAGAC CCGCAACTCC GAGTACAAGC GGTCGGCGCA CGACGCCTCG
ATCCGGGTGC CGCTGGCGAT CTCGGGGCCG GGTTTTACCG GCGGGAGCCG CATCGACCGG
CCGGTCAGCA CTGTGGACCT GCCGCCGACG CTGCTGGACG CCGCCGGGAT CGCGGTGCCG
GAGGCGATGC AGGGGACGTC GTTCCTGCCG CTGGTGCGCG ATCCGGGGGC CGAGTTCCCC
GACGAGGCGT TCATCCAGGT CAGCGAGGCC CAGTGCGGGC GCGCGATCCG GACGAAGCGG
TGGCTGTACT ACGTGTCGGA TCCCGACGCG GATGGCTGGG ACGACGCGGC CAGTTCCCGT
TACGTCGAGA CCGAGTTGTA CGACCTGGAG CACGATCCGC ACCAGTTGAA CAACCTGGCC
GGGTATCCGT CGCACCGTGG GGTGTGTGAT GAGCTGCGGT CGCGGTTGTT GGCGCGGCTG
GCAGCCGCCG GTGAGGATGC GGCCGAGATC GTGGCGGCTC CCGAGCCAGT CGGGGCGCCG
GTGCGGCACG TGGATCCGGT GGCGCGTTCG CTGTCGGTGC CGCCGATTCG GTTCAGCTGA
 
Protein sequence
MSARPNVIVV FTDQQRWDTT GAQGNPLGLT PNFDRMADTG THARLAVTPN PVCGPARAAL 
QTGRYPTANG CYRNAIPLPE SERTLAHHFA DAGYDTGYVG KWHLADADPV PESQRGGYGE
WLAANTLEFT SDAYRTIVYG QDGEPVLLPG YRSDACFDAA IRFVTDHHDR PFYLFLSLLE
PHHQNEVDDY PAPDGYEQRY QGRWMPPDLA ALSANGGTAH RHMGGYLGQI ARVDEGLGRL
HDALRSLGLA EDTIVAYTSD HGCHFKTRNS EYKRSAHDAS IRVPLAISGP GFTGGSRIDR
PVSTVDLPPT LLDAAGIAVP EAMQGTSFLP LVRDPGAEFP DEAFIQVSEA QCGRAIRTKR
WLYYVSDPDA DGWDDAASSR YVETELYDLE HDPHQLNNLA GYPSHRGVCD ELRSRLLARL
AAAGEDAAEI VAAPEPVGAP VRHVDPVARS LSVPPIRFS