Gene Snas_3261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3261 
Symbol 
ID8884460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3444181 
End bp3446538 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content69% 
IMG OID 
Productsulfatase 
Protein accessionYP_003512024 
Protein GI291300746 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0125079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGTCA TGACCGCACC CGACCGCACC CGGCTTCCGA TCGACGTCCG TTCCGCCACC 
CGTGACGCGC AGTCCGCGAC CGGGGCCGCG ACGGTCAGTC CCCCGGAGCC GGTGGTGCCG
GTGCGGCCGC CGTCGGGGGC GCCCAACGTC GTGGTCGTGT TGGTCGACGA CATGGGGTTC
GGCGCGTCCA GTCCGTACGG CGGCCCGTGC CGGATGCCCA CGGCGCAGCG CCTGGCCGAC
GAGGGGCTGC GGTACAGCCG GTTCCATGTG ACGGCGTTGT GTTCACCGAC GCGGCAGGCG
TTGTTGACCG GCCGCAACCA CCACTCGGTC GGCATGGGTG TCACCTCGGA GATGACGACG
CCGCAGCCGG GTTACACCGG TTACCGTCCG CCCAGTGCCG CGACGATGGC GCAGATACTG
GGCGGCAACG GTTACAGCAC GGCGGCGTTC GGGAAGTGGC ATCAGACGCC TCCCGCGGAG
GTGAGCCCGT CGGGTCCGTT CACTCGCTGG CCCACCGGTG AGGGTTTCGA CAAGTTCTAC
GGGTTCATGG CCGCCGAGAT GAACCACTGG TATCCGCAGC TGTATGAGGG CACGACGCCG
GTGGAGCCGG ATCGGTTGCC AGAGGACGGT TACCACCTGT CGGAGGATCT CGTCGACCAC
GCGATCGACT GGGTGGGTAC GCAGCAGGCG ATGACGCCGG ACAAACCGTT CTTGACGTAT
CTGGCGTTTG GCGCGACGCA CGCCCCGTTC CATGTCGCGC CACAGTGGTC ACAGGAGTAC
GCGGGCCGGT TCGACGTCGG CTGGGACGCG ATCCGCACGC AGACCCTGGC GCGGCAGGAG
GAACTGGGTA TCGTGCCCAC CGGCACCGAG CTGGGGCCGT GGGCGGAGGG TGTGCCGCAC
TGGGACGAGC TGGACGAGGC GGAGCGGCGG GTCGCGGCGC GGTTCATGGA GGTGTACGCG
GGTTTCGCCG AGCACACCGA CGCGCAGGTC GGCAGGTTCG TCAACGCGTT GGAGGCCATG
GGGGTCCTGG ACGACACGTT GTTCGTGTAC ATGCTGGGCG ACAACGGTGC CTCGGGTGAG
GGCGGGATCG AGGGGACGTT TCGGGAGCAT CTGGTGGGGC ACGGCCTGGC CGACGACACG
GCCTATATGG ACGCCCGGCT GGATCAGCTG GGCACCGCGT CCACGTATCC GTTGTACCCG
GTGGGGTGGG CGTTGGCGAT GAACACGCCG TACCAGTGGA CGAAACAGGT CGCCTCGCAC
TACGGCGGCA CCCGTGACGG TCTGATCGTG CGGTGGGGCA ACGGTATCGC CGCGCGGGGG
CAGGTGCGGC ACCAGTGGCA TCACGTGATC GACGTGCTGC CGACCATTCT GGACGCCGCC
GGTGTGCCGC ATCCCGATAC GGTCAACGGG TTCGCGCAGC AGCCGATCGA GGGCGTCAGC
ATGCGTTACA GCTTCGACGA CGCAGCCGCG CCGGATCGGC GCACGACCCA GTACTTCGAG
ATGGTCGGCA ATCGCGGGAT CTACCACGAG GGTTGGACCG CCGTGACCCG TCACGGCACG
CCGTGGCTGA TGGTCCCCGA TGGACAGCGT TCCTTCGACG ACGACGTGTG GGAGTTGTAC
GACACCGCGA CCGACTGGAG TCAGGCGCAC GATCTGGCCG CGAAGTATCC CGACAAGCTG
CGGCAGTTGC GGGAGTTGTT CACCGTCGAG GCCACGCGCT ATCACGTGTT CCCGTTGGAC
GATCGGGTGA CCGAGCGCGA GAACCCGGCG CTGGCGGGCC GGTTGGATCT GCACCACGGG
CGGACGTCGA TCGTGTTGGG GCCGACGGCC AAGCGGCTCG GTGAGGACGC GGCCCCGGAC
GTGAAGAACC GTTCGCACGT CATCACCGTC GATCTGGAGG CCGATGTGGA TGCGGGCGGG
GTGCTGGTAG CGCAGGGCGG CCGGTTCGGC GGCTGGTCTT TGTACTGTGT GGACGGGACG
GTCGCGTACG CGTACAACCG TTACGGCCTG GATCGCACCG TGGTGCGATC CAGGCGTCGG
ATGTCCCCCG GTTCGCACAC CGTGGCGATG CGGTTCGACT ACGACGGCGG GGCGCCCGGC
AGCGGCGCCG GGATCACCGT GGAGGTCGAC GGCACCGCGG TGGCGAAGGG CCGTGTGGAG
GCCACGACGG CGTACTACTT CAGTTTCGAC GAGTCGCTCA ACGTGGGCGT CGACCGGGGC
ACACCGGTCA GCGAGGACTA TCCGCCGGTG CGCAACGCCT TCACCGGCCG CATCGAGCGG
GTGCGTTTCG ACCTGGGCGC GGACGCCTCG CCGCAGTCGG AGGCGGAGCG GTTGCGGGCG
CGGCTGGCTC ACCAGTGA
 
Protein sequence
MCVMTAPDRT RLPIDVRSAT RDAQSATGAA TVSPPEPVVP VRPPSGAPNV VVVLVDDMGF 
GASSPYGGPC RMPTAQRLAD EGLRYSRFHV TALCSPTRQA LLTGRNHHSV GMGVTSEMTT
PQPGYTGYRP PSAATMAQIL GGNGYSTAAF GKWHQTPPAE VSPSGPFTRW PTGEGFDKFY
GFMAAEMNHW YPQLYEGTTP VEPDRLPEDG YHLSEDLVDH AIDWVGTQQA MTPDKPFLTY
LAFGATHAPF HVAPQWSQEY AGRFDVGWDA IRTQTLARQE ELGIVPTGTE LGPWAEGVPH
WDELDEAERR VAARFMEVYA GFAEHTDAQV GRFVNALEAM GVLDDTLFVY MLGDNGASGE
GGIEGTFREH LVGHGLADDT AYMDARLDQL GTASTYPLYP VGWALAMNTP YQWTKQVASH
YGGTRDGLIV RWGNGIAARG QVRHQWHHVI DVLPTILDAA GVPHPDTVNG FAQQPIEGVS
MRYSFDDAAA PDRRTTQYFE MVGNRGIYHE GWTAVTRHGT PWLMVPDGQR SFDDDVWELY
DTATDWSQAH DLAAKYPDKL RQLRELFTVE ATRYHVFPLD DRVTERENPA LAGRLDLHHG
RTSIVLGPTA KRLGEDAAPD VKNRSHVITV DLEADVDAGG VLVAQGGRFG GWSLYCVDGT
VAYAYNRYGL DRTVVRSRRR MSPGSHTVAM RFDYDGGAPG SGAGITVEVD GTAVAKGRVE
ATTAYYFSFD ESLNVGVDRG TPVSEDYPPV RNAFTGRIER VRFDLGADAS PQSEAERLRA
RLAHQ