Gene Snas_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3902 
Symbol 
ID8885102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4169415 
End bp4170983 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content65% 
IMG OID 
Productsulfatase 
Protein accessionYP_003512650 
Protein GI291301372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGC GCAAGAGTCT GCTGGCGGTG GCGATCGCCG TGTTCGCGAT CGTGTCCGCC 
ACCGTCGGTG CCGTCGTCGT CTTCGGCGAC GACGCGACGT CCAACAAGGA CAAACCGAAC
ATCATCTACT TCCTCGTCGA CGACATGTCG GCGGATCTGC TGCCGTACAT GGACACGGTG
AGCTCGCTGG CCGACGGCGG CACCAAGTTC GACAACTATT TCGTGTCGAA CTCGCTGTGC
TGCACCTCGC GGGCGACCAT GTTCACCGGC CAGCACCCCC ACAACTCCGG CGTGCTGGGC
AACACCCCCG AGGACCACGG CGGCTACGAG TACTTCGAAC CCCTGGAGGA CGAGACCTAC
GCCAAGCAGA TACAGGACAC AGGGGACTAC CACACCGCCT ACCTCGGCAA GTACCTCAAC
GGGTACAAGA TGAAGGAGGG CTACAAGGTC CCGGCCGGTT GGGACGAATG GCACGTCGCC
GACGGCGGCG GCTACAACGA ATACGACTAC AAGCTCAGCG AGTACACCGG CGGCGACGAC
AAACCCATCA GCGACGGCGA CGGGAAGTAC CTGGTGGACC TGATGGCCGA CCGCGCCGTC
GAGTCCATCG ACCGCTCCCG CGACGCCGAG AAGTCGTTCT TCGTACAGGT GGCACCGTTC
TCACCGCACT CCGGCGTCGG CAAGGACGGC GGCCCGCGGT TCCCACCGGC CAAACGCGAC
CGCCCCGGTG CCGACGAGAA GCACGGCGAC TGCGGCAAGG TCGACTGCGC CGACCTCGAC
GTCACAAAGC TTCCGGGATT CAACGAGGAC ACGAAGGACA AACCCGACTG GGTTCGCCAG
AAACCACTCA CCGACAAGGA GATCAAGGAA CTCAACCGCG ACTTCCGCAA CCGCGCCCGG
ATGGTGCAGT CGGTGGACGA CATGGTGGAG AAGGTGACCA AGTCGCTGTC ACAGTCCGAA
CTGGACAACA CCTACATCAT GTTTGGCTCC GACAACGGAT TCCACCTCGG ACAGCACCGG
CTCATGCGCG GCAAGACCAC CGCCTACGAC CACGACGTGC GCACCCCGTT CCTGGTGAAA
CGCCCCGGCT CCTCCGGCGG CGACTCGATC AAGAGCGACG AGATCGTCCA GAACGTCGAC
CTGTACCCGA CGCTGATCGA CATCGCCAAC GGCGACGAGG ACGGCCCGAC CGACCGCGAC
GGCCGCAGCC TGCGGCGGCT CATAGACGGC GAGAAGGAAC CCGACTGGCG AAACGCGGCA
TACGTCGAGC ACTACAAGTC CCCGAAACCG GGAACCGGCG ACCCCGACGC CGAGGACCTC
GGTCCCAAGA AGGGCAACTC GTCTCCGCCG ACCTACGACG CGATCCGCAC CGCCCAGGAC
CTGCTCGTCG ACTACAAGGG ATACGAGCAA CCGGAGTTCT ACGACCTGGA CGCCGACCCC
TACCAGCTCG ACAACAAACC GGACGACCCC CGAGCCGACG AGCTGAAGGA CCCGCTCGCC
GATCTGGCCA ACTGCGGCAA GAAGGGCCAC CCCGACTGCT GGGAGGCCGC CCACATCGGA
GCCGACTGA
 
Protein sequence
MRLRKSLLAV AIAVFAIVSA TVGAVVVFGD DATSNKDKPN IIYFLVDDMS ADLLPYMDTV 
SSLADGGTKF DNYFVSNSLC CTSRATMFTG QHPHNSGVLG NTPEDHGGYE YFEPLEDETY
AKQIQDTGDY HTAYLGKYLN GYKMKEGYKV PAGWDEWHVA DGGGYNEYDY KLSEYTGGDD
KPISDGDGKY LVDLMADRAV ESIDRSRDAE KSFFVQVAPF SPHSGVGKDG GPRFPPAKRD
RPGADEKHGD CGKVDCADLD VTKLPGFNED TKDKPDWVRQ KPLTDKEIKE LNRDFRNRAR
MVQSVDDMVE KVTKSLSQSE LDNTYIMFGS DNGFHLGQHR LMRGKTTAYD HDVRTPFLVK
RPGSSGGDSI KSDEIVQNVD LYPTLIDIAN GDEDGPTDRD GRSLRRLIDG EKEPDWRNAA
YVEHYKSPKP GTGDPDAEDL GPKKGNSSPP TYDAIRTAQD LLVDYKGYEQ PEFYDLDADP
YQLDNKPDDP RADELKDPLA DLANCGKKGH PDCWEAAHIG AD