Gene Snas_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0223 
Symbol 
ID8881401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp238770 
End bp240560 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content68% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003509035 
Protein GI291297757 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.221724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCC CACCCGCCCG GCGCCTCCTG GTGGCCGTCG GCACGGTCAC GGCCATCACG 
GCCACCACCG TGCTCGCCGG CGCCCCGGCC GCGCACGCGG CCGAAGCCGA GATCGTCAAC
AGCGACGCAT CCGGCGTCAT CGAGGACCGC TACATCGCGG TCCTGCGCGA CAGCGCGGTC
AAAGCCACCC AGAAGTCGGT CGACGCCAAG GCCGAAACGC TCGCCGACAA GTACCACGGT
GAGATCGACC GCACCTACCA CGCCACCATC CGCGGTTTCG CCACCACCAT GGACGAATCC
GACGCGAAGC GGCTGGCGGC CGACCCGGTC GTCGACTACG TCGAGACCGT CCGCGAGGTC
AAACTGGCCG AGGACCAGAC CAACCCGCCC AGCTGGGGTC TGGACCGCGT CGACCAGAAC
AACCTGCCGC TGGACAAGAA GTTCAGCTAC CCGACCCAGG CCGGAGACGG CGTCACGGTC
TACGTCCTCG ACACCGGGGT TCGACTGAGC CACCAGACTT TCGGCGGCCG CGCGAAGTCG
GGCTACGACT TCATCGACAA CGACGCCAAC GCCTCCGACT GCCACGGACA CGGCACCCAC
GTCGCCGGAA CGACCGCCGG AAGCCAGTAT GGTCTCGCCA AGAAGGCCGA CGTCGTGTCG
GTACGGGTCC TCGACTGCCA GGGCTTCGGC GACAACGCCC TCATCGCCGA CGGCATCGAC
TGGGTCACCG AGCACGCCGT CAAACCGGCG GTCGCCAACA TGAGCCTCGG TGACACCCAG
CCCAGCCCGG TCATGGAGGA CGCCGTGCAG CGTTCCATCG ACGCCGGGAT CCAGTACTCG
CTGGCGGCCG GGAACAACAG CGGCGACGCC TGCAGTTTCT CCCCGGCGCG GCTGCCCGCG
GCCGTGACCG TCGGATCAAC CGCTGAATCC GACGGCCGCA GCTCGTTCAG CAACTACGGC
CGCTGCCTCG ACCTGTTCGC GCCCGGCTCG AACATCGTGT CCTCGGCCAA CTCCGGCGAC
TCCGGCCAGG CCACCATGAG CGGCACCTCG ATGGCCGCGC CACACGTGGC CGGGGCGATC
GCGCTCTACC TGGGACAGAA CCCGAACGCG ACCCCGCAAC AGGTTCGCGA CGCCATCGTC
ACCAACGGCA CCGCCGGAAA GGTCACCAAC CCGGGCTCGG GTTCGCCGAA TGTCCTGCTG
TACAGCGGTT TCATCAACGA ACCACCCGCC GAGCACGACT TCAGCATCGC CGCCGACCCC
GGCTCGGCGA CCGTCGAACC CGGACAGTCG GCCAAGACCA CGGTGTCCAC GAAGGTCACC
AAGGGACAGG CCCAGCAGCT GAAACTCTCC GCCACCGGAC TGCCGTCCGG CGCCCAGGCC
AGCTTCGACC CGGCCGCCAT CGCCTCCGGC GAGAGCTCAC AACTGTCGAT CGCGACCTCG
TCCGGCACCC CCAAGGGCTC GTACCCGGTG ACCATCACCG CCGAAGGCAC CGAAGCCACC
CGCACCGCGA GCTTCACGCT CCAGGTAGGA GCCGACGGCG GCAACAAACA ACCGATCGCC
GACTTCACCT CGAACTGCTT CGCCGGTATC GGCTACTGCT TCTTCGACGG CAACGGCTCC
TCCGACCCGG ACGGCTCCGT CGCCAGCTAC AAGTGGAACT TCGGTGACGG CACCACCGGC
ACCGGAGCCG CACCCTTCCA CCGCTACTCA CCCGGAACCT ACAAGGTGAC CCTCACGGTC
ACCGACAACA AGGGAGCGAC CGGATCGGTC ACCAAGACCG TCACAGTCTG A
 
Protein sequence
MKLPPARRLL VAVGTVTAIT ATTVLAGAPA AHAAEAEIVN SDASGVIEDR YIAVLRDSAV 
KATQKSVDAK AETLADKYHG EIDRTYHATI RGFATTMDES DAKRLAADPV VDYVETVREV
KLAEDQTNPP SWGLDRVDQN NLPLDKKFSY PTQAGDGVTV YVLDTGVRLS HQTFGGRAKS
GYDFIDNDAN ASDCHGHGTH VAGTTAGSQY GLAKKADVVS VRVLDCQGFG DNALIADGID
WVTEHAVKPA VANMSLGDTQ PSPVMEDAVQ RSIDAGIQYS LAAGNNSGDA CSFSPARLPA
AVTVGSTAES DGRSSFSNYG RCLDLFAPGS NIVSSANSGD SGQATMSGTS MAAPHVAGAI
ALYLGQNPNA TPQQVRDAIV TNGTAGKVTN PGSGSPNVLL YSGFINEPPA EHDFSIAADP
GSATVEPGQS AKTTVSTKVT KGQAQQLKLS ATGLPSGAQA SFDPAAIASG ESSQLSIATS
SGTPKGSYPV TITAEGTEAT RTASFTLQVG ADGGNKQPIA DFTSNCFAGI GYCFFDGNGS
SDPDGSVASY KWNFGDGTTG TGAAPFHRYS PGTYKVTLTV TDNKGATGSV TKTVTV