Gene Snas_1004 details

Gene Information

Locus tag: Snas_1004
Symbol:
ID: 8882189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 1064769
End bp: 1065959
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: peptidase S8 and S53 subtilisin kexin sedolisin
Protein accession: YP_003509807
Protein GI: 291298529
COG category:
COG ID:
TIGRFAM ID:
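
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python. It assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon of the CDS is a stop codon not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1064769, 1065959)
print(length, protein_length(length))  # 1191 396
```

Under these assumptions the computed values agree with the reported Gene Length (1191 bp) and Protein Length (396 aa).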


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.774887
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.176655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCACC AGGTTCGGTT CGGTCTCGGG GCTGCCGCAG CGCTTCTTGT GGTCGCGGGG 
CTTCCCGTTC CCGCCCATGC GGATGAGGTC CGTGATGACC AGTGGATGTT GAACGCCTTG
GGAATCGAAC AGGCCCATAA GGAGACCAGA GGCGCTGGCG TGACGATTGG GATTGTGGAC
TCCGGTGTGG ACGCCACACA TCCTGACCTC AAAGGGAACG TTGAGGCGGG GCAGGCGTCT
TGGGAGGGCG GCAAGGATGG CCTGAAGGAC ACCATGGGCC ACGGGACCGC CATGGCCTCG
ATACTCGTCG GACATGGTCA CGGCGACGGA GGTGAAGACG GTGTCCTGGG TATCGCACCC
GAGGCCAAGG TGAAATCAGT ATCGATCTAT CCGAGCAGCG ATCCTCGCGA TGACCCACGC
GGCTCACATG ACCGCATGGT CGAAGGTATC CGGTGGCTGG CCGACGAAGG CGTGGACGTC
ATCTCCGTTT CCCAGGGCGG GGCTGGATCC GACGCATTGG AAGAAGCCGT CAAATATGCC
GTTGAGGAGA AAGGTATCCC TCTCGTCGCC TCGGCTGGCA ATACGGCAGG GGGGCCAACG
GGCGACGTCG TGGTGCAGGC TCCGGCTGTG TACGACAATG TTTTCAGTGC CACCGGAACG
ACCAAGCAAG GCAAGTTCTG GGATGGCTCC GTAGAGGGGA CTTCCCCTGA CGACGTCACT
GTCGCAGCAC CTGCCGAGGA TGTGGTGCAC GCGTGGAACG ACCGAGGTTA TGACGACAAT
TCGGGAACCT CGGATTCCGC GGCGATCGTG GCGGGCACGA TCGCGTTGAT GAAGGCCCAG
TGGCCCGACA TGTCGCGCGA GACCATTGAA TGGCGGCTGA CCGAAACAGC TGACGAAAAA
GGCAAGGACG GGCCGGACAC GAAATATGGC TTCGGCATTG TCAATCCTGC CGAAGCGTTG
ACGGCACACG TTGACCCTCC CGATGGGGTT TCCGACGAGG AGATCAACCC GGAACCCAAC
CCGAAGGCGA GTGCCTCGCC CAGTCCCTCC AAGGATGACG GGGCGTTGAC GGCCTCCGAT
TCCGGTGCGG GGCCGGTTGT CTGGATCGTT GTCGCCGTCG TCGCCATCCT GGCCGCCGCT
GTCGTCAGCT TCATCCTCAT CCGCCGCCGC AGACAACCGC CCGCTGCCTA G
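
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute the GC fraction, which can then be compared with the reported GC content of 63%. The function names are hypothetical, and the example runs on only the first three blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used here only as a short example.
first_blocks = "ATGCGTCACC AGGTTCGGTT CGGTCTCGGG"
print(len(clean_sequence(first_blocks)))
print(round(gc_fraction(first_blocks), 3))
```

Pasting the full gene sequence into `gc_fraction` gives the value for the whole CDS.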
 
Protein sequence
MRHQVRFGLG AAAALLVVAG LPVPAHADEV RDDQWMLNAL GIEQAHKETR GAGVTIGIVD 
SGVDATHPDL KGNVEAGQAS WEGGKDGLKD TMGHGTAMAS ILVGHGHGDG GEDGVLGIAP
EAKVKSVSIY PSSDPRDDPR GSHDRMVEGI RWLADEGVDV ISVSQGGAGS DALEEAVKYA
VEEKGIPLVA SAGNTAGGPT GDVVVQAPAV YDNVFSATGT TKQGKFWDGS VEGTSPDDVT
VAAPAEDVVH AWNDRGYDDN SGTSDSAAIV AGTIALMKAQ WPDMSRETIE WRLTETADEK
GKDGPDTKYG FGIVNPAEAL TAHVDPPDGV SDEEINPEPN PKASASPSPS KDDGALTASD
SGAGPVVWIV VAVVAILAAA VVSFILIRRR RQPPAA
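
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). This gene starts with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGCGTCACC AGGTTCGGTT CGGTCTCGGG"))  # MRHQVRFGLG
```

The ten residues produced match the first block of the protein sequence above. Passing the full gene sequence to `translate` allows the same comparison for the whole protein.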