Gene Information |
Locus tag | Snas_1004 |
Symbol | |
ID | 8882189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 1064769 |
End bp | 1065959 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003509807 |
Protein GI | 291298529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
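The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1064769, 1065959)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both values match the Gene Length and Protein Length rows for Snas_1004.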
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.774887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.176655 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCACC AGGTTCGGTT CGGTCTCGGG GCTGCCGCAG CGCTTCTTGT GGTCGCGGGG CTTCCCGTTC CCGCCCATGC GGATGAGGTC CGTGATGACC AGTGGATGTT GAACGCCTTG GGAATCGAAC AGGCCCATAA GGAGACCAGA GGCGCTGGCG TGACGATTGG GATTGTGGAC TCCGGTGTGG ACGCCACACA TCCTGACCTC AAAGGGAACG TTGAGGCGGG GCAGGCGTCT TGGGAGGGCG GCAAGGATGG CCTGAAGGAC ACCATGGGCC ACGGGACCGC CATGGCCTCG ATACTCGTCG GACATGGTCA CGGCGACGGA GGTGAAGACG GTGTCCTGGG TATCGCACCC GAGGCCAAGG TGAAATCAGT ATCGATCTAT CCGAGCAGCG ATCCTCGCGA TGACCCACGC GGCTCACATG ACCGCATGGT CGAAGGTATC CGGTGGCTGG CCGACGAAGG CGTGGACGTC ATCTCCGTTT CCCAGGGCGG GGCTGGATCC GACGCATTGG AAGAAGCCGT CAAATATGCC GTTGAGGAGA AAGGTATCCC TCTCGTCGCC TCGGCTGGCA ATACGGCAGG GGGGCCAACG GGCGACGTCG TGGTGCAGGC TCCGGCTGTG TACGACAATG TTTTCAGTGC CACCGGAACG ACCAAGCAAG GCAAGTTCTG GGATGGCTCC GTAGAGGGGA CTTCCCCTGA CGACGTCACT GTCGCAGCAC CTGCCGAGGA TGTGGTGCAC GCGTGGAACG ACCGAGGTTA TGACGACAAT TCGGGAACCT CGGATTCCGC GGCGATCGTG GCGGGCACGA TCGCGTTGAT GAAGGCCCAG TGGCCCGACA TGTCGCGCGA GACCATTGAA TGGCGGCTGA CCGAAACAGC TGACGAAAAA GGCAAGGACG GGCCGGACAC GAAATATGGC TTCGGCATTG TCAATCCTGC CGAAGCGTTG ACGGCACACG TTGACCCTCC CGATGGGGTT TCCGACGAGG AGATCAACCC GGAACCCAAC CCGAAGGCGA GTGCCTCGCC CAGTCCCTCC AAGGATGACG GGGCGTTGAC GGCCTCCGAT TCCGGTGCGG GGCCGGTTGT CTGGATCGTT GTCGCCGTCG TCGCCATCCT GGCCGCCGCT GTCGTCAGCT TCATCCTCAT CCGCCGCCGC AGACAACCGC CCGCTGCCTA G
|
Protein sequence | MRHQVRFGLG AAAALLVVAG LPVPAHADEV RDDQWMLNAL GIEQAHKETR GAGVTIGIVD SGVDATHPDL KGNVEAGQAS WEGGKDGLKD TMGHGTAMAS ILVGHGHGDG GEDGVLGIAP EAKVKSVSIY PSSDPRDDPR GSHDRMVEGI RWLADEGVDV ISVSQGGAGS DALEEAVKYA VEEKGIPLVA SAGNTAGGPT GDVVVQAPAV YDNVFSATGT TKQGKFWDGS VEGTSPDDVT VAAPAEDVVH AWNDRGYDDN SGTSDSAAIV AGTIALMKAQ WPDMSRETIE WRLTETADEK GKDGPDTKYG FGIVNPAEAL TAHVDPPDGV SDEEINPEPN PKASASPSPS KDDGALTASD SGAGPVVWIV VAVVAILAAA VVSFILIRRR RQPPAA
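The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal Python sketch. It assumes the sequence is given on the coding strand in 10-base blocks separated by spaces, as shown here. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the set of permitted start codons), so a standard codon table is used; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
prefix = "ATGCGTCACC AGGTTCGGTT CGGTCTCGGG"
print(translate(prefix))  # MRHQVRFGLG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be checked in the same way.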