Gene Ksed_21910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_21910 
Symbol 
ID8373695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp2266358 
End bp2268118 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content70% 
IMG OID644992435 
Productsubtilisin-like serine protease 
Protein accessionYP_003149941 
Protein GI256825981 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.164253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACA CACGCGCCTC CTTCGGCTTC CTCGCCGCCG CCGTCGTCGG CCTCGGCACC 
ATGACCCCCG CCTCAGCCGG CCTCACCACC GAGCCGCACC TACAGCAGCC GCACCCCTCC
ACCGAGATCT CGCAGCAGGC CGAGGTCCTG TCCGTCGCGG ACGCCAACTA CATCGTGATG
CTCGAGCTGC CGTCGGCCGC CAAGCGCGGC CCGAACGCCA TGGCCAGCGC CCAGGGCAAG
GCCGCCGTGG CCGCCGCCAC CCAGAAGCAG GCCGACAAGT GGAGCGCGAA GGGCGTCAAG
GTCAAGCAGC GTTACGAGGC CCTCGGCGGC TTCAGCGCCC ACCTCACTCC CGCCCAGGTG
GAGGCGCTGC GCAACGACCC GGCCGTCTCC ACGGTGACCG AGAACAAGAT GGTCTCCATC
GACGCCACCC AGTACAGCGC CCCCTGGGGT CTGGACCGTG TCGACCAGGA CGACCTGCCG
CTGAACGGCA CCTACAACTA CACCGAGACC GGCCAGGGCG TCACCTCCTA CGTGCTGGAC
ACCGGCATCC TCGCCAACCA CTCCGACCTC GGCGGTCGGG TGCAGGCCGG CGTGACCGCC
ATCGACGACG GCCGCGGGTC GGCCGACTGC AACGGCCACG GAACCCACGT CGCCGGCACC
GTCGGCGGCA CCAGGTACGG CGTCGCCAAG GGCACCACCC TGGTCCCGGT CCGCGTGCTC
GGGTGCAACG GCAGCGGCTC GACGAACGGG ATCATTTCCG CCATGGACTG GGTGGCCCAG
AACAAGTCCG GCCCCTCGGT GGCCAACATG AGCCTCGGCG GCGGCGCGGA CGCGGCCACC
GACCAGGGCA TCGCCCGCAT GACCTCCGCC GGTGTCATCA CCGTCGTGGC GGCCGGCAAC
GACACCGACA ACGCGTGCAA CTACTCGCCC GCCCGCGCCT CCTCGGCCAT CACCGTCGGC
TCCACCGACA AGACCGACGG CCTGTCCTAT TTCTCCAACT ACGGCTCCTG CGTAGACATC
CTCGCGCCGG GCTCGGACAT CACGTCCGCC TGGTACACCA GCAGCAGCGC CACGAACACG
ATCTCGGGCA CCTCGATGGC GTCCCCGCAC GTGGCCGGCG CCGCGGCGCT CTACCTGCAG
AAGAACCCCA ACGCCAGCGT CTCGCAGGTG ACCAACGCCC TGACCTCCAC GGCCACCACC
AACACCATCA CCGGCGTCAA CGGGTCGCCC AACCGCTTCC TGGACACCAC GGCCCTGATG
GGTGGCGCCA CGCCCACCGA CCCCACTGAC CCCACCGACC CGACCGACCC CACGCCGGGC
ACCAGCCTGG TGAACGGTGA CTTCGAGCAG GGCAGCACCG GCTGGGGTGG CGCCACCTCG
GCCATCACCT CCGGCCGGTA CTCGGCCTAC AGCGGCAACT ACAAGGCGCT GCTGGGCGGC
AAGGGGTACA GCAACACCTC CATCCTGACC CAGCGGTTCA AGGTGCCCTC CAACGCGACC
TCCCTGCGCT TCGCGCTGAA CGTGCAGTCG GGTGAGTCGA CCTACAGGGC CTACGACCGC
TTCCAGGTGC AGGCGGTCGA CTCCAGCGGC AGCACCTCGG TGCTGGGCGA GTGGTCCAAC
CGCGACCAGT CGAGCACCTA CTCGCTGAAG ACGCTGGACA TCTCGCGCTA CGCGGGGCAG
ACGATCACCC TGCGGTTCGC CGCTCAGGAG GACGTCTCGG TGCAGACCTC GTTCAACGTG
GACGCCGTCA CGGTGCGGTG A
 
Protein sequence
MSHTRASFGF LAAAVVGLGT MTPASAGLTT EPHLQQPHPS TEISQQAEVL SVADANYIVM 
LELPSAAKRG PNAMASAQGK AAVAAATQKQ ADKWSAKGVK VKQRYEALGG FSAHLTPAQV
EALRNDPAVS TVTENKMVSI DATQYSAPWG LDRVDQDDLP LNGTYNYTET GQGVTSYVLD
TGILANHSDL GGRVQAGVTA IDDGRGSADC NGHGTHVAGT VGGTRYGVAK GTTLVPVRVL
GCNGSGSTNG IISAMDWVAQ NKSGPSVANM SLGGGADAAT DQGIARMTSA GVITVVAAGN
DTDNACNYSP ARASSAITVG STDKTDGLSY FSNYGSCVDI LAPGSDITSA WYTSSSATNT
ISGTSMASPH VAGAAALYLQ KNPNASVSQV TNALTSTATT NTITGVNGSP NRFLDTTALM
GGATPTDPTD PTDPTDPTPG TSLVNGDFEQ GSTGWGGATS AITSGRYSAY SGNYKALLGG
KGYSNTSILT QRFKVPSNAT SLRFALNVQS GESTYRAYDR FQVQAVDSSG STSVLGEWSN
RDQSSTYSLK TLDISRYAGQ TITLRFAAQE DVSVQTSFNV DAVTVR