Gene Ksed_21640 details

Gene Information

Locus tag: Ksed_21640
Symbol:
ID: 8373668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 2243428
End bp: 2244681
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 72%
IMG OID: 644992410
Product: trypsin-like serine protease with C-terminal PDZ domain
Protein accession: YP_003149916
Protein GI: 256825956
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
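
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Gene Information fields above.
start_bp = 2243428
end_bp = 2244681

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which contributes no amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1254 417"
```

Both values agree with the Gene Length and Protein Length fields.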


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.00345268
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0571371
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGC ACGGGCAGGA CGAGCCGTAC GGCCAGGGTG ACCAGTACCC GCCCTACAGC 
TCGTCGAGGC AGTACGAGGA CGCCCCGCAG CGTTCGCGTG GCTGGTGGCA GGTGCCCACC
GCCGCGGTGC TGTCGGCTGC CCTGGCCACC GCCGGTACCT GGACGCTGGC GGAGAACGGA
GTCATCGGTT CCGGGGGCAG CACCTCCCCC TCGGAGAGCC AGGTGGCGCA GGGCGAGCCC
GCCAGCGACC GCCAGGACGG CGCCGGTGAG TCCGACGGCT CCGGTGACGG CGCCGAGTCG
GGCGACGGCG CGAAGGCCGC CCCCGTGGCG ACCGCCGACG GTGTCGACTG GTCCGGGGTG
GCCGAGCAGG TCTCCCCCAG CGTGGTCGCC ATCTCGGTGG CCAGTGCCAC GGCGGGTGGG
TCGGGCTCCG GCGTGATCCT CGACGAGCAG GGCCACGTGG TCACCAACGA CCACGTCGTC
AGCGGCGCTC AGGACATCCG GGTGACCATC GGCGACAACC GGGCGTACGA CGCCACCGTC
GTGGGCACCG ACCCTGAGAC GGACCTCGCG GTCCTGAAGA TCGACCAGGC GCCCGAGGAC
CTGCAGCCGA TCACGGTCGG GGACGACAAG GAGCTCAACG TCGGCGACCC TGTGATGGCT
GTGGGTAACC CGCTGGGCCT CTCGGGCACC GTGACCACCG GCATCGTCAG CGCGCTGGAC
CGCCCGGTCC GGGCCGGTGA CGCCGAGACC CAGGTGGTGA CGAACGCGGT GCAGACCTCC
GCGGCGATCA ACCCGGGAAA CTCCGGCGGC GCGCTGGTGA ACTCCGCCGG CGAGCTGGTG
GGCATCAACT CCAGCATCGC CACCCTGGGC TCCAACGGCC AGGAGGGCGG CAACATCGGC
ATCGGGTTCG CCATCACGGC CACCCAGATG AAGAACGTCA CCAGCGAGCT CATCGAGACC
GGCAAGGCCA CCCACGCGCA GCTGGGCGTC CGGGTCACCG ACGCCACGGT GCAGGTCGAC
GGCGCCCACG TGAACGGCGC CGGCATCGCG TCGGTGGAGC CGAACACCGC CGCGGCCGAG
GCCGGGCTCG AGGAGGGCGA CGTGGTGGTG GCCATCGACG GTGAGTCGGT GGACAGCATG
TGGGCCCTCA TCGCCCAGAT GCACGAACGA GCCGTCGGCG AGACCGCCAC CGTCACGGTG
GTGCGCGACG GCGAGCGCCA GGACGTGGAG GTGACCGCCG GCGCCAAGGA ATGA
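
The GC content reported under Gene Information can be recomputed from a sequence laid out like the one above. The following is a minimal sketch, assuming the sequence is supplied as text in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, not a function of the source database. It is demonstrated on the first ten-base block only, so the printed value describes that block and not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First ten-base block of the gene sequence above.
first_block = "ATGACCCAGC"
print(gc_content(first_block))  # prints 60.0
```

Passing the full gene sequence as one string gives the gene-wide percentage.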
 
Protein sequence
MTQHGQDEPY GQGDQYPPYS SSRQYEDAPQ RSRGWWQVPT AAVLSAALAT AGTWTLAENG 
VIGSGGSTSP SESQVAQGEP ASDRQDGAGE SDGSGDGAES GDGAKAAPVA TADGVDWSGV
AEQVSPSVVA ISVASATAGG SGSGVILDEQ GHVVTNDHVV SGAQDIRVTI GDNRAYDATV
VGTDPETDLA VLKIDQAPED LQPITVGDDK ELNVGDPVMA VGNPLGLSGT VTTGIVSALD
RPVRAGDAET QVVTNAVQTS AAINPGNSGG ALVNSAGELV GINSSIATLG SNGQEGGNIG
IGFAITATQM KNVTSELIET GKATHAQLGV RVTDATVQVD GAHVNGAGIA SVEPNTAAAE
AGLEEGDVVV AIDGESVDSM WALIAQMHER AVGETATVTV VRDGERQDVE VTAGAKE
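
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of the standard genetic code, which NCBI translation table 11 shares; it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. `translate` and the table constants are illustrative names.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display formatting.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three ten-base blocks of the gene sequence above.
print(translate("ATGACCCAGC ACGGGCAGGA CGAGCCGTAC"))  # prints MTQHGQDEPY
```

The output matches the first ten residues of the protein sequence above.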