Gene Information |
Locus tag | Ksed_16430 |
Symbol | |
ID | 8373149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | - |
Start bp | 1691810 |
End bp | 1693990 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644991911 |
Product | protease II |
Protein accession | YP_003149423 |
Protein GI | 256825463 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
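The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive start and end coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Ksed_16430.
length_bp = gene_length(1691810, 1693990)
print(length_bp)                  # 2181
print(protein_length(length_bp))  # 726
```

Both results agree with the Gene Length (2181 bp) and Protein Length (726 aa) rows.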
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.33763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0433895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCA GCATCCCCTC CCAGCCCCCG GTCGCACCCC GCCGCCCCCT CGTCCGCGAG CACCACGGCC AGGTCTTCAC CGACCCGTGG GAGTGGTTGC GCGACACCGA GGACCCCCAG GTGGTGGCCC ACCTGGAGGC GGAAAACGCC TGGACCGACG AGCAGCTCGC CGACCAGGCG CCCCTGCGGG AGACCCTCTA CGGTGAGATC GCCGGCCGCA CCCAGCAGAC CGACGTCTCG GTGCCCGCCC GCCACGGTCA GTGGTGGTAC TACACGCGCA CCGTGGAGGG CCAGCAGTAC CCGATCCACG CCCGGGTGGC GGCCACGGGC GAGTACGCCG CGGACCGCCC CCGGCTCGAT GCGGGAAACG TCCCAGCGGG CGAACAGGTC CTGCTCGACG GCAACGCAGA GGCCGGGGAC TCCACCTTCT TCTCCCTGGG CGGTTTCACC GTCAGCCGCG ACGACGCCCG GCTGGCCTTC GCGACCGACA CCACCGGCGA CGAACGGTTC GACGTCACCG TGGTGGACCT GGCGACCGGC GAGCGCATCG ACGAGTCGAT CACCGGGGTC GGCTACGGCC TGTGTTTCAG CCACGACGCC AGCCAGTTGT TCTACGTGCG CGTCGACGAG TCCTGGCGCC CCCACGAGGT GTGGCGCCAC ACCATCGGCG CCGACCCGGA CACCGACGAG CTGCTGCACA CCGAGACCGA CGAGGCCTGG TGGATGGGGC TGGACTCCTC GGGCGACGAG CGCTGGGTGC TCATCGGCAA GGCGTCCTCG GACTCCAGCG AGTGGTGGCT GGTGGACGCC ACCGACCCCA CCAGCCAGCC GCGGGTTGTG CAGCCGCGCC GCGACCGCTT GGAGTACGAC GTCGACGTGG CCGGTGAGGA CCTCCTGCTG GTGCACACCG TGAACACCCC CGAGGGGGAG CTCGCCACCG CCACGGTCGA CGCGCCGGGG CTCGAGCACT GGCGGCAGGT CACCCCACCG GCCGACGGCG AGCGCCTGCT CGGCTGCGAG CTGTTCGAGC ACTTCGCGCT GGTGACCCTG CGCCGCGACG GGCTCACGGG CCTGCGCGTG CTGCCCCGCG GGGTGGACGC CGGCGGGGCC CCCCCGACCG ACGACGCTGG CGGCCTGACC CTGGGCGCCG CGGTGGACGT GGCCCCCGAC GAGGCGCTCT ACAGCATCGG CGCCGGCGCG AACCCGGAGT TCACCGCCAC CACGCTGCAG GTGGTCACCG AGTCCCTGAT CACCCCGCCG ACCACCAGCG AGTACGACCT GGCGCCGCTC CTGGCCGGTG GCCCGCTGCC CGCACCCACG GTGCTGCGCC AGCAGCCCGT CCTCGGCGGG TACGACCCCG GCGCCTACAC CCAGGAGCGG CTCTGGGCCA CCGCGGCCGA CGACACCCGC ATCCCCATCT CGGTGGTGCG CCGCAGCGAC CTCGAGCCCG ACGGCACCGC ACCCGGGCTG CTCACCGGCT ACGGGGCCTA CGAGGTCTCC AGCGACCCGG ACTTCCGCAT CAGCCGACTG TCCCTGCTCG ACCGGGGCGT CGTCTTCGCC ATCGCCCACG TGCGCGGCGG TGGCGAGATG GGGCGCGCCT GGTACGAGAG CGGTCGCATG GAGCACAAGG CCAACAGCTT CACCGACCTC AACGCGTGCG CGCACGCCCT CATCGATGCC GGGTGGGTCG ATGCGGGGCG CCTCGCCGTC GAGGGTGGGT CGGCCGGGGG TCTGCTGCTC GGTGCGGCGA TCAACCTGGA GCCGGACCTG TACCGGGCGG CCCACCTGGC CGTGCCCTTT 
GTGGACGCAC TGACCACCAT CCTGAACCCG GAGCTGCCGC TGACGGTGGG GGAGTGGACC GAGTGGGGCA ACCCGCTGGC GGACCCGCAG GTCTACGAGG GCATGGCCGC CTACACGCCG TACGAGAACG TGCGTGCGGA GCAGTACCCG GCGATGCTGG CGACCACCAG CCTCAACGAC ACCCGTGTGG GCTGCGGGGA GCCGACCAAG TGGGTGCAGA TGGTGCGCTC CCGGGCGACG AACGACCCCA TCGAGCGGCC CGTGCTGCTG CGCACCGAGA TGGTGGCGGG CCACGGCGGG CGCTCCGGGC GCTACGACGC CTGGCGCGAC CGGGCGCTCG AGCTGGCCTT CCTGCTCACC CACATCGGGG TCGATCGGTG A
|
Protein sequence | MSASIPSQPP VAPRRPLVRE HHGQVFTDPW EWLRDTEDPQ VVAHLEAENA WTDEQLADQA PLRETLYGEI AGRTQQTDVS VPARHGQWWY YTRTVEGQQY PIHARVAATG EYAADRPRLD AGNVPAGEQV LLDGNAEAGD STFFSLGGFT VSRDDARLAF ATDTTGDERF DVTVVDLATG ERIDESITGV GYGLCFSHDA SQLFYVRVDE SWRPHEVWRH TIGADPDTDE LLHTETDEAW WMGLDSSGDE RWVLIGKASS DSSEWWLVDA TDPTSQPRVV QPRRDRLEYD VDVAGEDLLL VHTVNTPEGE LATATVDAPG LEHWRQVTPP ADGERLLGCE LFEHFALVTL RRDGLTGLRV LPRGVDAGGA PPTDDAGGLT LGAAVDVAPD EALYSIGAGA NPEFTATTLQ VVTESLITPP TTSEYDLAPL LAGGPLPAPT VLRQQPVLGG YDPGAYTQER LWATAADDTR IPISVVRRSD LEPDGTAPGL LTGYGAYEVS SDPDFRISRL SLLDRGVVFA IAHVRGGGEM GRAWYESGRM EHKANSFTDL NACAHALIDA GWVDAGRLAV EGGSAGGLLL GAAINLEPDL YRAAHLAVPF VDALTTILNP ELPLTVGEWT EWGNPLADPQ VYEGMAAYTP YENVRAEQYP AMLATTSLND TRVGCGEPTK WVQMVRSRAT NDPIERPVLL RTEMVAGHGG RSGRYDAWRD RALELAFLLT HIGVDR
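The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the protein translation could be recomputed from the gene sequence. It assumes the gene sequence is given in coding orientation, which is consistent with it starting with ATG and ending with TGA. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence shown above.
print(translate("ATGTCTGCCA GCATCCCCTC C"))  # MSASIPS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_content` can be compared with the GC content row.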