Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ksed_16680 |
Symbol | |
ID | 8373174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | + |
Start bp | 1721689 |
End bp | 1724040 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644991936 |
Product | transglutaminase-like enzyme, predicted cysteine protease |
Protein accession | YP_003149448 |
Protein GI | 256825488 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.738671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0217109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCC CGACCCCGCA GGCCGCCCCG CGCCCTCCGC GGGCCCAGTC CCCCGGCGGT GGTCTGCTCC GCCGCCTCTG GGACCGCACC GAGCCCCTGT CGGCCGCCCT GGTGGCCGGC CTGTGCCTGC TGACCATCTG GCCGGCACGC CAGCTCTTCG AGGACGCCGG CTGGTGGGGT TCGGCCATCG TGCTCACCGT GGGTCTCGTG GCCGTCGCCG CCCTGGTGCG TCTGGTGACC GGGTCGCCCG TCCTGGCCTC GGTGGTCCAG TTCGTCACCG CGCTGGTCCT GCTGCTGCGG TCCTCCCTGG CCGACACGCT GTGGGCCGGC ATCGTCCCGA CACCCCGGAC CGCCAGCGCC ATCAGCGAGC ACGTCCGGGT GACCGGTCAG CTGCTCGCCG AGAGCGCGGC GCCGGTTCCT GCCCACCCAT CCCTCACGGT CACCCTCCTG AGCGTCATCG CGCTGCTCGT GCTGTTCACC GATGCGGCGG TCCACACCCT GCGCTCGGTG CTGCTGGGCG CCATCGCCCC ACTGCTGGTC TTCGTGATTC TGGCTGCGAA CCGCACCACC CACGAGCCGT GGTGGTGGTT CCTGCTGCTC GCCGCGGCCT GGGCCGGCCT GCTCGCACTC CACCACTCCG CCGAGACAGC GCCGGCCTCC GGGCAGGGGC GTGGCATCCT GGGTGCCCCC GGGCGGGGGG CGATGCTCAC CACGGCGGCC GCGCTGACCA CCCTGGGCGT CGTCGCGGCG CTGGCGATTC CCTCGGTGCT GCCTGAGCGC GAGCAACGTC TGGTCGGCCG CGGCCTGGCC ACGGACAGCT CGCTGGCCAC GGTCGACTTC ACCGAGACCC TCGACCTCGA GGCCGACCTG CGCAGTGACG ACGAGCGCCC CGTGCTCCTG TGGCACACCG AGAGCGATTC GCCCGGCCCG CTGCGCATCA CGGCCACCAA TCGCTTCGCC AACGACCGGT GGTCGCCGCA GGAGGGCCGC GCCGCCGCGG AGGTGTTGCC GGACCCCACG GCCGTGGACG GTGCGGTGCC CGACCGCCTG CCGCGGGTGG AGTGGTCCGG CGAGCTGGAT TCCACCGAGG AGGACTTCGC GGTGACCGCC AACGGGATCC CGACCCCCTT CGTGGCGACT CCCTCCTTCC CGGTGGACCT GAGTTCCCCC GTCGCGGTGA CGGGTGACCC GATCACGGGT GCGGTGTGGG TCGGCGAGGA CGCCAACCGC TATGAGGGCA CGGCGTTGGA GCCCACGGTG CCGGAGGAGC TGCCGGACCC CGCCGAGCCC CAGGGGCTGT CGGAGACGTA CACCGAGGTA CCCGAGGGGC TGCAGGACAC GATCTCGCAG CTGAACGCCG AGGTGCTCGA CCCCGACGAC GCGCCCCTGG ACAAGGCCCG GACCATGCAG TCCTTCCTGC GCAACGGGGA CTTCGAGTAC TCCCTCGACG CCGCCGAGCC GCAGGACGGC GAGTCGATGG TGCAGGCCTT CCTGCGCGAG AAGCGCGGGT ACTGCACGCA GTACGCCACC ACGATGATCA TGATGGCCCG CGAACAGGGC ATCCCCGCCC GCATGGCGAT CGGCATGCTG CCCGGCGAGC AGACCGCCAG CGACCTGGGC CGCGGGTCCG ACGTCGGCCC CGAGCGCGTG GTGCAGCGCA ACGACGCCCA CGCCTGGCCG GAGCTGTACT TCGAGGGCGT GGGGTGGCTG CGCTTCGAGC CGACCCCCTC CAGCCGGGCC GCTGCGGTGC CGGCCTACAG CCAGCCGGTG GGCGCCGACG CGAGCGCGTC GCCCTCCCCC TCCGAGGCGT CCCCGTCCCC GGCCTCGCCC TCCCCTTCCG AGGCGTCCCC CTCGCCGTCC CAGGCCTCGC CCTCCCCGTC GCCGTCGTCC GCCGAGGACG GCGACGACGA GGGTGGTTCC ACGGGCTGGT GGCGGGTGCT GCTGACGTGG TTGGCGGTCC TGCTGGTCGC CGCCCTGGCC CTGGCCTACC TGCCCTGGCG GGCGCGCCAG GCGCGCAAGC GGATCCGCGA GGGCGAGCAG TCGCCCTGGT CGGGCGCGTG GGAGGTGCTG CGCCTGGACC TGTTGGACCG GGGAGTGAGC ACCCTGCCCA CGGACTCCGT GCGCACCCAG GCCAGTGCGG TGCTGCGACA GCGTCCCGAC GTGGACGTCG ACACGCTCCA GGAGCTGGCC CACCGAGCCG AGGCCGCCCG ATACGCCCGA CCGTCCACGG ACGCCGGGGA CGCCGCCGCG GCCGACACGC TGCGGAAACA GCTGCTGACG TGGCTGGACC ACGACGAGTC CGCCGTCGAC CGGACCCGCC GGCGGCTGTT CCCGGCCTCC GCCACCCGCT GA
|
Protein sequence | MSTPTPQAAP RPPRAQSPGG GLLRRLWDRT EPLSAALVAG LCLLTIWPAR QLFEDAGWWG SAIVLTVGLV AVAALVRLVT GSPVLASVVQ FVTALVLLLR SSLADTLWAG IVPTPRTASA ISEHVRVTGQ LLAESAAPVP AHPSLTVTLL SVIALLVLFT DAAVHTLRSV LLGAIAPLLV FVILAANRTT HEPWWWFLLL AAAWAGLLAL HHSAETAPAS GQGRGILGAP GRGAMLTTAA ALTTLGVVAA LAIPSVLPER EQRLVGRGLA TDSSLATVDF TETLDLEADL RSDDERPVLL WHTESDSPGP LRITATNRFA NDRWSPQEGR AAAEVLPDPT AVDGAVPDRL PRVEWSGELD STEEDFAVTA NGIPTPFVAT PSFPVDLSSP VAVTGDPITG AVWVGEDANR YEGTALEPTV PEELPDPAEP QGLSETYTEV PEGLQDTISQ LNAEVLDPDD APLDKARTMQ SFLRNGDFEY SLDAAEPQDG ESMVQAFLRE KRGYCTQYAT TMIMMAREQG IPARMAIGML PGEQTASDLG RGSDVGPERV VQRNDAHAWP ELYFEGVGWL RFEPTPSSRA AAVPAYSQPV GADASASPSP SEASPSPASP SPSEASPSPS QASPSPSPSS AEDGDDEGGS TGWWRVLLTW LAVLLVAALA LAYLPWRARQ ARKRIREGEQ SPWSGAWEVL RLDLLDRGVS TLPTDSVRTQ ASAVLRQRPD VDVDTLQELA HRAEAARYAR PSTDAGDAAA ADTLRKQLLT WLDHDESAVD RTRRRLFPAS ATR
|
| |