Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_10940 |
Symbol | |
ID | 8375301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | - |
Start bp | 1242492 |
End bp | 1244111 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644994016 |
Product | trypsin-like serine protease with C-terminal PDZ domain protein |
Protein accession | YP_003151467 |
Protein GI | 256827508 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000345024 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 153 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA ATGGAATTCC TCCCGTAGGA CCGACGGGCT CTTCATATAC GGGATATACC GGACAGCCGG GTGCGCCGAT TCCACCGCAG TCGACTCAAC CGATGACTTC AGCGACACAA ACGGCACAAA CAATGCAGGC AGCACAATCG GCTACAACAG CGCAGCCAAC CCAGGCGACA CAGGCAACTT CGTATGCAAC CGTGCCGCCT CAATTTACCT CCGTTCCTCC GCAACCTCCA ACCGTTCCAC CAGCGGGCAC AACAACTGCT ATACCAGCAG CTCATGTGCC CCAGACCGGT GTTTCAGGTA AGCGTGTCTT TGGTATTGCG TTTATCGGTG CACTTGCTGC GTGTGCGGTT GCTGCCGCGT GTTTTCTTGG CTATCAGTCG ATTACGGGGC ATTCGGCTTC CCAGGTGGTA CTGGGGTCTT CCAGTGACAG CACCATTACC GCAAGCGACG ACGAGACTGA TCGTGCTGAA AAGGTAGCTG ACAAGACGCT TCCATCGGTT ACAGCTATTA ATGTGTACAC CGACCAATCG GGGTGGGGCA GCATGTTGGG TCGTTCAACC GATCAGTCGA GTTTGGTTAA GACAAGCCTG GGCAGCGGCG TGATTATTTC TTCCGATGGC TACATTCTGA CAAATAACCA TGTGGTTGAA TCAGCTGATG CATTAAAGGT AACGGCCAAT GGGCAAGAAT ATGATGCCAA GGTTGTGGGT ACTGACCCAA CTACTGACCT GGCAGTCATT AAGATTGATG CAACGGGTTT GACCCCCATC GAGATCGGCA AATCGTCCGA TTTGAAGGCT GGTCAGTGGG TCATGACCGT TGGCAGCCCC TTTGGTCTTG AACAGTCAGT GGCGACGGGT ATTATTTCGG CAACAAGTCG TACCGTGGCC GTGTCGTCAA GTAGCGATGA AGGGTCGGGT AATGGCTATA ACAATAGCTA CTCTTCGGCT GCTGCACCTA CGATTTACAC GAACATGATT CAGACCGATG CAGCTATCAA CCCGGGCAAT TCCGGTGGTG CGCTTGTTGA TGCAAATGGT AAGCTCATTG GCATCAATGC GGTTATCGAA TCGTATTCAG GTAACTATTC AGGTGTTGGA TTTGCCATCC CTGTTGACTA CGCCATGGAT ATCGCGCAGC AGATTATTTC TGGTAAAACG CCAACGCATG CCATGATTGG GGTAACGCCG ATTTCAATTA CGAGCCAGCT CAATCAGCGC TACCACTTGG GTACTGATTC AGGTGCGTAT GTATCGAGCA TTGTTGAAGG TAGTGGCGCT GCGCAGGCTG GCCTTGAAGA GGGCGATATC ATCACGAAGG TTGATGATAC GGCGGTAACA GACGCAACAG GATTGATCGC GGCTGTGCGT TCGAAGAATG TCGGCGACAC AGTAACGGTT ACCTATCTGC GCGATGGTCA GCAGATGACC GCTCAAGTGA CCCTTGGATC AGACGACAAT GCCAATTCGC TGTTGCGCAA GAATAGTAGC AGCCGGGGCG GCTTATTTGG TATGAGCTAT GACAACCAGG GAAGCGGCAA CAACCAAGGC TCAACGGCAA CTGGTTCAAC AGGATCGAGT CAGTCTTCTC CTCAAAGGGC AGCTGCTTAG
|
Protein sequence | MTENGIPPVG PTGSSYTGYT GQPGAPIPPQ STQPMTSATQ TAQTMQAAQS ATTAQPTQAT QATSYATVPP QFTSVPPQPP TVPPAGTTTA IPAAHVPQTG VSGKRVFGIA FIGALAACAV AAACFLGYQS ITGHSASQVV LGSSSDSTIT ASDDETDRAE KVADKTLPSV TAINVYTDQS GWGSMLGRST DQSSLVKTSL GSGVIISSDG YILTNNHVVE SADALKVTAN GQEYDAKVVG TDPTTDLAVI KIDATGLTPI EIGKSSDLKA GQWVMTVGSP FGLEQSVATG IISATSRTVA VSSSSDEGSG NGYNNSYSSA AAPTIYTNMI QTDAAINPGN SGGALVDANG KLIGINAVIE SYSGNYSGVG FAIPVDYAMD IAQQIISGKT PTHAMIGVTP ISITSQLNQR YHLGTDSGAY VSSIVEGSGA AQAGLEEGDI ITKVDDTAVT DATGLIAAVR SKNVGDTVTV TYLRDGQQMT AQVTLGSDDN ANSLLRKNSS SRGGLFGMSY DNQGSGNNQG STATGSTGSS QSSPQRAAA
|
| |