Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_3170 |
Symbol | |
ID | 3935641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 3209135 |
End bp | 3210682 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637905541 |
Product | peptidase S1C, Do |
Protein accession | YP_511112 |
Protein GI | 89055661 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.963921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACCC AAACAAGACC CCCACGTCCC CCCCAGTCCA TGTCGCAGTC CCAGCCCAGA TTGCGGGCCC GTGCGATCGC GATGCCGCAT CAGCGCCGCC TGGACACGCG CCTTGTTCTT GGCGCTCTCA TGGCCGCCCT CACGGCGCTG GCCCTGGCGA TCAGCTCATT GCCCGCGCGT GCGCAATCGG GTGCCCTGCC GGGGTTTGCC GATCTGGTTG ATCAGGTGGG CGACGCGGTG GTCAACATCA CGACGTCGTC GGTCGTTGCT GGCCGTGGCG GTGGCCCCGC GCCGGTCGTG CCGGAAGGCT CTCCGCTGGA AGATTTCTTC AACGAGTTCC TCGGCCCGGA TGGCGGTGAC AACGGTCCCT CACCCCGGCG GAGCCAGGCC CTGGGGTCCG GTTTTGTGAT CTCGGAGGAT GGCTATCTGG TCACCAACAA CCACGTCATT GAGGGCGCGG ATGAGATCAT GATCGAGTTC CGCAACGGTG TGGAACTGGT GGCCGAACTG ATCGGCACAG ACCCCAATAC CGACATCGCG CTGCTGCGTG TGCAAAGCGA TGAGCCGCTG CCATATGTCC CCTTCGGCGT CGCGTCGGAC CACCGCGTCG GCGACTGGGT TATGGTCATG GGCAACCCGC TGGGGCAGGG CTTCTCGGTC TCCGTCGGTG TGGTGTCCGC ATTCGGCCGC TCCCTGTCCG GCACCTATGA CGACTTCATT CAGACCGATG CGGCGATCAA TCAGGGTAAT TCCGGCGGCC CGCTGTTCAA TCTTGAAGGG GAGGTGATCG GGGTGAACAC CGCGATCCTG TCCCCCACGG GCGGCTCCAT CGGTATCGGC TTCGCAATGT CCTCGGATGT CGTGATCAAC GTGGTCGATC AGCTGCGTGA GTTCGGAGAG ACTCGGCGCG GCTGGCTTGG CGTTCGCATC CAAGACGTGA CGACGGAGAT GGCCGACGCG CTGGGGCTTG ATGAGGCCCG TGGCGCGATG GTCACCGACG TGCCCGAGGG CCCCGCGCTT GAGGGTGGGA TCGAGGTTGG TGATGTGATC CTGACGTTCG ACGGAGCCGA TGTGGAAGAC ACGCGCGGCC TCGTGCGCGT GGTGGGTGAC AGCACTGTGG GTGAGACCGT CCGCGTGGTG GTCTTCCGCG ATGGCGCGAC GGAGACGTTG CGCATTACCC TCGGTCGCCG GGAGACGGCG GAATCGGCCG CCGCCCCTGA GACACCCGAA GGCGCAGAAG CCGCTCCCTC CAGTATTGTG CTTGGCATGA CGCTGACCCC CCTGACCGAT GAGTTGCGGG GCGAGCTGAA TGCGCGCGGT GTCACCAATG GTCTGGTGAT CACCGAGATC GACCCGGTCT CCGCTGCTGC TGAAATGGGC CTCCAGGTGG GTGACATCAT CACCGAAGTG ACCCAGATGC CTGTGACAAC AATCGCTGAT TTTCAACTGC GGATCGACGC GGCGGATGAG GCGGGGCAGG AAACGATCCT GCTTCTGATC CGCCGCGACG GCAACCCACG CTTCATGGCC TTGGGCATCA AGGAGTGA
|
Protein sequence | MTTQTRPPRP PQSMSQSQPR LRARAIAMPH QRRLDTRLVL GALMAALTAL ALAISSLPAR AQSGALPGFA DLVDQVGDAV VNITTSSVVA GRGGGPAPVV PEGSPLEDFF NEFLGPDGGD NGPSPRRSQA LGSGFVISED GYLVTNNHVI EGADEIMIEF RNGVELVAEL IGTDPNTDIA LLRVQSDEPL PYVPFGVASD HRVGDWVMVM GNPLGQGFSV SVGVVSAFGR SLSGTYDDFI QTDAAINQGN SGGPLFNLEG EVIGVNTAIL SPTGGSIGIG FAMSSDVVIN VVDQLREFGE TRRGWLGVRI QDVTTEMADA LGLDEARGAM VTDVPEGPAL EGGIEVGDVI LTFDGADVED TRGLVRVVGD STVGETVRVV VFRDGATETL RITLGRRETA ESAAAPETPE GAEAAPSSIV LGMTLTPLTD ELRGELNARG VTNGLVITEI DPVSAAAEMG LQVGDIITEV TQMPVTTIAD FQLRIDAADE AGQETILLLI RRDGNPRFMA LGIKE
|
| |