Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0287 |
Symbol | |
ID | 4077422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 292758 |
End bp | 294155 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005581 |
Product | peptidase S1C, Do |
Protein accession | YP_612282 |
Protein GI | 99080128 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCA GTCTCGCCCT TCCCCTGACG ATTGCCCTCG CAACTGCTCT GCCGCTTGCG TCACCAGCAG AGACCAAGAT CCCGCAGAGC CAGACCGAGA TCTCGCTGGG CTTTGCCCCC TTGGTCAAAG AGGCCGCGCC CGCTGTGGTG AATATCTATG CCAAGATCAT CCGTCAGGAC CGTGCGCGCA GCCCCTTTGC CGATGATCCT TTCTTTGATG ATTTCTTTCG CCAGTTTGCC CAGCCGCGCC CAAGGGTGCA GAACTCGCTC GGCTCTGGTG TAATCCTCTC CGAAGATGGC ATCGTGGTCT CCAACTATCA CGTCGTGGGC GAGGCGTCGG ATATTCGCGT GGTGACCAAC GACCGCCGTG AATATCAGGC CGAGGTGATC CTTGCGGATC AGGCGTCCGA TCTGGCGATC CTGCAACTGC AAGATGCCGA AGGGCTGCCG CATCTGGGAT TGCGCAACAG TGATGAGGTC GAGGTGGGCG AGCTGACCCT GGCCATCGGC AACCCGTTCG GGGTCGGTCA GACGGTGTCC TCCGGCATTA TTTCCGGGCT TGCGCGCACC GGTACCGGCG GCGGGCAGGG CTTTGGCTAT TACATCCAGA CCGATGCGCC GATCAATCCT GGCAACTCTG GTGGGGCGCT GATTGATGTG AATGGCGATC TCATCGGGAT CAACACGCGG ATCCTCAGCC GTTCGGGTGG CTCCAACGGG ATTGGCTTTG CCATTCCGGC CAATCTGGTG CGTGAATTTG TGCGACAGGC GCGGGCCGGT GCAGAGGAGT TTCAACGCCC CTGGGCGGGG ATGACTGGCC AGCCGGTGGA TTCCGATCTG GCAGAAGCGC TGGGCCTTGG TCAGGTGGAT GGGATGTTGA TTTCAGAGCT CCACCCCCAG AGCCCCTTTG TCGAGGCGGG ATTTGAGGTT GGCGATGTGG TGCTGGCCGT GGATGGCGAG CCGGTGAACT CGCCCTCGGA GATGGCCTTT CGCTTGTCGG TGGCGGAACT GGGCGGCACC AGCGCGGTGA CGCGGGTGCG TCAGGGCAAG ACGGACACTG TTGAGGTCGC CTTGATCGAA GCGCCTGACA CCCCCGCCGC CGATCCGATC ACGCTGAGCG AGCGCACCCC GATGCCGGGT CTGGTGGTGG GACGGGTGAA CCCGCAGGTC ATCACCAAGA TGCAGCTACC GCTATCGACC GAGGGTGTTG TGGTGATGGA CCCCGGCCCC TATGCCGGAC GCGGCGGCGT GCGCGCGGGG GATTTGATTT TTGCGATCAA CGGCGAGGCG GTTGAGGCCC CCGAAGATGT AGCAAATCTC CTGATGAGCA GTGACCGCTG GATGCGGATG GACCTGATGC GTCAGGGCCA GCGCGTGTCT CTGCGGTTCC GGCTCTGA
|
Protein sequence | MIRSLALPLT IALATALPLA SPAETKIPQS QTEISLGFAP LVKEAAPAVV NIYAKIIRQD RARSPFADDP FFDDFFRQFA QPRPRVQNSL GSGVILSEDG IVVSNYHVVG EASDIRVVTN DRREYQAEVI LADQASDLAI LQLQDAEGLP HLGLRNSDEV EVGELTLAIG NPFGVGQTVS SGIISGLART GTGGGQGFGY YIQTDAPINP GNSGGALIDV NGDLIGINTR ILSRSGGSNG IGFAIPANLV REFVRQARAG AEEFQRPWAG MTGQPVDSDL AEALGLGQVD GMLISELHPQ SPFVEAGFEV GDVVLAVDGE PVNSPSEMAF RLSVAELGGT SAVTRVRQGK TDTVEVALIE APDTPAADPI TLSERTPMPG LVVGRVNPQV ITKMQLPLST EGVVVMDPGP YAGRGGVRAG DLIFAINGEA VEAPEDVANL LMSSDRWMRM DLMRQGQRVS LRFRL
|
| |