Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1953 |
Symbol | |
ID | 4076903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2056616 |
End bp | 2058115 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007268 |
Product | peptidase S1C, Do |
Protein accession | YP_613947 |
Protein GI | 99081793 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.100212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0596559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCCAA TCTCATCATC CAAGGCAATA GGCCGTGCGC GCGTCCATGA GGGGGCGCAC AGCGGCTGGC GCTTGTTCTG GATCAGCGTC CTCGGGACCG CGCTGTTGGT CATGCAGTCG CTTTCGGCGG CGGCCCGTCC CGAAAGCCTC GCGCCACTGG CCGAAAAAGT CAGCCCGGCG GTGGTCAATA TCACCACGTC GACCGTGGTC GAGGGCCGCA CTGGCCCGCA GGGGATCGTG CCCGAGGGCT CCCCGTTTGA GGATTTCTTT CGCGAGTTCC AGGACCGCAA CGGCGGCGGC ACGCGGCCAC GTCGTTCCTC GGCGCTTGGC TCGGGGTTCG TGATCTCGGA AGACGGCTAC ATCGTCACCA ACAACCATGT GATCGAGGGG GCTGACGAGA TCGAAATCGA GTTCTTCCCC GGTGAAGGCC AGCCTGCTGA CCTGCTGCCA GCCACTGTTG TTGGCACCGA TCCCAACACC GATATTGCGC TGTTGAAAGT GGACGCGCCG ATGCCGCTGA GCTTTGTGAA GTTCGGTGAC AGCGACAAGG CGCGCGTGGG CGACTGGGTT GTCGCCATGG GTAACCCGCT GGGGCAGGGA TTTTCGCTCT CGGCAGGGAT CGTATCTGCA CGCAACCGGG CGCTTTCGGG CTCTTATGAC GACTACATTC AGACCGATGC GGCCATCAAC CGTGGCAACT CGGGCGGTCC ACTCTTTAAC ATGGATGGCG AAGTCGTTGG CGTGAACACC GCAATCCTGT CGCCGAACGG CGGCTCCATC GGGATCGGCT TCTCCATGGC GTCAAATGTA GTGACCAAAG TGGTGAGCCA GCTCAAGGAG TTCGGGGAAA CCCGCCGGGG TTGGCTTGGC GTGCGCATTC AGGACGTCAC CGATGATCTG GCCGAAGCGA TCGGGCTGGC GAGCGACGAA GGTGTGCTGA TCACCGATGT CCCCGAAGGT CCCGCCAAAG AGGCTGGTCT CCTGGCACGC GACGTCATCA CCAGCTTTGA CGGGTTTGAG GTGAAAGACA CCCGCGATCT CGTGCGTCGC GTGGGCGACA CCGAAGTCGG CAAGACCGTG CGTGTCGTGG TGTTCCGCGA TGGCAAAACC GAGACGATCC GCGTCACGCT GGGACGTCGC GAAGATGCGG TGGGCAACGG CACCGAAGGT GGCGGCGCTG AAGCCGTTCC GGATGAAGGC GAAAAAGAGC TCCTCGGGCT GACCGTGGGC GTTGTGCGCG ACGACATGCG CGAGGATCTG AACCTCGATG CGGGCACCAC TGGTCTGGTG ATCCTCTCCG TGGATGAGAC CTCTGGCGCG TGGGAAAAAG GTCTGCGTGC AGGCGACGTG ATCACTGAGG CAGGCCAGAA CAAGCTCTCT TCGATTGCCG ATCTCGAAGC GCAGATCGCC GCTGCCGAAG AAGGCGGCCG CAAATCGATC TTCCTGATGG TGCGCCGTGC TGGTGAGCCG CGCTTTGTCG CGCTCAATCT TGAGGACTGA
|
Protein sequence | MHPISSSKAI GRARVHEGAH SGWRLFWISV LGTALLVMQS LSAAARPESL APLAEKVSPA VVNITTSTVV EGRTGPQGIV PEGSPFEDFF REFQDRNGGG TRPRRSSALG SGFVISEDGY IVTNNHVIEG ADEIEIEFFP GEGQPADLLP ATVVGTDPNT DIALLKVDAP MPLSFVKFGD SDKARVGDWV VAMGNPLGQG FSLSAGIVSA RNRALSGSYD DYIQTDAAIN RGNSGGPLFN MDGEVVGVNT AILSPNGGSI GIGFSMASNV VTKVVSQLKE FGETRRGWLG VRIQDVTDDL AEAIGLASDE GVLITDVPEG PAKEAGLLAR DVITSFDGFE VKDTRDLVRR VGDTEVGKTV RVVVFRDGKT ETIRVTLGRR EDAVGNGTEG GGAEAVPDEG EKELLGLTVG VVRDDMREDL NLDAGTTGLV ILSVDETSGA WEKGLRAGDV ITEAGQNKLS SIADLEAQIA AAEEGGRKSI FLMVRRAGEP RFVALNLED
|
| |