Gene TM1040_0287 details

Gene Information

Locus tag: TM1040_0287
Symbol:
ID: 4077422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 292758
End bp: 294155
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 63%
IMG OID: 638005581
Product: peptidase S1C, Do
Protein accession: YP_612282
Protein GI: 99080128
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
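
The length fields in this record agree with the start and end coordinates if those coordinates are read as inclusive and 1-based. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(292758, 294155)
print(length, protein_length(length))  # 1398 465
```

The result matches the Gene Length (1398 bp) and Protein Length (465 aa) fields.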


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCA GTCTCGCCCT TCCCCTGACG ATTGCCCTCG CAACTGCTCT GCCGCTTGCG 
TCACCAGCAG AGACCAAGAT CCCGCAGAGC CAGACCGAGA TCTCGCTGGG CTTTGCCCCC
TTGGTCAAAG AGGCCGCGCC CGCTGTGGTG AATATCTATG CCAAGATCAT CCGTCAGGAC
CGTGCGCGCA GCCCCTTTGC CGATGATCCT TTCTTTGATG ATTTCTTTCG CCAGTTTGCC
CAGCCGCGCC CAAGGGTGCA GAACTCGCTC GGCTCTGGTG TAATCCTCTC CGAAGATGGC
ATCGTGGTCT CCAACTATCA CGTCGTGGGC GAGGCGTCGG ATATTCGCGT GGTGACCAAC
GACCGCCGTG AATATCAGGC CGAGGTGATC CTTGCGGATC AGGCGTCCGA TCTGGCGATC
CTGCAACTGC AAGATGCCGA AGGGCTGCCG CATCTGGGAT TGCGCAACAG TGATGAGGTC
GAGGTGGGCG AGCTGACCCT GGCCATCGGC AACCCGTTCG GGGTCGGTCA GACGGTGTCC
TCCGGCATTA TTTCCGGGCT TGCGCGCACC GGTACCGGCG GCGGGCAGGG CTTTGGCTAT
TACATCCAGA CCGATGCGCC GATCAATCCT GGCAACTCTG GTGGGGCGCT GATTGATGTG
AATGGCGATC TCATCGGGAT CAACACGCGG ATCCTCAGCC GTTCGGGTGG CTCCAACGGG
ATTGGCTTTG CCATTCCGGC CAATCTGGTG CGTGAATTTG TGCGACAGGC GCGGGCCGGT
GCAGAGGAGT TTCAACGCCC CTGGGCGGGG ATGACTGGCC AGCCGGTGGA TTCCGATCTG
GCAGAAGCGC TGGGCCTTGG TCAGGTGGAT GGGATGTTGA TTTCAGAGCT CCACCCCCAG
AGCCCCTTTG TCGAGGCGGG ATTTGAGGTT GGCGATGTGG TGCTGGCCGT GGATGGCGAG
CCGGTGAACT CGCCCTCGGA GATGGCCTTT CGCTTGTCGG TGGCGGAACT GGGCGGCACC
AGCGCGGTGA CGCGGGTGCG TCAGGGCAAG ACGGACACTG TTGAGGTCGC CTTGATCGAA
GCGCCTGACA CCCCCGCCGC CGATCCGATC ACGCTGAGCG AGCGCACCCC GATGCCGGGT
CTGGTGGTGG GACGGGTGAA CCCGCAGGTC ATCACCAAGA TGCAGCTACC GCTATCGACC
GAGGGTGTTG TGGTGATGGA CCCCGGCCCC TATGCCGGAC GCGGCGGCGT GCGCGCGGGG
GATTTGATTT TTGCGATCAA CGGCGAGGCG GTTGAGGCCC CCGAAGATGT AGCAAATCTC
CTGATGAGCA GTGACCGCTG GATGCGGATG GACCTGATGC GTCAGGGCCA GCGCGTGTCT
CTGCGGTTCC GGCTCTGA
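
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The function names are hypothetical, and the example runs only on the first two blocks of the sequence, not on the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
first_blocks = clean_sequence("ATGATCCGCA GTCTCGCCCT")
print(len(first_blocks), gc_fraction(first_blocks))  # 20 0.6
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content.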
 
Protein sequence
MIRSLALPLT IALATALPLA SPAETKIPQS QTEISLGFAP LVKEAAPAVV NIYAKIIRQD 
RARSPFADDP FFDDFFRQFA QPRPRVQNSL GSGVILSEDG IVVSNYHVVG EASDIRVVTN
DRREYQAEVI LADQASDLAI LQLQDAEGLP HLGLRNSDEV EVGELTLAIG NPFGVGQTVS
SGIISGLART GTGGGQGFGY YIQTDAPINP GNSGGALIDV NGDLIGINTR ILSRSGGSNG
IGFAIPANLV REFVRQARAG AEEFQRPWAG MTGQPVDSDL AEALGLGQVD GMLISELHPQ
SPFVEAGFEV GDVVLAVDGE PVNSPSEMAF RLSVAELGGT SAVTRVRQGK TDTVEVALIE
APDTPAADPI TLSERTPMPG LVVGRVNPQV ITKMQLPLST EGVVVMDPGP YAGRGGVRAG
DLIFAINGEA VEAPEDVANL LMSSDRWMRM DLMRQGQRVS LRFRL
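
The protein sequence can be spot-checked against the gene sequence by translating codons. Translation table 11 assigns amino acids to codons in the same way as the standard code and differs only in the set of permitted start codons. This gene starts with ATG, so the difference does not affect the result here. The sketch below is a hand-rolled illustration with hypothetical names, not the pipeline that produced this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence.
print(translate("ATGATCCGCA GTCTCGCCCT T"))  # MIRSLAL
print(translate("CGGTTCCGGCTCTGA"))          # RFRL
```

Both fragments agree with the start (MIRSLAL) and end (RFRL) of the protein sequence shown in this record.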