Gene TM1040_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1953 
Symbol 
ID4076903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2056616 
End bp2058115 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content63% 
IMG OID638007268 
Productpeptidase S1C, Do 
Protein accessionYP_613947 
Protein GI99081793 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.100212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0596559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCCAA TCTCATCATC CAAGGCAATA GGCCGTGCGC GCGTCCATGA GGGGGCGCAC 
AGCGGCTGGC GCTTGTTCTG GATCAGCGTC CTCGGGACCG CGCTGTTGGT CATGCAGTCG
CTTTCGGCGG CGGCCCGTCC CGAAAGCCTC GCGCCACTGG CCGAAAAAGT CAGCCCGGCG
GTGGTCAATA TCACCACGTC GACCGTGGTC GAGGGCCGCA CTGGCCCGCA GGGGATCGTG
CCCGAGGGCT CCCCGTTTGA GGATTTCTTT CGCGAGTTCC AGGACCGCAA CGGCGGCGGC
ACGCGGCCAC GTCGTTCCTC GGCGCTTGGC TCGGGGTTCG TGATCTCGGA AGACGGCTAC
ATCGTCACCA ACAACCATGT GATCGAGGGG GCTGACGAGA TCGAAATCGA GTTCTTCCCC
GGTGAAGGCC AGCCTGCTGA CCTGCTGCCA GCCACTGTTG TTGGCACCGA TCCCAACACC
GATATTGCGC TGTTGAAAGT GGACGCGCCG ATGCCGCTGA GCTTTGTGAA GTTCGGTGAC
AGCGACAAGG CGCGCGTGGG CGACTGGGTT GTCGCCATGG GTAACCCGCT GGGGCAGGGA
TTTTCGCTCT CGGCAGGGAT CGTATCTGCA CGCAACCGGG CGCTTTCGGG CTCTTATGAC
GACTACATTC AGACCGATGC GGCCATCAAC CGTGGCAACT CGGGCGGTCC ACTCTTTAAC
ATGGATGGCG AAGTCGTTGG CGTGAACACC GCAATCCTGT CGCCGAACGG CGGCTCCATC
GGGATCGGCT TCTCCATGGC GTCAAATGTA GTGACCAAAG TGGTGAGCCA GCTCAAGGAG
TTCGGGGAAA CCCGCCGGGG TTGGCTTGGC GTGCGCATTC AGGACGTCAC CGATGATCTG
GCCGAAGCGA TCGGGCTGGC GAGCGACGAA GGTGTGCTGA TCACCGATGT CCCCGAAGGT
CCCGCCAAAG AGGCTGGTCT CCTGGCACGC GACGTCATCA CCAGCTTTGA CGGGTTTGAG
GTGAAAGACA CCCGCGATCT CGTGCGTCGC GTGGGCGACA CCGAAGTCGG CAAGACCGTG
CGTGTCGTGG TGTTCCGCGA TGGCAAAACC GAGACGATCC GCGTCACGCT GGGACGTCGC
GAAGATGCGG TGGGCAACGG CACCGAAGGT GGCGGCGCTG AAGCCGTTCC GGATGAAGGC
GAAAAAGAGC TCCTCGGGCT GACCGTGGGC GTTGTGCGCG ACGACATGCG CGAGGATCTG
AACCTCGATG CGGGCACCAC TGGTCTGGTG ATCCTCTCCG TGGATGAGAC CTCTGGCGCG
TGGGAAAAAG GTCTGCGTGC AGGCGACGTG ATCACTGAGG CAGGCCAGAA CAAGCTCTCT
TCGATTGCCG ATCTCGAAGC GCAGATCGCC GCTGCCGAAG AAGGCGGCCG CAAATCGATC
TTCCTGATGG TGCGCCGTGC TGGTGAGCCG CGCTTTGTCG CGCTCAATCT TGAGGACTGA
 
Protein sequence
MHPISSSKAI GRARVHEGAH SGWRLFWISV LGTALLVMQS LSAAARPESL APLAEKVSPA 
VVNITTSTVV EGRTGPQGIV PEGSPFEDFF REFQDRNGGG TRPRRSSALG SGFVISEDGY
IVTNNHVIEG ADEIEIEFFP GEGQPADLLP ATVVGTDPNT DIALLKVDAP MPLSFVKFGD
SDKARVGDWV VAMGNPLGQG FSLSAGIVSA RNRALSGSYD DYIQTDAAIN RGNSGGPLFN
MDGEVVGVNT AILSPNGGSI GIGFSMASNV VTKVVSQLKE FGETRRGWLG VRIQDVTDDL
AEAIGLASDE GVLITDVPEG PAKEAGLLAR DVITSFDGFE VKDTRDLVRR VGDTEVGKTV
RVVVFRDGKT ETIRVTLGRR EDAVGNGTEG GGAEAVPDEG EKELLGLTVG VVRDDMREDL
NLDAGTTGLV ILSVDETSGA WEKGLRAGDV ITEAGQNKLS SIADLEAQIA AAEEGGRKSI
FLMVRRAGEP RFVALNLED