Gene TM1040_2755 details


Gene Information

Locus tag: TM1040_2755
Symbol:
ID: 4077627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2903128
End bp: 2904465
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 61%
IMG OID: 638008080
Product: carboxyl-terminal protease
Protein accession: YP_614749
Protein GI: 99082595
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
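
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The helper name `check_record` is hypothetical.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree with each other.

    Assumptions (not stated in the record itself):
      * start/end are 1-based, inclusive coordinates on the replicon
      * the last codon is a stop codon and is excluded from the protein length
    """
    span_bp = end_bp - start_bp + 1          # inclusive span on the replicon
    if span_bp != gene_length_bp:
        return False
    if gene_length_bp % 3 != 0:              # a CDS should be a whole number of codons
        return False
    coding_codons = gene_length_bp // 3 - 1  # drop the stop codon
    return coding_codons == protein_length_aa


# Values copied from the Gene Information fields above.
print(check_record(2903128, 2904465, 1338, 445))
```

Under these assumptions the span is 2904465 - 2903128 + 1 = 1338 bp, which is 446 codons, or 445 amino acids plus one stop codon.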


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0444358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT TTGCGATGGC GGCTCTGGGC GGGACGCTTG CGGGCTTTGT GGCAACAACC 
TACGTGGCAG CGCCGTTGCT GGCACAGGAG AGCAATAACA CAACCGTCTA TGAGCAACTC
GATCTGTTTG GTGACATCTT CGAGCGCATT CGGGCGCAAT ACGTCGAAGA GGTCGACGAA
AAAGAGCTGA TCGAGGCGGC GATTGGCGGC ATGCTGCAGT CCCTGGATCC GCACTCTAGC
TATCTGTCCC CTGATGATGC GAGCAAGATG CGGGTGCAGA CCCGTGGGGA GTTCGGTGGC
CTTGGCATCG AAGTCACGCA GGAAGACGGC TTTGTCAAAG TGGTGTCGCC GATGGATGGC
ACACCCGCAG ACGAAGCCGG AGTCGAGGCG GGCGATTTTA TCACCCATGT GGATGGCGAG
AGCCTGTTGG GGCTTGGACT TGATGAAGCG GTCGAACTGA TGCGCGGCCC GGTGGGGTCT
GAAATCATCA TCACCATCGT GCGCGAAGGC GAAGACGAGC CTTTTGATGT TTCCATTATT
CGCGACACCA TCAAGCTGAC GGCTGTACGC GGTCGGACCG AAGGGGACTC GGTGGTGCTG
CGGGTCACGA CGTTCAATGA ACAGACCACG CCCAACCTCG AGGCCAAGCT CGAAGAGCAA
GTCGAGGCGC TGGGCGGCAT GGACAACGTG AACGGCATCG TGCTTGACCT GCGCAACAAC
CCGGGCGGGT TGCTGACCCA GGCGATCAGC GTAGCCGACA GCTTCCTTGA AAGCGGTGAA
ATCGTCTCTA CCCGTGGTCG CAATCCCGAG GATGGCGAGC GGTTCAATGC AACGCCGGGG
GACCTGGTTG GTGGCAAGCC GATTGTGGTG CTGATCAACG GTGGCTCCGC CTCCGCGTCC
GAGATTGTCG CCGGCGCGCT GCAAGACCAC CGCCGCGCGA TCGTTGTGGG CACCAAATCC
TTTGGCAAAG GGTCGGTTCA GACCGTGATG CCCCTCCGCA GCGACGGAGC CATGCGCCTC
ACCACCGCGC GGTATTACAC GCCCTCTGGC CGCTCCATTC AGGCACTGGG TGTCAGCCCG
GACATCGTTG TAGCGCAGCC GCGTCGCCGC CCCGAGGCTG AGGAAGATGA ACCCGCCAGC
AGCGCCTTTG GTCCCCGTTC AGAGGCGGAC CTGCGTGGTC GTCTGAACAA TGACAGCCTC
TCCGAGGATG AGGTGCGCCA GATCGAAGCG GATCGCGAGA AGGCTGAAAA AGCCGCAGAA
CTGCGTGAAC AGGATTATCA GCTGGCCTAC GCCATCGATA TTCTGAAAGG CCTCTCGGCT
CTGGGTCCTA AAGACTAA
 
Protein sequence
MKKFAMAALG GTLAGFVATT YVAAPLLAQE SNNTTVYEQL DLFGDIFERI RAQYVEEVDE 
KELIEAAIGG MLQSLDPHSS YLSPDDASKM RVQTRGEFGG LGIEVTQEDG FVKVVSPMDG
TPADEAGVEA GDFITHVDGE SLLGLGLDEA VELMRGPVGS EIIITIVREG EDEPFDVSII
RDTIKLTAVR GRTEGDSVVL RVTTFNEQTT PNLEAKLEEQ VEALGGMDNV NGIVLDLRNN
PGGLLTQAIS VADSFLESGE IVSTRGRNPE DGERFNATPG DLVGGKPIVV LINGGSASAS
EIVAGALQDH RRAIVVGTKS FGKGSVQTVM PLRSDGAMRL TTARYYTPSG RSIQALGVSP
DIVVAQPRRR PEAEEDEPAS SAFGPRSEAD LRGRLNNDSL SEDEVRQIEA DREKAEKAAE
LREQDYQLAY AIDILKGLSA LGPKD
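
The gene and protein sequences above can be compared, and the GC content recomputed, with a short script. This is a sketch using only the Python standard library, not the tool that produced this record. The function names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is built from the standard genetic code, whose amino acid assignments are shared by translation table 11 (the two tables differ only in which codons may act as start codons).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence for display."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above, with the display spacing left in.
print(translate("ATGAAGAAAT TTGCGATGGC GGCTCTGGGC"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the two blocks against each other. Multiplying the result of `gc_fraction` by 100 gives a percentage to compare with the GC content field.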