Gene TM1040_2361 details


Gene Information

Locus tag: TM1040_2361
Symbol:
ID: 4076480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2482013
End bp: 2483926
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 59%
IMG OID: 638007683
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_614355
Protein GI: 99082201
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
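
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2482013
end_bp = 2483926
gene_length_bp = 1914
protein_length_aa = 637


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks agree with the record: 2483926 - 2482013 + 1 = 1914 bp, and 1914 / 3 - 1 = 637 aa.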


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.698046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.908613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGCAACG CACGCAATAT CGCCTTCTGG GTCGTTCTGT TTCTACTCGT TGTGACCCTG 
TTCAATCTGT TCAGCGGGTC CGGCAGCACA CTACAGAGCC GCGAGAAGAC TTATTCGGAA
TTCGTGACCG CTGTAGAGGG CGGAAACGTC AGCACTGTGA CGCTGGACGG CGAACAGGTT
CGCTACACGA CTTCGGATGG CCAGAGCTAC ACCACGATCA AACCGGGCGA CGCAGAAGTC
ACCCAAATGC TGATCGACAA CAACATCCCC GTGCGCGCAG AAAAACAGCA GCAATCGACC
TTCCAGTCCT TCCTGCTGAC GCTTCTGCCG TTCCTGCTGC TGATCGGCGT CTGGATCTAT
TTCATGAACC GCATGCAGGG CGGGGGCAAA GGTGGCGCCA TGGGCTTTGG CAAATCCAAG
GCCAAGATGC TGACCGAGAA GCATGGTCGC GTGACGTTTG ACGACGTTGC AGGTATCGAC
GAGGCGAAAG AAGAGCTCGA AGAGATCGTG GAATTCCTGC GCAACCCGCA GAAATTCTCG
CGTCTCGGCG GCAAGATCCC CAAAGGCGCG CTGCTTGTAG GCCCTCCGGG TACTGGTAAG
ACGCTCCTTG CGCGTGCGAT CGCGGGCGAG GCGGGTGTGC CGTTCTTCAC CATCTCCGGT
TCCGATTTTG TCGAGATGTT CGTTGGTGTG GGTGCAAGCC GAGTCCGCGA TATGTTTGAA
CAGGCCAAGA AGAATGCGCC CTGTATCGTC TTTATCGACG AAATCGACGC CGTGGGTCGC
CACCGGGGTG CCGGTTATGG CGGCGGCAAT GACGAGCGCG AACAGACCCT CAACCAGCTG
TTGGTTGAAA TGGACGGCTT TGAGGCCAAC GAGGGCGTGA TCATCCTAGC GGCCACCAAC
CGCAAGGACG TGCTTGACCC GGCCTTGCTG CGTCCAGGCC GCTTTGACCG CAACGTGACC
GTCGGAAACC CTGACATCAA AGGTCGCGAG AAGATCCTCG GCGTGCATGC CCGCAAGACC
CCTCTGGGTG CGGACGTGGA CCTGCGCATC ATTGCGCGTG GCACCCCGGG CTTCTCCGGC
GCGGATCTGG CGAACCTTGT GAACGAGGCT GCTTTGATGG CTGCGCGTGT GGGCCGCCGC
TTTGTCACCA TGGAAGATTT TGAAAACGCC AAGGACAAAG TCATGATGGG GGCCGAGCGC
CGCTCCATGG TGCTGACCGC CGATCAGAAG GAAAAGACTG CCTATCACGA GGCAGGTCAC
GCTGTGGTTG GTCTGAAGCT GCCGGAATGT GATCCAGTCT ACAAGGCGAC GATCATCCCC
CGTGGCGGCG CGCTTGGCAT GGTGGTGAGC CTTCCTGAGA TGGACCGTCT GAACTGGCAC
AAGGACGAGT GCGAGCAGAA GTTGGCGATG ACCATGGCCG GTAAGGCTGC CGAGATCATC
AAATATGGCC CCGGCCATGT GTCCAATGGC CCCGCCGGCG ACATTCAGCA GGCGAGCCAA
CTGGCGCGGG CCATGGTGCT GCGCTGGGGG ATGTCCGACA AGGTCGGTAA CATCGACTAC
GCCGAAGCGC ATGAGGGCTA TTCCGGCAAC ACGGCTGGGT TCTCGGTCTC TGCCAATACC
AAAGAGCTTA TCGAAGAAGA GGTCCGCCGC TTCATCGAGG AGGCCTACCA ACGCGCTTAT
CAGATCCTTG AAGAGAACAA GGATGAATGG GAGCGTCTGG CACAAGGTCT TCTGGAATAT
GAGACGCTGA CAGGCGAAGA GATCAAGCGC GTGATGAACG GCGAGCCGCC GCAGGCGGAT
GATGACGCTG ATGATGACGC CGACACCGGT GCCACCTCCG TCACCGCGAT CCCGAAGGCA
AAGTCCAAGA AAACTCCGCC CGAAGGGGAT ATGGAACCTG AGCCTTCTTC CTAA
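
The GC content field in the gene information can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper, and how the record rounds its reported value is not stated.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Usage: paste the full gene sequence here; only the first line is shown.
first_line = "TTGGGCAACG CACGCAATAT CGCCTTCTGG GTCGTTCTGT TTCTACTCGT TGTGACCCTG"
print(round(gc_percent(first_line), 1))
```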
 
Protein sequence
MGNARNIAFW VVLFLLVVTL FNLFSGSGST LQSREKTYSE FVTAVEGGNV STVTLDGEQV 
RYTTSDGQSY TTIKPGDAEV TQMLIDNNIP VRAEKQQQST FQSFLLTLLP FLLLIGVWIY
FMNRMQGGGK GGAMGFGKSK AKMLTEKHGR VTFDDVAGID EAKEELEEIV EFLRNPQKFS
RLGGKIPKGA LLVGPPGTGK TLLARAIAGE AGVPFFTISG SDFVEMFVGV GASRVRDMFE
QAKKNAPCIV FIDEIDAVGR HRGAGYGGGN DEREQTLNQL LVEMDGFEAN EGVIILAATN
RKDVLDPALL RPGRFDRNVT VGNPDIKGRE KILGVHARKT PLGADVDLRI IARGTPGFSG
ADLANLVNEA ALMAARVGRR FVTMEDFENA KDKVMMGAER RSMVLTADQK EKTAYHEAGH
AVVGLKLPEC DPVYKATIIP RGGALGMVVS LPEMDRLNWH KDECEQKLAM TMAGKAAEII
KYGPGHVSNG PAGDIQQASQ LARAMVLRWG MSDKVGNIDY AEAHEGYSGN TAGFSVSANT
KELIEEEVRR FIEEAYQRAY QILEENKDEW ERLAQGLLEY ETLTGEEIKR VMNGEPPQAD
DDADDDADTG ATSVTAIPKA KSKKTPPEGD MEPEPSS
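
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a hand-rolled illustration, not the method used to build this record: it assumes that table 11 shares the standard codon-to-amino-acid assignments and that an alternative start codon such as TTG (the first codon of this gene) is read as methionine when it initiates translation. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 is assumed to share).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cleaned), 3):
        codon = cleaned[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation and is not emitted
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGGCAACG CACGCAATAT CGCCTTCTGG"))  # MGNARNIAFW
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (CCT TCT TCC TAA) translate to the terminal PSS followed by a stop.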