Gene TM1040_2092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2092 
Symbol 
ID4077843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2195880 
End bp2198258 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content61% 
IMG OID638007411 
ProductATP-dependent Clp protease ATP-binding subunit clpA 
Protein accessionYP_614086 
Protein GI99081932 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGACG ATCCCCATAA CCCAAGACAG AGGAGCACGC CCGTGCCTTC ATTCTCGAGC 
ACACTGGAAC AAGCCATTCA CGCGGCGCTG GCGCTGGCGA ATGAACGGCG TCACGAATTT
GCAACCCTAG AGCACCTGCT TTTGGCCCTC ACCGAGGAGC CGGATGCCGC ACGCGTCATG
CGTGCCTGCA GCGTGAACCT CGACGAGCTG CGATCGACCC TGCTGGAATT TGTCGACGAG
GATCTGGCCA ATCTGGTCAC CGACATCGAC GGTTCCGAAG CCGTGCCCAC CGCAGCCTTC
CAGCGCGTGA TTCAACGCGC AGCCATCCAT GTTCAGTCCT CCGGCCGCAC CGAAGTGACC
GGCGCGAACG TGCTGGTGGC AATCTTTGCC GAGCGCGAGA GCGACGCGGC CTTCTTCCTG
CAGGACCAGG ACATGACACG CTATGACGCG GTGAACTTCA TCGCGCACGG CGTTGCCAAA
GATCCCGCCT ACGGCGAGAA CCGCCCCGTG ACAGGCGCCA GCGAATCCGA AGAAGAAATC
GGCGGCGGTC CGTCCATGGG CAGCCCAGAG GGCGAACAAA AAGAGTCGGC TCTGGCGAAA
TACTGCGTCG ACCTCAACGC CAAATCCCGC GAAGGCGACA TCGATCCCCT GATCGGGCGC
GACAGCGAAG TCGAGCGCTG CATCCAGGTG CTCTGCCGCC GCCGCAAGAA CAACCCGCTT
TTGGTGGGGG ATCCCGGCGT TGGTAAAACC GCCATCGCTG AAGGTCTCGC CCGCAAGGTC
GTGCAGGGCG AAGTCCCCGA GGTTCTGTCG GAAACCACCA TCTACTCGCT CGACATGGGC
GCGCTTTTGG CCGGCACACG CTATCGCGGT GACTTCGAAG AGCGCCTTAA GGCCGTGGTG
ACCGAGCTTG AAGAACATCC CGATGCGGTG CTGTTCATCG ACGAGATCCA CACCGTGATC
GGTGCAGGTG CCACCTCGGG TGGGGCAATG GATGCGTCAA ACCTGCTGAA GCCTGCCCTG
CAGGGCGGCA AGCTGCGCAC CATGGGATCC ACCACCTACA AGGAGTTCCG TCAACATTTT
GAGAAGGACC GCGCGCTGTC GCGTCGCTTC CAGAAGATCG ACGTAAACGA GCCTTCGGTC
GAAGACAGCA TCGCGATCCT CAAAGGGCTG AAACCCTACT TTGAGGACCA TCACTCGATC
AAATTCACAT CGGATGCGAT CAAATCCGCG GTGGAACTTT CGGCGCGCTA CATCAACGAC
CGCAAGCTGC CGGACAAGGC CATCGACGTA ATCGACGAAG CCGGCGCCGC GCAGCACCTT
GTGGCCGAGA GCAAGCGCCG CAAGACCATC GGCGTCAAGG AGATTGAGGC CGTGGTGGCC
AAGATCGCCC GCATCCCGCC GAAAAACGTC TCCAAGGACG ATGCCGAGGT GCTGAAGGAC
CTCGAGGCGA GCCTGAAGCG CGTGGTCTTT GGTCAGGACG CAGCCATCGA TGCGCTGTCT
TCAGCGATCA AACTGGCCCG TGCCGGGCTG CGCGAGCCGG AAAAACCCAT CGGGAACTAT
CTCTTTGCGG GCCCCACCGG TGTCGGCAAA ACCGAGGTGG CCAAGCAGCT CGCAGATACG
CTTGGGGTGG AACTCCTGCG CTTTGACATG TCGGAGTACA TGGAGAAACA CGCGGTCTCC
CGCCTGATCG GCGCGCCTCC GGGCTATGTC GGCTTTGACC AGGGTGGTCT TCTGACAGAT
GGCGTCGACC AGCATCCCCA CTGCGTGCTG CTTCTCGACG AGATCGAGAA AGCGCACCCG
GATGTGTTCA ATATCCTCCT GCAGGTAATG GACAATGGCC AGCTCACGGA TCACAACGGC
CGCACGGTCA ATTTCCGCAA CGTGGTTCTG ATCATGACCT CCAACGCGGG CGCCTCGGAA
CTGGCGAAAT CCGCCATCGG CTTTGGCCGC GACCGGCGCG AGGGTGAAGA CACAGCTGCC
ATCGAGCGCA CCTTCAGCCC CGAATTCCGC AACCGTCTGG ATGCCACGAT TTCCTTTGGG
CCGCTGCCCA AGGAGGTCAT CCTGCAGGTG GTCGAGAAGT TCGTTCTGCA GCTTGAGGCG
CAGCTCATGG ACCGCAACGT CTCGATTGAG CTTACCCGCA AGGCAGCTGA ATGGCTCGCG
GACAAAGGCT ATGATGACAA GATGGGTGCG CGCCCCTTGG GCCGCGTCAT CCAGGAGCAC
ATCAAGAAGC CGCTTGCAGA AGAGCTCTTG TTCGGCAAGC TCTCCAAGGG CGGTGTGGTT
CAGGTCGGCA TCAAGGACGG CAAACTTGAT CTTCGGATCG AGGGTCCAGG CAAGCCCCGC
CTGAGCGGCA ACAAACCACC GCTCCTGACT GCGGAATAG
 
Protein sequence
MADDPHNPRQ RSTPVPSFSS TLEQAIHAAL ALANERRHEF ATLEHLLLAL TEEPDAARVM 
RACSVNLDEL RSTLLEFVDE DLANLVTDID GSEAVPTAAF QRVIQRAAIH VQSSGRTEVT
GANVLVAIFA ERESDAAFFL QDQDMTRYDA VNFIAHGVAK DPAYGENRPV TGASESEEEI
GGGPSMGSPE GEQKESALAK YCVDLNAKSR EGDIDPLIGR DSEVERCIQV LCRRRKNNPL
LVGDPGVGKT AIAEGLARKV VQGEVPEVLS ETTIYSLDMG ALLAGTRYRG DFEERLKAVV
TELEEHPDAV LFIDEIHTVI GAGATSGGAM DASNLLKPAL QGGKLRTMGS TTYKEFRQHF
EKDRALSRRF QKIDVNEPSV EDSIAILKGL KPYFEDHHSI KFTSDAIKSA VELSARYIND
RKLPDKAIDV IDEAGAAQHL VAESKRRKTI GVKEIEAVVA KIARIPPKNV SKDDAEVLKD
LEASLKRVVF GQDAAIDALS SAIKLARAGL REPEKPIGNY LFAGPTGVGK TEVAKQLADT
LGVELLRFDM SEYMEKHAVS RLIGAPPGYV GFDQGGLLTD GVDQHPHCVL LLDEIEKAHP
DVFNILLQVM DNGQLTDHNG RTVNFRNVVL IMTSNAGASE LAKSAIGFGR DRREGEDTAA
IERTFSPEFR NRLDATISFG PLPKEVILQV VEKFVLQLEA QLMDRNVSIE LTRKAAEWLA
DKGYDDKMGA RPLGRVIQEH IKKPLAEELL FGKLSKGGVV QVGIKDGKLD LRIEGPGKPR
LSGNKPPLLT AE