Gene MCA1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1789 
SymbolclpA 
ID3103829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1918320 
End bp1920590 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content59% 
IMG OID637170949 
ProductATP-dependent Clp protease, ATP-binding subunit ClpA 
Protein accessionYP_114227 
Protein GI53804153 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGTA AAGAACTCGA ACAATCATTG AATGCGGCCT TCCGTACCGC CTACGAGAAA 
CGCCATGAGT TCATCACCGT GGAGCACCTG CTGCTGGCCA TGCTCGACAA CGGCGCCGCC
ATCGAAGTCC TGCGTGCCTG CGGTGGCAAT CTCGACCTGC TCCGCAACGA ACTGAACGAA
TTCCTGGATG AAACCACGCC GCTGATCCCT GCCGGCGTCA AGCGTGATAC CCAGCCGACG
CTGGGTTTCC AGCGGGTGCT CCAGCGCGCG GCCTTTCACG TGCAGTCGTC GGGGAAAAAA
GAGGTTACCG GAGCGAACGT CCTGGTCGCC ATCTTCAGCG AGCAGGACTC GCATGCGGTC
TATCTGTTGC ACAAGCAGGA CATCACCCGC CTCGACGTGG TGAACTTCAT TTCCCACGGC
ATCTCCAAAG TGCGGGACGA GGCTGATGGC ACTGCCAGGG GCTCGGCTCC CGAATCCGAC
GACGCCGAAA GCGGGAGCAG CGCCAACCCG CTCGAAAAGT TCGCCACCAA TCTCAACGAA
TCCGCCCGGC GCGGAAAAAT CGACCCGCTG ATCGGCCGCA AGGACGAACT GGAGCGCACG
GTCCAGGTGC TGTGTCGCCG AAGGAAGAAT AATCCGTTGT TCGTCGGCGA AGCCGGGGTG
GGTAAAACTG CCATCGCCGA AGGTCTGGCG AAGAAAATCG TTGAAAACGA CGTACCGGAA
GTGCTCAAGG ACAGCACGAT CTATTCGCTG GATCTCGGCA GTCTGGTGGC GGGAACCAAG
TACCGCGGTG ATTTCGAGAA GCGTCTGAAG TCGCTCCTGG CGCAGCTCAA GAAGGAGCCG
AATGCCATCC TCTTCATCGA CGAGATTCAC ACCATCATCG GCGCTGGCTC CGCCTCGGGC
GGGGTGATGG ATGCATCCAA TCTGATCAAA CCGGTCCTGG CCTCAGGCGA GTTGCGCTGC
ATCGGTTCGA CGACCTACCA GGAATACCGC GGCATCTTCG AAAAGGACCG CGCGCTGGCC
CGCCGCTTCC AGAAAATCGA TATTCCCGAA CCCTCGGTCG AAGAAACCTA CCAGATTCTC
AAAGGACTGA AGACCCGTTT CGAACAGCAC CATGACGTCA AATATTCCCT CGCTGCGCTG
CGCACTGCGG TGGAGCTGTC GGACCGGTAC ATCAACGACC GTCATCTCCC CGACAAAGCC
ATCGACGTCA TCGACGAGGC GGGAGCGAAT CAGCGCCTGC TGCCGCCGTC GCGCCGCAAG
AAGACCATCG GCACCGTGGA CATCGAGGAC ATCGTTTCGA AGATCGCGCG GATTCCCGCC
AAAACCGTTT CCGCCAACGA CAAGGAGAAA TTGCGCGACC TGGAATCCAA TCTGAAGATG
CTGGTCTTCG GTCAGGATGA AGCCATAGCC GCACTGTCGT CGGCGATCAA ATTGTCCCGC
GCAGGTCTGC GCGACGGTCA GAAACCCATA GGCTCATTTC TTTTTGCCGG CCCCACTGGC
GTAGGCAAAA CGGAAGTGAC ACGCCAGCTC GCCCGCATGC TCGGTGTCGA GCTGATCCGT
TTCGACATGT CCGAATACAT GGAGCGGCAC ACGGTGTCCC GGCTTATCGG AGCGCCGCCG
GGCTATGTCG GATTCGACCA GGGAGGTCTT CTGACCGAAG CCATCAACAA ACATCCGCAT
GCCGTGCTGC TGCTCGACGA AATCGAAAAA GCGCACCCGG ACGTCTTCAA TCTGCTGTTA
CAAGTGATGG ATCACGGCAC TCTGACGGAC AACAACGGGC GCAAGGCCGA CTTCCGTAAC
ATCATCCTGG TCATGACCAC CAATGCCGGG GCTTTCGAGG GTGCCCGCCC GTCGATAGGT
TTCACTCCGC AAGACCACTC CACCGATAGC TTGAAAGCGA TCGAACGCAC CTTCTCGCCC
GAGTTCCGGA ACCGCCTCGA CGCGATCATC CAGTTCAACC CGCTCAGCCC GGAAACGATC
GGGCATGTGG TGGACAAATT CATCTTCGAA CTCGAGGCGC AGTTGGCCGA GAAGCAGGTA
TCCCTGGTCA TCGAGCCGGA CGCACGTGCA TGGCTGGCCG AGCACGGTTT CGACTCGAAG
ATGGGAGCGC GTCCGATGGC GCGCGTGATT CAGGAAAACA TCAAGAAGCC GCTGGCCGAG
GAAATCTTGT TCGGCCGCCT CGCGCATGGT GGCACGGTGC GTGTCGGCGC TTCCTCCGGC
GGACTGACAT TTACCTACGA AACCCGCCAA AAAGCCGCCG AGCCCGTATG A
 
Protein sequence
MLSKELEQSL NAAFRTAYEK RHEFITVEHL LLAMLDNGAA IEVLRACGGN LDLLRNELNE 
FLDETTPLIP AGVKRDTQPT LGFQRVLQRA AFHVQSSGKK EVTGANVLVA IFSEQDSHAV
YLLHKQDITR LDVVNFISHG ISKVRDEADG TARGSAPESD DAESGSSANP LEKFATNLNE
SARRGKIDPL IGRKDELERT VQVLCRRRKN NPLFVGEAGV GKTAIAEGLA KKIVENDVPE
VLKDSTIYSL DLGSLVAGTK YRGDFEKRLK SLLAQLKKEP NAILFIDEIH TIIGAGSASG
GVMDASNLIK PVLASGELRC IGSTTYQEYR GIFEKDRALA RRFQKIDIPE PSVEETYQIL
KGLKTRFEQH HDVKYSLAAL RTAVELSDRY INDRHLPDKA IDVIDEAGAN QRLLPPSRRK
KTIGTVDIED IVSKIARIPA KTVSANDKEK LRDLESNLKM LVFGQDEAIA ALSSAIKLSR
AGLRDGQKPI GSFLFAGPTG VGKTEVTRQL ARMLGVELIR FDMSEYMERH TVSRLIGAPP
GYVGFDQGGL LTEAINKHPH AVLLLDEIEK AHPDVFNLLL QVMDHGTLTD NNGRKADFRN
IILVMTTNAG AFEGARPSIG FTPQDHSTDS LKAIERTFSP EFRNRLDAII QFNPLSPETI
GHVVDKFIFE LEAQLAEKQV SLVIEPDARA WLAEHGFDSK MGARPMARVI QENIKKPLAE
EILFGRLAHG GTVRVGASSG GLTFTYETRQ KAAEPV