Gene Mext_4802 details


Gene Information

Locus tag: Mext_4802
Symbol:
ID: 5834380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5363898
End bp: 5366381
Gene Length: 2484 bp
Protein Length: 827 aa
Translation table: 11
GC content: 65%
IMG OID: 641370599
Product: ATP-dependent Clp protease, ATP-binding subunit clpA
Protein accession: YP_001642241
Protein GI: 163854198
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
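
The coordinate and length fields above agree with one another under a simple reading. The following is a minimal Python sketch of that arithmetic. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither assumption is stated on this page, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5363898, 5366381)
print(length)                     # 2484, matching "Gene Length"
print(protein_length_aa(length))  # 827, matching "Protein Length"
```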


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.953902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0652574
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCAGCT TCTCACGTAG CCTCGAGCAA GCCCTCCACC GCGCCCTGGC CTTGGCCGGA 
GAGCGCCGCC ACGAATACGC GACCCTCGAA CACCTCCTGC TCGCTCTCGT CGATGATCAG
GATGCTGCCG CGGTGATGCG AGCCTGTAAC GTCGAGACCG ACGTACTGCG CCGCAATCTC
GTCGAATACG TCGACACCGA ACTCTCGAAT CTGACGGGCG ACGGGCGCCA GGACGCCAAG
CCGACCGCTG GCTTCCAGCG GGTCATCCAG CGTGCGGTGA TTCATGTCCA GTCCTCGGGT
CGCGAGGAGG TGACCGGGGC CAATGTGCTC GTCGCGATTT TCGCTGAGCG CGAGAGCCAC
GCCGCCTATT TCCTGCAAGA GCAGGACATG ACTCGCTACG ACGCGGTGAA CTACATCAGC
CACGGCATCG CCAAGCGGCC CGGCGCGTCC GAGGCCAAGC CTGTGCGCGG CGCCGAGGAG
GAGGCCGCGT CCGAGCGTCC GGGCGGCGAG GAGACCGAGG CGCGGCCCAA GAAGAAGGGC
GATGCGCTCG ATGCCTACTG CGTCAACCTC AACAAGAAGG CCCGTGACGG CAAGATCGAC
CCGCTGATCG GCCGCCATTC CGAGGTCGAG CGCACGATTC AGGTGCTCTG CCGCCGCCAG
AAGAACAATC CTCTGCTGGT GGGCGACCCC GGCGTTGGCA AGACGGCGAT CGCGGAAGGT
CTCGCGCGTA AGATCATCCA GCACGAGGTG CCGGAAGTTC TGGCCGACGC GACGGTGTTC
TCCCTCGACA TGGGCACGCT GCTGGCCGGC ACCCGCTACC GCGGCGACTT CGAGGAGCGC
CTCAAGCAGG TAATGAAGGA GATCGAGGCG CACCCCAACG CCATCATGTT CATCGACGAG
ATTCACACCG TGATCGGTGC GGGTGCGACC TCGGGCGGTG CGATGGACGC CTCGAACCTG
TTGAAGCCCG CTCTGGCCTC GGGCGCCCTG CGCTGCATCG GCTCGACCAC CTACAAGGAG
TACCGCCAGT ACTTCGAGAA GGATCGCGCC CTGGTGCGCC GCTTCCAGAA GATCGACGTC
AACGAGCCGT CGATCCCGGA TACGATCGAA ATCCTCAAGG GGCTGAAGCC GTATTTCGAG
GAGTTCCACA AGCTCAAATA CACCACCGAG GCCGTGAAGG CGGCGGTGGA GCTGTCGGCC
CGCTACATCA ACGACCGCAA GCTGCCTGAC AAGGCGATCG ACGTGATCGA CGAGACCGGC
GCCTCGCAGA TGCTGGTGCC GGAGGCGCGC CGCAAGCGCA CCATCGGCGT CAAGGAGATC
GAGACCACCA TCGCCACGAT GGCGCGCATC CCGCCGAAAA CCGTGTCGAA GGACGACGCG
GTGGTTCTCA AGAACCTCAC TGAGAACCTC AAGCGGGTCG TCTACGGCCA GACCAACGCC
ATCGAGGCGC TGACCTCGGC GATCAAGCTC GCCCGTGCGG GCCTGCGCGA TCCCGACAAG
CCGATCGGCT CCTACCTGTT CGCCGGCCCG ACCGGCGTCG GTAAGACCGA AGCGGCCAAG
CAGCTTGCCG CCAGCCTCGG CGTCGAGATG CTGCGGTTCG ACATGTCGGA ATACATGGAG
CGCCACACGG TCTCGCGGCT GATCGGTGCC CCGCCCGGCT ATGTCGGCTT CGACCAGGGC
GGCCTGCTCA CCGACGGGAT CGACCAGCAC CCGCACTGCG TGCTCCTGCT CGACGAGATC
GAGAAGGCGC ATCCGGACCT GTTCAACATC CTGTTGCAGG TGATGGATCA CGGCAAGCTG
ACCGACCACA ACGGTAAGCA GGTCGATTTC CGCAACGTCA TCATCATTAT GACGTCGAAC
GCAGGCGCCT CGGATCTGGC GAAGTCGGCC TACGGCTTCA CGCAGTCAAA GCGCACGGGC
GACGACGTCG AGGCGATCAA CCGGCTGTTC GCGCCGGAAT TCCGCAACCG CCTCGACGCG
ATCATCTCGT TCGGCCACCT GCCGAAGGAG GTCGTGGCCA AGGTCGTCGA CAAGTTCGTG
CTCCAGCTCG AAGCACAGCT CGCCGACCGC AACGTCACGA TCGAGCTGTC GGACGAGGCC
CGCGAATGGC TCGTGGAGAA CGGCTACGAC GATGCGATGG GCGCCCGCCC CATGGCCCGT
CTGATCCAAT CCACGATCAA GACGCCGCTC GCCGACGAGG TGCTGTTCGG CCGCCTCAAG
GACGGCGGTG CCGTCAAGGT GGTGCTGAAG AAGCCGGAGG CCGAGGACGG CAAGGGCAAG
GCGGAACTCG GTTTCGAGTT CCCCGCCGGA CCGGTGACGC CGAAGCCGGA GACGGACGTC
GCCAACGCGG CCAAGCGCAA GCGCTCGAAG CCGCGTTCGG CCCCGCGCAA GAAGGCGGCC
AAGCGTGATG GTGGTCCTTC CGGCGGTGGA TCCTCCGGTG GCGGCTCCGT CCGCACCGTG
CCGAAGGTTC CGCTGAAAGT CTGA
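
The GC content field in Gene Information can be recomputed from a sequence block like the one above. This is a minimal sketch that assumes the usual definition (G plus C as a percentage of all A, C, G and T bases); this page does not say how the reported 65% was derived or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Whitespace (the 10-base grouping and line breaks used on this page)
    and any other non-ACGT characters are ignored.
    """
    bases = [b for b in dna.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full "Gene sequence" block as one string.
print(gc_percent("GGCC AATT"))  # 50.0
```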
 
Protein sequence
MPSFSRSLEQ ALHRALALAG ERRHEYATLE HLLLALVDDQ DAAAVMRACN VETDVLRRNL 
VEYVDTELSN LTGDGRQDAK PTAGFQRVIQ RAVIHVQSSG REEVTGANVL VAIFAERESH
AAYFLQEQDM TRYDAVNYIS HGIAKRPGAS EAKPVRGAEE EAASERPGGE ETEARPKKKG
DALDAYCVNL NKKARDGKID PLIGRHSEVE RTIQVLCRRQ KNNPLLVGDP GVGKTAIAEG
LARKIIQHEV PEVLADATVF SLDMGTLLAG TRYRGDFEER LKQVMKEIEA HPNAIMFIDE
IHTVIGAGAT SGGAMDASNL LKPALASGAL RCIGSTTYKE YRQYFEKDRA LVRRFQKIDV
NEPSIPDTIE ILKGLKPYFE EFHKLKYTTE AVKAAVELSA RYINDRKLPD KAIDVIDETG
ASQMLVPEAR RKRTIGVKEI ETTIATMARI PPKTVSKDDA VVLKNLTENL KRVVYGQTNA
IEALTSAIKL ARAGLRDPDK PIGSYLFAGP TGVGKTEAAK QLAASLGVEM LRFDMSEYME
RHTVSRLIGA PPGYVGFDQG GLLTDGIDQH PHCVLLLDEI EKAHPDLFNI LLQVMDHGKL
TDHNGKQVDF RNVIIIMTSN AGASDLAKSA YGFTQSKRTG DDVEAINRLF APEFRNRLDA
IISFGHLPKE VVAKVVDKFV LQLEAQLADR NVTIELSDEA REWLVENGYD DAMGARPMAR
LIQSTIKTPL ADEVLFGRLK DGGAVKVVLK KPEAEDGKGK AELGFEFPAG PVTPKPETDV
ANAAKRKRSK PRSAPRKKAA KRDGGPSGGG SSGGGSVRTV PKVPLKV
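
The gene sequence starts with TTG while the protein starts with M. That fits translation table 11, in which TTG is one of several alternative start codons read as methionine at the initiation position. Below is a minimal sketch of such a translation, using the codon assignments and start codons from NCBI's published table 11; `translate_cds` is a hypothetical helper, not the tool that produced this page, and it treats the first codon of whatever it is given as a potential start.

```python
from itertools import product

# NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence on this page.
print(translate_cds("TTGCCCAGCT TCTCACGTAG CCTCGAGCAA"))  # MPSFSRSLEQ
```

The result matches the first ten residues of the protein sequence listed above.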