Gene Mext_1529 details


Gene Information

Locus tag: Mext_1529
Symbol:
ID: 5833868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1708315
End bp: 1711065
Gene Length: 2751 bp
Protein Length: 916 aa
Translation table: 11
GC content: 70%
IMG OID: 641367327
Product: ATPase domain-containing protein
Protein accession: YP_001638999
Protein GI: 163850956
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases; [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
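
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1708315
end_bp = 1711065
gene_length_bp = 2751
protein_length_aa = 916


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Codon count excluding one terminal stop codon (assumes an unspliced CDS)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, 1711065 - 1708315 + 1 gives 2751 bp, and 2751 / 3 - 1 gives 916 residues, which agrees with the listed gene and protein lengths.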


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.777004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0499469
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATC TGCTTCGCGA GTTTCTGGTC GAGAGCGCCG AGCATCTGGA CACCGTCGAT 
GCGGAGCTGG TCCGTTTCGA ACAGGACCCC AATAACCAGC AGATCTTGCG CAACATCTTC
CGGCTCGTCC ACACCATCAA GGGGACGTGC GGGTTTCTCG GCCTGCCGCG CCTCGAAGCA
CTGGCGCACG CCGCTGAGAC CCTGATGGGA CGCTTCCGCG ACGGCTATCC CGTCAGCGGC
GCCTCGGTCA CGCTGATCCT GGCCACCCTC GACCGGCTCA AGGCGATCCT CGGCGACCTG
GAAGCCACCG GCTCCGAGCC GGCCGGCACC GACGACGACC TGATCGGGGC CCTGGAGAAG
ATGGCCTCCG ACGAATCGCC CGCCGCCGCC GCTGCCCCGC CCCCGCCGCC GCTTCCCGAG
CTGCCGCCGA TCGTCGAGCG CGAACTCAAG CCCGGCGAAG TGTCGCTGGA CGATCTCGAA
CGGGCTTTCA TGGAAGCCCC CGGCCCCGAC GATTTTGCCG CCGCGCCCGC CATCAATGCC
CCGGCCCACG TCTTCGACGC CGGCGAGCCC GCGTTCGACA GCGCGCCCGA GCTCACCGCC
CCCGTCCAGG CTCCGGAGCG CCCTGCAGCC CCGGCCGCTG CGTCTCCCGC CGCCGAGGGC
GGTGAGAGCG CCGTGGCTGC CAAGGTGCAG ACGATCCGCG TGAACGTCGA CACCATCGAA
CACCTGATGA CGATGGTCTC GGAACTGGTG CTGACCCGCA ACCAGCTCCT CGAGATCGCC
CGCCGCCACG AGGATTCCGG CTACAAGGTT CCGCTCCAGC GCCTCAGCCA CGTCACGGCC
GAGCTGCAGG AAGGCGTGAT GAAGACGCGC ATGCAGCCGA TCGGCAATGC GTGGCAGAAG
CTGCCCCGCG TCGTGCGCGA CCTCTCGGCC GAACTCGGCA AGGGCATCGA CCTCGTGATG
TCGGGCGCCG AGACCGAACT CGACCGCCAG GTGCTCGACG TCATCAAGGA CCCGCTCACC
CACATGGTGC GCAACTCGGC CGACCACGGC ATCGAGTCCA CCAACGAGCG CCTCAAGGCC
GGCAAGCCCG CCCGCGGCTC GATCCGCCTC TCCGCCTACC ACGAGGGCGG CACGATCACG
ATCGAGATCG CCGACGACGG CAAGGGCCTC GACCTCGCCG CGATCCGCAA GAAGGCGATC
GAGCGCAACT TCGCGCCTGC CGCCGACATC GAGCGGATGA CCGACGCGCA GGTCGCGAAG
TTCATCTTCC ACGCCGGCTT CTCCACCGCC AAGGCGATCA CCTCGGTCTC CGGCCGCGGC
GTCGGCATGG ACGTGGTCAA GACCAACATC GAGACCATCG GCGGCGTGGT CGACATCGCC
ACCGAGCTCG GCAAGGGCAC CACCTTCACC ATCAAGATCC CGCTGACGCT CGCCATCGTC
TCGGCGCTGA TCGTCAAGGC CGGCGCGCAG CGCTACGCCG TGCCGCAGAT CGCGGTGCTC
GAACTGGTGC GGGTCGATCC CAAGGGCGAC AACACCAGCG CGAACTCGAT CGAGCGCATC
CACGGCGCCC CGGTGCTGCG CCTGCGCGAG CGGCTCCTGC CGATCGTCAC CCTCAACGGG
CTGATGCGCG GGCAGGCGAC CGTCGAGGAG GGCGAGGTCG TCGAGTCCGG CTTCGTGGTG
GTGGCCCAGG TCGGCCGGCA GCGCTTCGGC GTGCTTGTGG ACGAGGTCTT CCATACGGAA
GAAATCGTCG TGAAGCCGAT GTCGTCGAAG CTGCGGCACA TCCCGCTCTT TGCCGGCAAC
ACGATCCTCG GCGACGGCGC CGTGGTGCTG ATCGTCGATC CCAACGGCGT CGCCAAGCTG
GTGGGCCAGA GCGCGCAGTC CGGCGCGGCG ACGGAGACCG AGTCCGACGA GGTCGAGGCC
GGCGACGCCA AGGCGACGCT CCTCGTGTTC AAGGGGGGTG CCGGCGGCTT CAAGGCGGTG
CCGCTCTCGC TCGTCACCCG CCTCGAGGAG ATCGACGCCT CGAAGATCGA GCATCTCGGC
GGACGTCCGC TGATCCAGTA CCGCGGCCGC CTGATGCCGC TGGTGCCGGC CGATCCCTCG
GTTCCGATCC GCTCGGAGGG CAACCAGGCG CTGGTCGTGT TCTCCGACGG CGACCGGGCG
ATGGGCCTCG TCGTGGACGA GATCGTCGAC ATCGTCGAGG AGCGCCTCGA CATCGAGATC
TCGGCCGACC GCTCCGATCT CATCGGCTCG GCGGTTCTGC GGGGTCGGGC GACCGACATC
ATCAACATCG CCCACTTCCT GCCGCTCGCC TACGACGACT GGGCGCGGGG CCCGCGCAAG
ACGGTGGTCA AGGCGCCCTC GCTCCTGCTC GTGGACGACT CGGCCTTCTT CCGCGACATG
CTCACCCCCG TCCTCAAGGC GGCGGGCTAC AGCGTGACGA CCGCGTCCTC CGCCGAGGAG
GCACTCGGTC TGCTCAAGGG GAGCGCCGGC CTCGACCTCG TGGTCAGCGA TCTCGACATG
CCCGGCCGCA GCGGCTTCGA CCTCGTCGCC GCCATGCGCA AGAGCGGCGG GCGGCTGGCC
GAGATGCCGG TGATCGCGCT CACCGGCACG GTCGCCCCCG ACGCCATCGA ACAGGCGCGG
CGCCTCGCGA TCAGCGATCT CGTCGCCAAG TTCGACCGCA GCGGCCTGCT CGCGGCGCTC
GCCGAGATCG GCGAAGCCGC CCAGGCGGCC GACGCCCGCG CCGCTGCCTG A
 
Protein sequence
MDDLLREFLV ESAEHLDTVD AELVRFEQDP NNQQILRNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG RFRDGYPVSG ASVTLILATL DRLKAILGDL EATGSEPAGT DDDLIGALEK
MASDESPAAA AAPPPPPLPE LPPIVERELK PGEVSLDDLE RAFMEAPGPD DFAAAPAINA
PAHVFDAGEP AFDSAPELTA PVQAPERPAA PAAASPAAEG GESAVAAKVQ TIRVNVDTIE
HLMTMVSELV LTRNQLLEIA RRHEDSGYKV PLQRLSHVTA ELQEGVMKTR MQPIGNAWQK
LPRVVRDLSA ELGKGIDLVM SGAETELDRQ VLDVIKDPLT HMVRNSADHG IESTNERLKA
GKPARGSIRL SAYHEGGTIT IEIADDGKGL DLAAIRKKAI ERNFAPAADI ERMTDAQVAK
FIFHAGFSTA KAITSVSGRG VGMDVVKTNI ETIGGVVDIA TELGKGTTFT IKIPLTLAIV
SALIVKAGAQ RYAVPQIAVL ELVRVDPKGD NTSANSIERI HGAPVLRLRE RLLPIVTLNG
LMRGQATVEE GEVVESGFVV VAQVGRQRFG VLVDEVFHTE EIVVKPMSSK LRHIPLFAGN
TILGDGAVVL IVDPNGVAKL VGQSAQSGAA TETESDEVEA GDAKATLLVF KGGAGGFKAV
PLSLVTRLEE IDASKIEHLG GRPLIQYRGR LMPLVPADPS VPIRSEGNQA LVVFSDGDRA
MGLVVDEIVD IVEERLDIEI SADRSDLIGS AVLRGRATDI INIAHFLPLA YDDWARGPRK
TVVKAPSLLL VDDSAFFRDM LTPVLKAAGY SVTTASSAEE ALGLLKGSAG LDLVVSDLDM
PGRSGFDLVA AMRKSGGRLA EMPVIALTGT VAPDAIEQAR RLAISDLVAK FDRSGLLAAL
AEIGEAAQAA DARAAA
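
The sequences above are displayed in space-separated blocks of ten letters. The following is a rough sketch of how such text might be ungrouped, checked for GC content, and translated; the helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may serve as starts, and it uses only the first displayed line of each sequence as input.

```python
# Genetic code as a 64-character string in NCBI order (bases T, C, A, G).
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(text):
    """Remove the spaces and line breaks used to display 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First displayed line of the gene sequence and the matching protein blocks.
first_dna_line = ungroup(
    "ATGGACGATC TGCTTCGCGA GTTTCTGGTC GAGAGCGCCG AGCATCTGGA CACCGTCGAT"
)
first_protein_blocks = ungroup("MDDLLREFLV ESAEHLDTVD")

print(translate(first_dna_line) == first_protein_blocks)
print(gc_fraction(first_dna_line))
```

The GC fraction of a single 60-base line need not match the gene-wide GC content listed in the record; passing the full ungrouped gene sequence to `gc_fraction` would be needed for that comparison.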