Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1529 |
Symbol | |
ID | 5833868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1708315 |
End bp | 1711065 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367327 |
Product | ATPase domain-containing protein |
Protein accession | YP_001638999 |
Protein GI | 163850956 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.777004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0499469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGATC TGCTTCGCGA GTTTCTGGTC GAGAGCGCCG AGCATCTGGA CACCGTCGAT GCGGAGCTGG TCCGTTTCGA ACAGGACCCC AATAACCAGC AGATCTTGCG CAACATCTTC CGGCTCGTCC ACACCATCAA GGGGACGTGC GGGTTTCTCG GCCTGCCGCG CCTCGAAGCA CTGGCGCACG CCGCTGAGAC CCTGATGGGA CGCTTCCGCG ACGGCTATCC CGTCAGCGGC GCCTCGGTCA CGCTGATCCT GGCCACCCTC GACCGGCTCA AGGCGATCCT CGGCGACCTG GAAGCCACCG GCTCCGAGCC GGCCGGCACC GACGACGACC TGATCGGGGC CCTGGAGAAG ATGGCCTCCG ACGAATCGCC CGCCGCCGCC GCTGCCCCGC CCCCGCCGCC GCTTCCCGAG CTGCCGCCGA TCGTCGAGCG CGAACTCAAG CCCGGCGAAG TGTCGCTGGA CGATCTCGAA CGGGCTTTCA TGGAAGCCCC CGGCCCCGAC GATTTTGCCG CCGCGCCCGC CATCAATGCC CCGGCCCACG TCTTCGACGC CGGCGAGCCC GCGTTCGACA GCGCGCCCGA GCTCACCGCC CCCGTCCAGG CTCCGGAGCG CCCTGCAGCC CCGGCCGCTG CGTCTCCCGC CGCCGAGGGC GGTGAGAGCG CCGTGGCTGC CAAGGTGCAG ACGATCCGCG TGAACGTCGA CACCATCGAA CACCTGATGA CGATGGTCTC GGAACTGGTG CTGACCCGCA ACCAGCTCCT CGAGATCGCC CGCCGCCACG AGGATTCCGG CTACAAGGTT CCGCTCCAGC GCCTCAGCCA CGTCACGGCC GAGCTGCAGG AAGGCGTGAT GAAGACGCGC ATGCAGCCGA TCGGCAATGC GTGGCAGAAG CTGCCCCGCG TCGTGCGCGA CCTCTCGGCC GAACTCGGCA AGGGCATCGA CCTCGTGATG TCGGGCGCCG AGACCGAACT CGACCGCCAG GTGCTCGACG TCATCAAGGA CCCGCTCACC CACATGGTGC GCAACTCGGC CGACCACGGC ATCGAGTCCA CCAACGAGCG CCTCAAGGCC GGCAAGCCCG CCCGCGGCTC GATCCGCCTC TCCGCCTACC ACGAGGGCGG CACGATCACG ATCGAGATCG CCGACGACGG CAAGGGCCTC GACCTCGCCG CGATCCGCAA GAAGGCGATC GAGCGCAACT TCGCGCCTGC CGCCGACATC GAGCGGATGA CCGACGCGCA GGTCGCGAAG TTCATCTTCC ACGCCGGCTT CTCCACCGCC AAGGCGATCA CCTCGGTCTC CGGCCGCGGC GTCGGCATGG ACGTGGTCAA GACCAACATC GAGACCATCG GCGGCGTGGT CGACATCGCC ACCGAGCTCG GCAAGGGCAC CACCTTCACC ATCAAGATCC CGCTGACGCT CGCCATCGTC TCGGCGCTGA TCGTCAAGGC CGGCGCGCAG CGCTACGCCG TGCCGCAGAT CGCGGTGCTC GAACTGGTGC GGGTCGATCC CAAGGGCGAC AACACCAGCG CGAACTCGAT CGAGCGCATC CACGGCGCCC CGGTGCTGCG CCTGCGCGAG CGGCTCCTGC CGATCGTCAC CCTCAACGGG CTGATGCGCG GGCAGGCGAC CGTCGAGGAG GGCGAGGTCG TCGAGTCCGG CTTCGTGGTG GTGGCCCAGG TCGGCCGGCA GCGCTTCGGC GTGCTTGTGG ACGAGGTCTT CCATACGGAA GAAATCGTCG TGAAGCCGAT GTCGTCGAAG CTGCGGCACA TCCCGCTCTT TGCCGGCAAC ACGATCCTCG GCGACGGCGC CGTGGTGCTG ATCGTCGATC CCAACGGCGT CGCCAAGCTG GTGGGCCAGA GCGCGCAGTC CGGCGCGGCG ACGGAGACCG AGTCCGACGA GGTCGAGGCC GGCGACGCCA AGGCGACGCT CCTCGTGTTC AAGGGGGGTG CCGGCGGCTT CAAGGCGGTG CCGCTCTCGC TCGTCACCCG CCTCGAGGAG ATCGACGCCT CGAAGATCGA GCATCTCGGC GGACGTCCGC TGATCCAGTA CCGCGGCCGC CTGATGCCGC TGGTGCCGGC CGATCCCTCG GTTCCGATCC GCTCGGAGGG CAACCAGGCG CTGGTCGTGT TCTCCGACGG CGACCGGGCG ATGGGCCTCG TCGTGGACGA GATCGTCGAC ATCGTCGAGG AGCGCCTCGA CATCGAGATC TCGGCCGACC GCTCCGATCT CATCGGCTCG GCGGTTCTGC GGGGTCGGGC GACCGACATC ATCAACATCG CCCACTTCCT GCCGCTCGCC TACGACGACT GGGCGCGGGG CCCGCGCAAG ACGGTGGTCA AGGCGCCCTC GCTCCTGCTC GTGGACGACT CGGCCTTCTT CCGCGACATG CTCACCCCCG TCCTCAAGGC GGCGGGCTAC AGCGTGACGA CCGCGTCCTC CGCCGAGGAG GCACTCGGTC TGCTCAAGGG GAGCGCCGGC CTCGACCTCG TGGTCAGCGA TCTCGACATG CCCGGCCGCA GCGGCTTCGA CCTCGTCGCC GCCATGCGCA AGAGCGGCGG GCGGCTGGCC GAGATGCCGG TGATCGCGCT CACCGGCACG GTCGCCCCCG ACGCCATCGA ACAGGCGCGG CGCCTCGCGA TCAGCGATCT CGTCGCCAAG TTCGACCGCA GCGGCCTGCT CGCGGCGCTC GCCGAGATCG GCGAAGCCGC CCAGGCGGCC GACGCCCGCG CCGCTGCCTG A
|
Protein sequence | MDDLLREFLV ESAEHLDTVD AELVRFEQDP NNQQILRNIF RLVHTIKGTC GFLGLPRLEA LAHAAETLMG RFRDGYPVSG ASVTLILATL DRLKAILGDL EATGSEPAGT DDDLIGALEK MASDESPAAA AAPPPPPLPE LPPIVERELK PGEVSLDDLE RAFMEAPGPD DFAAAPAINA PAHVFDAGEP AFDSAPELTA PVQAPERPAA PAAASPAAEG GESAVAAKVQ TIRVNVDTIE HLMTMVSELV LTRNQLLEIA RRHEDSGYKV PLQRLSHVTA ELQEGVMKTR MQPIGNAWQK LPRVVRDLSA ELGKGIDLVM SGAETELDRQ VLDVIKDPLT HMVRNSADHG IESTNERLKA GKPARGSIRL SAYHEGGTIT IEIADDGKGL DLAAIRKKAI ERNFAPAADI ERMTDAQVAK FIFHAGFSTA KAITSVSGRG VGMDVVKTNI ETIGGVVDIA TELGKGTTFT IKIPLTLAIV SALIVKAGAQ RYAVPQIAVL ELVRVDPKGD NTSANSIERI HGAPVLRLRE RLLPIVTLNG LMRGQATVEE GEVVESGFVV VAQVGRQRFG VLVDEVFHTE EIVVKPMSSK LRHIPLFAGN TILGDGAVVL IVDPNGVAKL VGQSAQSGAA TETESDEVEA GDAKATLLVF KGGAGGFKAV PLSLVTRLEE IDASKIEHLG GRPLIQYRGR LMPLVPADPS VPIRSEGNQA LVVFSDGDRA MGLVVDEIVD IVEERLDIEI SADRSDLIGS AVLRGRATDI INIAHFLPLA YDDWARGPRK TVVKAPSLLL VDDSAFFRDM LTPVLKAAGY SVTTASSAEE ALGLLKGSAG LDLVVSDLDM PGRSGFDLVA AMRKSGGRLA EMPVIALTGT VAPDAIEQAR RLAISDLVAK FDRSGLLAAL AEIGEAAQAA DARAAA
|
| |