Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1801 |
Symbol | |
ID | 5832923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2021643 |
End bp | 2024411 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367600 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001639271 |
Protein GI | 163851228 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.369049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA CGCTGCACGC CCGGCCGTCA GCCGCGACCG ACACGACCTT CGCGCCGCCG GTCATCACCG GCACGGCCAC CGAAGACGCC CTCGAGATTC TGTTTCACGC CCTGCTCGAT GTGGCCCGCC GCCACGATCC CGAGCTGGAA GACGTGCTGC ATGGCCGCGC CGACATCTCC TCCTTCACGC CGGAGATGCT GGCGCGCGCG CTTCAGGTGC AGGGCATCTG GTTTCAGCTC GTCTCGATCG CCGAGCAGAA TGCGGCCATG CGCCGCCGCC GCCACGTCGA GCGCGACCAG GGCCGTGAGG CGCTGAACGG CTCCTTCGCC AAGGTTCTGG CCGAGGCCTC CGCCCGGGGC ATCGGCCCGC AGCAGATCCA CGCGCTGCTC AAGGATCTTC GCATCCGCCC GACCATCACT GCGCATCCCA CCGAGGGCAA GCGCGTCACG GTGCTGGAGA AGCTGCGCCG GATCTACCTC GTGCTGCGTG AGCTGGAGCT GCCGCGCTGG ACCGAGCGCG AGCGCAACGG CCTGATGAAC GAGCTGCGCG ACCAGATCGA GCTGATCTGG ATGACGGGCG AGCTGCATCT GGAGAAAGCC ACCGTCGAGC GCGAGGTCGC CTGGGGCCTG CACTTCTTCG ACGAGACCCT GTTCGAGATG CTGCCCGAGA TGCTGCTCTC GCTCGAAGAG AGCCTCGCGC AGTACTATCC CGACGAGACC TTCGAGGTGC CGCCGTTCTT CCAGTTCGGA AGCTGGATCG GCGGCGACCG CGACGGCAAC CCCTACGTCA CCGCCAGCGT CACCCGCGAG ACGCTTCAGC GCAACGCGCT GGCGTCGCTC CGCCGCTACC GCGACGGCAT CACCCATCTC GGCCGGGTGC TCTCGATCAC CGAGCGCTCG CTGCCGGTGC CCGAGACCTT CCGCAGCGAA CTCGCGCATA TGCTGGCCGA GTCCGGCGAT GCGCGGGCGA TCGCCAACCG CAATCCCGGC GAGGCCTATC GCCAGTTCCT CTCCTGCGTG CTGCGCAAGC TCAAGGCGAC GATCGCCCGC AACAAGGGCG CCCGCTCGGT CGGGCCGGAT TATCCGAGCG CCGACGGGCT CATCAACGAT CTGCGCACCC TGGAGAAGGG GCTGGCCGAC GCCAAATGCG GCGCGCTCGC CACCGATATC GTGCGGCCCG TGCGGCGGAT GGTCGAGATC TTCCGCTTCT CCACCGTGCG GCTCGATCTG CGCGAGAACT CGACCCGCAC CACCAAGACG CTGCACGCGC TGTGGAAGCT GCGCAACGGC GACCGCGAGC CGCCGGCGCT GGATTCGCCG GCCTGGAAGG ATTGGCTGCT CACCGAGCTG GCCCGGCCGC GCACGCCCGA AACCTCGTTC GAGGATTTCG CCGACCGTCT GCCCGACGAC GCGCGCGAGA CGCTCGCCAC CTTCGCCCTG GTCGGCGAGA TGCGCGACAC TCTCGACCGC GAGGCCTTCG GCGCCTTCAT CCTGTCGATG ACGCGCTCCA CCGTCGACGT GCTCGGCGCC TACCTGCTGG CCAAGGAGGC CGGCATCTTC CTCGACACGA CCGGCACCGA GATCTGCCCG CTGCCGATCG TGCCGCTGTT CGAGACCATC GACGACCTGC GCGCCGCCCC GGCGATCATG AAGGAACTGC TCGGCATCCC CGTGGTGCGC CGCTCCACCC GCTGGCAGGG CGGGGTGCAG GAAGTGATGA TCGGCTACTC CGATTCCAAC AAGGACGGCG GCTTCATTGC CTCGAACTGG GAGCTCTACA AGGCGCAGGT GCGGCTGACC ACGCTCGGCA ACCATCTCGG CGTGCCGATC GCCTTCTTCC ACGGCCGCGG CGGTTCGGTG AGCCGCGGCG GCGTGCCGAC CCATCGCGGC ATCGCGGCCC AGCCGCCGGG CTCGATCCAG GGCCGCTTCC GCATCACCGA GCAGGGCGAA GTCGTCTCGT TCAAATACGC CAACCGCGGC ACCGCCGCCT ACCAGATGGA GCTGCTGGCG GCTTCGGTGT TCGAGCACGC CCTGCTCTCG GAGGGCAACG GCAACGGCTC GCGCGCCGAA TTCGACGACG CGCTGGAGGC GCTCTCGGGC GCCTCGCGGG CGGCCTACGT CAACCTGCTC CAGGCTGAGG GCCTCGTCGA TTACTTCCAG GCGGCGAGCC CGCTCGACGA GATCTCGCTG CTCAATATCG GCTCGCGCCC GGCCCGGCGC TTCGGCGCCA AGTCGCTCTC GGATCTGCGC GCGATCCCCT GGGTGTTCGC GTGGTCGCAG AACCGGCACG TCATCACCGG TTGGTACGGC GTCGGCTCGG GGCTGAAGAG CTTCATCGAC GTGCGCGGCG AGGCGGGCGA AGCCCTGCTG AGGCGCCTGT TCCGGGATTG CCGGGTGTTC CGCCTTGTCC TCGACGAGGT CGAGAAGACG CTGCTGATGG TCGATCTCGA GATCGCCCGC GACTATGCCG GCCTCGTCGA GGACGAGGGC ATCCGCGCGC GCATCTTCGG GATGATCGAG GCGGAATACG CGCTGACGCG GGAGATGGTG CTGCGGGTGT CGGGCGACAG CGAGCTCGCC CAGCGCTTCC CGCAATTCAG CGAACGCCTG CGCGGCCGGC TTCCGACCAT CAATCAGGTC AGCCGCGAGC AGGTCGAGCT GCTGCGCCGC TACCGCAGCG AGACCGACGA GGACAAGCGC GAGGCGGTGA AGTCCGCGCT GCTGCTGTCG ATCAACTGCA TCGCCGTCGG CTTCGGCGCG ACTGGATAA
|
Protein sequence | MTKTLHARPS AATDTTFAPP VITGTATEDA LEILFHALLD VARRHDPELE DVLHGRADIS SFTPEMLARA LQVQGIWFQL VSIAEQNAAM RRRRHVERDQ GREALNGSFA KVLAEASARG IGPQQIHALL KDLRIRPTIT AHPTEGKRVT VLEKLRRIYL VLRELELPRW TERERNGLMN ELRDQIELIW MTGELHLEKA TVEREVAWGL HFFDETLFEM LPEMLLSLEE SLAQYYPDET FEVPPFFQFG SWIGGDRDGN PYVTASVTRE TLQRNALASL RRYRDGITHL GRVLSITERS LPVPETFRSE LAHMLAESGD ARAIANRNPG EAYRQFLSCV LRKLKATIAR NKGARSVGPD YPSADGLIND LRTLEKGLAD AKCGALATDI VRPVRRMVEI FRFSTVRLDL RENSTRTTKT LHALWKLRNG DREPPALDSP AWKDWLLTEL ARPRTPETSF EDFADRLPDD ARETLATFAL VGEMRDTLDR EAFGAFILSM TRSTVDVLGA YLLAKEAGIF LDTTGTEICP LPIVPLFETI DDLRAAPAIM KELLGIPVVR RSTRWQGGVQ EVMIGYSDSN KDGGFIASNW ELYKAQVRLT TLGNHLGVPI AFFHGRGGSV SRGGVPTHRG IAAQPPGSIQ GRFRITEQGE VVSFKYANRG TAAYQMELLA ASVFEHALLS EGNGNGSRAE FDDALEALSG ASRAAYVNLL QAEGLVDYFQ AASPLDEISL LNIGSRPARR FGAKSLSDLR AIPWVFAWSQ NRHVITGWYG VGSGLKSFID VRGEAGEALL RRLFRDCRVF RLVLDEVEKT LLMVDLEIAR DYAGLVEDEG IRARIFGMIE AEYALTREMV LRVSGDSELA QRFPQFSERL RGRLPTINQV SREQVELLRR YRSETDEDKR EAVKSALLLS INCIAVGFGA TG
|
| |