Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3917 |
Symbol | |
ID | 5834121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4353138 |
End bp | 4354187 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641369708 |
Product | HpcH/HpaI aldolase |
Protein accession | YP_001641359 |
Protein GI | 163853316 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2301] Citrate lyase beta subunit |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.243397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGC CGCGCCGCTT CTTCCAGCCC CTCGCCGCGG GTGCGCCCGA GCCGTTCCGC GAACTGCCGA TCAAGCTCGA GCGGATGATC CACTTCGTGC CGCCGCACAA CGAGAAGGTC CGCGCCCGCG TGCCCGAACT CGCCAAGACG GTCGATGTGG TGCTCGGCAA CCTGGAGGAC GCGGTCCCCG CCGACCAGAA GGAGGCGGCG CGCAAGGGCT TCGTCGAGAT GGCCCGCGCC ACCGATTTCG CAGCCTCCGG CACCGGCCTG TGGACGCGCA TCAATGCCCT GAACTCGCCC TGGATCCTCG ACGACCTGTT CACCATCGTC GCCGAGGTCG GCGCGAAGCT CGACGTGGTG ATGGTGCCGA AGGTCGAGGG CCCCTGGGAC ATCCACTACA TCGACCAGTT GCTGGCCCAG CTCGAGGCGC GCCACGGCGT GACCAAGCCG ATCCTCGTCC ACGCCATCCT CGAGACCGCC GAAGGCGTGG CCAACGTCGA CGCCATCGCC TCCGCCTCGC CGCGCATGCA CGGCATGAGC CTCGGGCCGG CCGATCTCGC GGCGTCCCGC GGCATGAAGA CCACCCGCGT CGGCGGCGGT CACCCGGATT ACCGCGTCCT GTCCGATCCC AAGGGCGATG CCGAGCGGGC GTCCGCCCAG CAGGATCTGT GGCACTACAC CATCGCCAAG ATGGTCGATG CCTGCATGGC CAACGGCATC AAGGCGTTCT ACGGCCCGTT CGGCGACTTC TCCGATTCGG CCGCCTGCGA GGTGCAGTTC CGCAACGCCT TCCTGATGGG CTGCGCCGGC GCCTGGACCC TGCATCCGAG CCAGGTCGCC CTGGCCAAGA CCGTGTTCGC CCCCGATCCG GCCGAGGTGA ACTTCGCCTC CCGCATCGTC GAGGCGATGC CCGACGGCAC CGGCGCGGTG ATGATCGACG GCAAGATGCA GGACGACGCC ACCTGGAAGC AGGCCAAGGT CATCGTCGAT CTCGCCCGGC TCGTGGCCGA GAAGGATCCG GATCTCGCCA AGGTCTACAA TCTGCCCTGA
|
Protein sequence | MKLPRRFFQP LAAGAPEPFR ELPIKLERMI HFVPPHNEKV RARVPELAKT VDVVLGNLED AVPADQKEAA RKGFVEMARA TDFAASGTGL WTRINALNSP WILDDLFTIV AEVGAKLDVV MVPKVEGPWD IHYIDQLLAQ LEARHGVTKP ILVHAILETA EGVANVDAIA SASPRMHGMS LGPADLAASR GMKTTRVGGG HPDYRVLSDP KGDAERASAQ QDLWHYTIAK MVDACMANGI KAFYGPFGDF SDSAACEVQF RNAFLMGCAG AWTLHPSQVA LAKTVFAPDP AEVNFASRIV EAMPDGTGAV MIDGKMQDDA TWKQAKVIVD LARLVAEKDP DLAKVYNLP
|
| |