Gene Information |
Locus tag | Mchl_1923 |
Symbol | |
ID | 7116738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 1984870 |
End bp | 1986609 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643524687 |
Product | Shikimate kinase, 3-dehydroquinate synthase |
Protein accession | YP_002420714 |
Protein GI | 218529898 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase; [COG0703] Shikimate kinase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
| |
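The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1984870, 1986609)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Both results match the Gene Length (1740 bp) and Protein Length (579 aa) fields. The strand (-) does not affect the length, only which strand the sequence is read from.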
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0886542 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC CCCTGCCGCC GGCGCCGCCG CCCGGCGAAC CGATCGAAGC ACGACTGCGC CGCGGCCTCG GAGCCCGGTC GATCGTGCTC GTCGGCCTGA TGGGGGCGGG CAAGAGCACC GTCGGGCGCC GCCTGGCGGG GCGTCTCGGG CTGATGTTCA AGGATGCCGA TCACGAGATC GAGGCGGCGG CCAAGCTCAC CATCGCCGAC ATTTTCTCGA TCTACGGCGA GGCGAGCTTC CGCGAGGGCG AGGAACGGGT GATCGCGCGT CTGCTGCGCG AGGGGCCGAT GGTGCTGGCC ACCGGAGGCG GCGCCTTCAT GCGCGAGGCG ACGCGGGCGC GGATCGCCGA GGGGGCCATC TCGGTCTGGC TCAAGGCCGA CCTCGACGTG CTGATGCGAC GCGTGCGCAA ACGCAACACG CGCCCTCTGC TCCAGACCGA GGATCCGGAG GCGACCATGC GCACACTGAT GGAGGTGCGC CATCCGGTCT ATGCCCAAGC CGATGTCACG GTGCTGTCCC GCGAAGTGTC CCACGACCGC GTGGTGGAGG ACGTGATGGA AGCTCTCGAT ATCCACATCA ACCCGTCTCA TACGACACAA TCACAACATT TGACATTCAG TATGACGCAG CAACCCTCGC GTGTGAACGT TCCCCTGTCG GGTGGACGCG AATACGATAT TCGGATCGGT CGGGGTCTTA TCGACGCGGT GGGTGCGGAG GCGCGGGATC TCGGCGCCCG GGCTGCCGGT ATCGTCACCG ACGAGACGGT CGCCGGCCTC TACGGCGAGC GTGTGCGGGC CAGCCTCGAG GCCGCCGGGT TGCGCTGCGG CATCATCGCC GTGCCGCCGG GCGAAGCTTC GAAGAGCTAC GCGGAATTCG CCCGCGTCTG CGACGGGCTC CTGGCCCAGA AGATCGAGCG CGGCGACCTC GTCGTGGCGC TCGGCGGCGG CGTGGTCGGC GATCTCGCGG GTTTTGCGGC GGCCTCCCTG CGGCGCGGTG TCCGCTTCCT CCAGGTTCCA ACGACCTTGC TCGCGCAGGT TGATTCCTCG GTGGGGGGAA AGACCGGGAT CAACTCGCCG CTCGGCAAGA ATCTGATCGG CGCCTTCCAC CAGCCCCGCC TCGTACTGGC CGACACCGCC ACCCTCGACA CGCTCTCGGA GCGCGAGATG CGGGCGGGTT ACGCCGAGGT CGCCAAGTAT GGCTTGATCG GGGATGCCGG CTTCTTCGAG TGGTGCGAGG CGAACTGGGC CGGCATCTTC TCCGGCGGGC CGGAGCGCGA CGAGGCGGTG GCCGCCTGCT GCCGCGCCAA GGCCGGCGTC GTGACCCGCG ACGAGCGGGA GGACGGCGAA CGCGCCCTGC TCAATCTCGG CCACACCTTC GGCCACGCCC TGGAGCGGCT GACCGGCTAC GACGCGGCCC GCCTCGTCCA CGGCGAGGGC GTCGCGATCG GTCTGGCGCT GGCCTTCCGC TTCTCGGCCC GGCTCGGCCT CTGCCCCGGC CAGGATGCGG GGCGGGTGGC CAACCACCTC GCGCTCGCCG GCCTTCCGAC CCGCCTGCAA CAGGTGCCCG GCGGGGCCGG CGACCCGGAT GCCCTCCTCG ACGCCATGGC CCAGGACAAG AAGGTCCGCG ACGGGCAGCT CACCTTCATC CTCGCCCACG GCATCGGCCA GAGCTTCATC GCGCCGGGCA TCGATGCGGC GGAGGTGCGG GCCTTCCTGG AGACGGAGTT GCAGGGCTGA
|
Protein sequence | MTVPLPPAPP PGEPIEARLR RGLGARSIVL VGLMGAGKST VGRRLAGRLG LMFKDADHEI EAAAKLTIAD IFSIYGEASF REGEERVIAR LLREGPMVLA TGGGAFMREA TRARIAEGAI SVWLKADLDV LMRRVRKRNT RPLLQTEDPE ATMRTLMEVR HPVYAQADVT VLSREVSHDR VVEDVMEALD IHINPSHTTQ SQHLTFSMTQ QPSRVNVPLS GGREYDIRIG RGLIDAVGAE ARDLGARAAG IVTDETVAGL YGERVRASLE AAGLRCGIIA VPPGEASKSY AEFARVCDGL LAQKIERGDL VVALGGGVVG DLAGFAAASL RRGVRFLQVP TTLLAQVDSS VGGKTGINSP LGKNLIGAFH QPRLVLADTA TLDTLSEREM RAGYAEVAKY GLIGDAGFFE WCEANWAGIF SGGPERDEAV AACCRAKAGV VTRDEREDGE RALLNLGHTF GHALERLTGY DAARLVHGEG VAIGLALAFR FSARLGLCPG QDAGRVANHL ALAGLPTRLQ QVPGGAGDPD ALLDAMAQDK KVRDGQLTFI LAHGIGQSFI APGIDAAEVR AFLETELQG
|
| |
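The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and it assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. It does not special-case alternative start codons; this gene begins with ATG, so that simplification does not matter here.

```python
from itertools import product

# Amino-acid assignments for translation table 11 (identical to the
# standard code), listed in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.
    A trailing incomplete codon is ignored."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
prefix = clean_sequence("ATGACCGTCC CCCTGCCGCC")
print(translate(prefix))  # MTVPLP
```

The output agrees with the first six residues of the protein sequence listed above. Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.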