Gene Mchl_1923 details

Gene Information

Locus tag: Mchl_1923
Symbol:
ID: 7116738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1984870
End bp: 1986609
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 70%
IMG OID: 643524687
Product: Shikimate kinase, 3-dehydroquinate synthase
Protein accession: YP_002420714
Protein GI: 218529898
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
        [COG0703] Shikimate kinase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
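
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1984870, 1986609)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```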


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0886542
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGTCC CCCTGCCGCC GGCGCCGCCG CCCGGCGAAC CGATCGAAGC ACGACTGCGC 
CGCGGCCTCG GAGCCCGGTC GATCGTGCTC GTCGGCCTGA TGGGGGCGGG CAAGAGCACC
GTCGGGCGCC GCCTGGCGGG GCGTCTCGGG CTGATGTTCA AGGATGCCGA TCACGAGATC
GAGGCGGCGG CCAAGCTCAC CATCGCCGAC ATTTTCTCGA TCTACGGCGA GGCGAGCTTC
CGCGAGGGCG AGGAACGGGT GATCGCGCGT CTGCTGCGCG AGGGGCCGAT GGTGCTGGCC
ACCGGAGGCG GCGCCTTCAT GCGCGAGGCG ACGCGGGCGC GGATCGCCGA GGGGGCCATC
TCGGTCTGGC TCAAGGCCGA CCTCGACGTG CTGATGCGAC GCGTGCGCAA ACGCAACACG
CGCCCTCTGC TCCAGACCGA GGATCCGGAG GCGACCATGC GCACACTGAT GGAGGTGCGC
CATCCGGTCT ATGCCCAAGC CGATGTCACG GTGCTGTCCC GCGAAGTGTC CCACGACCGC
GTGGTGGAGG ACGTGATGGA AGCTCTCGAT ATCCACATCA ACCCGTCTCA TACGACACAA
TCACAACATT TGACATTCAG TATGACGCAG CAACCCTCGC GTGTGAACGT TCCCCTGTCG
GGTGGACGCG AATACGATAT TCGGATCGGT CGGGGTCTTA TCGACGCGGT GGGTGCGGAG
GCGCGGGATC TCGGCGCCCG GGCTGCCGGT ATCGTCACCG ACGAGACGGT CGCCGGCCTC
TACGGCGAGC GTGTGCGGGC CAGCCTCGAG GCCGCCGGGT TGCGCTGCGG CATCATCGCC
GTGCCGCCGG GCGAAGCTTC GAAGAGCTAC GCGGAATTCG CCCGCGTCTG CGACGGGCTC
CTGGCCCAGA AGATCGAGCG CGGCGACCTC GTCGTGGCGC TCGGCGGCGG CGTGGTCGGC
GATCTCGCGG GTTTTGCGGC GGCCTCCCTG CGGCGCGGTG TCCGCTTCCT CCAGGTTCCA
ACGACCTTGC TCGCGCAGGT TGATTCCTCG GTGGGGGGAA AGACCGGGAT CAACTCGCCG
CTCGGCAAGA ATCTGATCGG CGCCTTCCAC CAGCCCCGCC TCGTACTGGC CGACACCGCC
ACCCTCGACA CGCTCTCGGA GCGCGAGATG CGGGCGGGTT ACGCCGAGGT CGCCAAGTAT
GGCTTGATCG GGGATGCCGG CTTCTTCGAG TGGTGCGAGG CGAACTGGGC CGGCATCTTC
TCCGGCGGGC CGGAGCGCGA CGAGGCGGTG GCCGCCTGCT GCCGCGCCAA GGCCGGCGTC
GTGACCCGCG ACGAGCGGGA GGACGGCGAA CGCGCCCTGC TCAATCTCGG CCACACCTTC
GGCCACGCCC TGGAGCGGCT GACCGGCTAC GACGCGGCCC GCCTCGTCCA CGGCGAGGGC
GTCGCGATCG GTCTGGCGCT GGCCTTCCGC TTCTCGGCCC GGCTCGGCCT CTGCCCCGGC
CAGGATGCGG GGCGGGTGGC CAACCACCTC GCGCTCGCCG GCCTTCCGAC CCGCCTGCAA
CAGGTGCCCG GCGGGGCCGG CGACCCGGAT GCCCTCCTCG ACGCCATGGC CCAGGACAAG
AAGGTCCGCG ACGGGCAGCT CACCTTCATC CTCGCCCACG GCATCGGCCA GAGCTTCATC
GCGCCGGGCA TCGATGCGGC GGAGGTGCGG GCCTTCCTGG AGACGGAGTT GCAGGGCTGA
 
Protein sequence
MTVPLPPAPP PGEPIEARLR RGLGARSIVL VGLMGAGKST VGRRLAGRLG LMFKDADHEI 
EAAAKLTIAD IFSIYGEASF REGEERVIAR LLREGPMVLA TGGGAFMREA TRARIAEGAI
SVWLKADLDV LMRRVRKRNT RPLLQTEDPE ATMRTLMEVR HPVYAQADVT VLSREVSHDR
VVEDVMEALD IHINPSHTTQ SQHLTFSMTQ QPSRVNVPLS GGREYDIRIG RGLIDAVGAE
ARDLGARAAG IVTDETVAGL YGERVRASLE AAGLRCGIIA VPPGEASKSY AEFARVCDGL
LAQKIERGDL VVALGGGVVG DLAGFAAASL RRGVRFLQVP TTLLAQVDSS VGGKTGINSP
LGKNLIGAFH QPRLVLADTA TLDTLSEREM RAGYAEVAKY GLIGDAGFFE WCEANWAGIF
SGGPERDEAV AACCRAKAGV VTRDEREDGE RALLNLGHTF GHALERLTGY DAARLVHGEG
VAIGLALAFR FSARLGLCPG QDAGRVANHL ALAGLPTRLQ QVPGGAGDPD ALLDAMAQDK
KVRDGQLTFI LAHGIGQSFI APGIDAAEVR AFLETELQG
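
The sequences above are displayed in space-separated blocks of 10, so they need to be cleaned before any analysis. The following is a minimal sketch of stripping that layout and computing a GC fraction. The helper names are hypothetical, and only the first displayed line of the gene sequence is used as input. Applied to the full 1740 bp gene sequence, the same function should give a value close to the 70% GC content listed in the record.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGACCGTCC CCCTGCCGCC GGCGCCGCCG CCCGGCGAAC CGATCGAAGC ACGACTGCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```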