Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1641 |
Symbol | |
ID | 5832582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1830117 |
End bp | 1831931 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367439 |
Product | shikimate kinase., 3-dehydroquinate synthase |
Protein accession | YP_001639111 |
Protein GI | 163851068 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase [COG0703] Shikimate kinase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.124892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCTTT CGACTCGGCG CCGACGGGGG TACCGGACGC TTCGCCGCTT CCGCAAAACC GTCTATGACC GCCCGATGAC CGTCCCCCTG CCGCCGGCGC CGCCGCCCGG CGAACCGATC GAAGCACGAC TGCGCCGCGG CCTCGGAGCC CGGTCGATCG TGCTCGTCGG CCTGATGGGG GCGGGCAAGA GCACCGTCGG GCGCCGCCTG GCGGGGCGTC TCGGGCTGAT GTTCAAGGAT GCCGACCACG AGATCGAGGC GGCGGCCAAG CTCACCATCG CCGACATTTT CTCGATCTAC GGCGAGGCGA GCTTCCGCGA GGGCGAGGAA CGGGTGATCG CGCGTCTGCT GCGCGAGGGG CCGATGGTGC TGGCCACCGG AGGCGGCGCC TTCATGCGCG AGGCGACGCG GGCGCGGATC GCCGAGGGGG CCATCTCGGT CTGGCTCAAG GCCGACCTCG ACGTGCTGAT GCGACGCGTG CGCAAACGCA ACACGCGCCC TCTGCTCCAG ACCGAGGATC CGGAGGCGAC CATGCGCACA CTGATGGAGG TGCGCCATCC GGTCTATGCC CAAGCCGATG TCACGGTGCT GTCCCGCGAA GTGTCCCACG ACCGCGTGGT GGAGGACGTG ATGGAAGCTC TCGATATCCA CATCAACCCG TCTCATACGA CACAATCACA ACATTTGACA TTCAGTATGA CGCAGCAACC CTCGCGTGTG AACGTTCCCC TGTCGGGCGG ACGCGAATAC GATATTCGGA TCGGTCGGGG TCTTATCGAC GCGGTGGGTG CGGAGGCGCG GGATCTCGGC GCCCGGGCCG CCGGCATCGT CACCGACGAG ACGGTCGCCG GCCTCTACGG CGAGCGTGTG CGGGCCAGCC TCGAGGCCGC CGGGTTGCGC TGCGGCATCA TCGCCGTGCC GCCGGGTGAG GCCTCGAAGA GCTACGCGGA ATTCGCCCGC GTCTGCGACG GCCTGCTCGC CCAGAAGATC GAGCGCGGCG ACCTCGTCGT GGCGCTCGGC GGCGGCGTGG TCGGCGATCT CGCGGGCTTC GCGGCGGCCT CCCTGCGGCG CGGTGTCCGC TTTCTCCAGG TTCCGACCAC CCTGCTCGCG CAGGTTGATT CCTCGGTGGG GGGAAAGACC GGGATCAATT CGCCGCTCGG CAAGAATCTG ATCGGCGCCT TCCACCAGCC CCGCCTCGTA CTGGCCGACA CCGCCACCCT CGACACGCTC TCGGAGCGCG AGATGCGGGC GGGTTACGCC GAGGTCGCCA AGTATGGCTT GATCGGGGAT GCCGGCTTCT TCGAGTGGTG CGAGGCGAAC TGGGCCGGCA TCTTCTCCGG TGGGCCGGAG CGCGACGAGG CCGTGGCCGC CTGCTGCCGC GCCAAGGCCG GCGTCGTGAC CCGCGACGAG CGGGAGGACG GCGAGCGCGC CCTGCTCAAT CTCGGCCATA CCTTCGGCCA CGCCCTGGAG CGGCTGACCG GCTACGACGC GGCCCGCCTC GTCCACGGCG AGGGCGTCGC GATCGGTCTG GCGCTGGCCT TCCGCTTCTC GGCCCGGCTC GGCCTCTGCC CCGGCCAGGA TGCGGGACGC GTGGCCAACC ACCTCGCGCT CGCCGGCCTG CCGACCCGCC TGCAACAGGT GCCCGGCGGG GCGGGTGACC CGGACGCCCT CCTCGACGCC ATGGCCCAGG ACAAGAAGGT CCGCGACGGG CAGCTCACCT TCATCCTCGC CCACGGCATC GGCCAGAGCT TCATCGCGCC GGGCATCGAT GCGGCGGAGG TGCGGGCCTT CCTGGAGGCG GAACTGCGGG GCTGA
|
Protein sequence | MGLSTRRRRG YRTLRRFRKT VYDRPMTVPL PPAPPPGEPI EARLRRGLGA RSIVLVGLMG AGKSTVGRRL AGRLGLMFKD ADHEIEAAAK LTIADIFSIY GEASFREGEE RVIARLLREG PMVLATGGGA FMREATRARI AEGAISVWLK ADLDVLMRRV RKRNTRPLLQ TEDPEATMRT LMEVRHPVYA QADVTVLSRE VSHDRVVEDV MEALDIHINP SHTTQSQHLT FSMTQQPSRV NVPLSGGREY DIRIGRGLID AVGAEARDLG ARAAGIVTDE TVAGLYGERV RASLEAAGLR CGIIAVPPGE ASKSYAEFAR VCDGLLAQKI ERGDLVVALG GGVVGDLAGF AAASLRRGVR FLQVPTTLLA QVDSSVGGKT GINSPLGKNL IGAFHQPRLV LADTATLDTL SEREMRAGYA EVAKYGLIGD AGFFEWCEAN WAGIFSGGPE RDEAVAACCR AKAGVVTRDE REDGERALLN LGHTFGHALE RLTGYDAARL VHGEGVAIGL ALAFRFSARL GLCPGQDAGR VANHLALAGL PTRLQQVPGG AGDPDALLDA MAQDKKVRDG QLTFILAHGI GQSFIAPGID AAEVRAFLEA ELRG
|
| |