Gene Information |
Locus tag | Msil_2318 |
Symbol | |
ID | 7090302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2510248 |
End bp | 2512095 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643465641 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002362611 |
Protein GI | 217978464 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase; [COG0703] Shikimate kinase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0070259 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAGCT CCGAATCCGT GAAAGACAGG CCGGAGAGCG TGACGGACCG ACGGCTCGCC TCTCTCGTCC GATCGCTCGG CGATCGCTCG ATCGTCCTCG TCGGCTTCAT GGGCTGTGGC AAGACCTCGA CCGGCCGGCG TCTCGCGCAG CGGCTCGGCT TGCCTTTCAT AGACGCCGAC GCCGAGATCG AAGCGGCCGC CGGCATGACC ATCGCCGAGA TTTTTGCAAG GCACGGCGAG CCTTCGTTCC GTGACGGCGA GCGCCGCGTC ATGGCCCGGC TGCTCGAACA CGGCCCCCGC GTCATCGCAA CGGGAGGCGG AGCCTTTCTC AATGAGGAGA CACGCGCGCG GATCGCTCGT CGCGGTGTTT CGGTCTGGCT GAAGGCCGAA CCCGACGTGC TGTGGCGCCG CGTGCGCAAG CGATCGCACC GGCCCTTGTT GCAAAGCGCC GATCCCGAAA AAACGCTGCG CACGCTGTTG CGCGAGCGTT ATCCCTATTA TGCTCGGGCC GACGTCACCG TGATTTCGCG TGACGGCCCG CATGAGACAG CGGTTGAGGA AATCATCGCC GCGGTCGAGT TTTTCATGCG CTTTTCCCCC GAGCCGCCCA TCCTTGCCGT GTTGGACCCG ATGAATCACG CCGCAAAGCC GCCGCTGCCG CCGATCCCCT CCGCCGATGG CGAGCCTCTC TTCGGCCCCG CGGCCACGAC CGCGCCGGCG CCGGCGGCCC AGGACGCCGC CGCAGTGCGG GTCGAACTCG GCGAGCGGTC CTACGACATT CTGATTGGCG CGGGGCTTAT TGCGGCGGCG GGAGCTTGCG TGCGGCGGCT GGCGCCCGGC GCCGCCTGCG CCATCGTCAC GGACGCCAAT GTGGCCGCCC TGCATCTTGC CGAGCTGGAG CGCTCGCTGC GCGAGGCGGG CGTGCGCTAT AGCGCTGTGA TCATCCCGCC GGGCGAGCAG TCCAAATCCT ATGGCGTCTT CGCCAAGGTC TGCGACGATA TTCTTGCGGC GCGGCTCGAA CGGGGCGATC TCGTGATCGC GCTCGGCGGC GGCGTCATCG GCGATCTGGC GGGCTTTGCC GCCGCCTCGA TCCGGCGCGG CATGCGCTTC GTGCAGATCC CGACAAGTCT GCTCGCCCAG GTCGATTCCT CCGTCGGCGG CAAGACCGGG ATCAACTCCG AGCACGGCAA GAATCTGATC GGCGCCTTTC ATCAGCCCTC TCTGGTGCTG GCCGACGCCG ACGCGCTGGC CACCCTGCCG CTGCGCGAAT TCCGCGCCGG CTACGCCGAG ATCGTGAAAT ACGGGTTGAT CGGCGATGCG CCCTTCTTCG CCTGGCTCGA GACTCATTGG CGCGGCGTTT TCGCGGGCGG CCCGGATCGC GTCCACGCCA TCGCGACGAG CTGCCGCGCC AAGGCGGCGA TCGTCGGCCG CGACGAGCGG GAAAGCGGGG AGCGCGCGCT GTTGAATCTC GGCCACACCT TCGGCCATGC GCTGGAGCGC ATCAACCATT ACGACGGCGA GCGGCTGGTG CATGGCGAGG CTGTTTCGGT CGGGCTGGCG CTCGCCTTCC GCTTCTGCGC CAGAGTCGGG CTCTGCGAGG GCGCCGACGC GGACCGCGTT GAGGCGCATC TGCGCGAGGT TGGCCTGCCG ACGCGGATCG CCGACATCCC CGGCCTCGCG CTTGACGCCG ATCAGATGCT CGATGCCATG CGCCAGGACA AAAAGGTCGA GCGCGGCGCC CTGACCTTCG TTCTGGCGCG CGCGATCGGC GACTGTTTCG TCGCCAAATC GGTCGAGGCG 
GCCGAGGTGC GCGCTTTCCT CGAGGCCGAG CTCAACACAG GACTTTGA
|
Protein sequence | MTSSESVKDR PESVTDRRLA SLVRSLGDRS IVLVGFMGCG KTSTGRRLAQ RLGLPFIDAD AEIEAAAGMT IAEIFARHGE PSFRDGERRV MARLLEHGPR VIATGGGAFL NEETRARIAR RGVSVWLKAE PDVLWRRVRK RSHRPLLQSA DPEKTLRTLL RERYPYYARA DVTVISRDGP HETAVEEIIA AVEFFMRFSP EPPILAVLDP MNHAAKPPLP PIPSADGEPL FGPAATTAPA PAAQDAAAVR VELGERSYDI LIGAGLIAAA GACVRRLAPG AACAIVTDAN VAALHLAELE RSLREAGVRY SAVIIPPGEQ SKSYGVFAKV CDDILAARLE RGDLVIALGG GVIGDLAGFA AASIRRGMRF VQIPTSLLAQ VDSSVGGKTG INSEHGKNLI GAFHQPSLVL ADADALATLP LREFRAGYAE IVKYGLIGDA PFFAWLETHW RGVFAGGPDR VHAIATSCRA KAAIVGRDER ESGERALLNL GHTFGHALER INHYDGERLV HGEAVSVGLA LAFRFCARVG LCEGADADRV EAHLREVGLP TRIADIPGLA LDADQMLDAM RQDKKVERGA LTFVLARAIG DCFVAKSVEA AEVRAFLEAE LNTGL
|
| |
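The coordinates and lengths in the record above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates (as the listed gene length implies) and that the gene length counts the stop codon. The function names are hypothetical and not part of any database API. The GC helper is applied only to the first 10-base block of the gene sequence as an illustration, so its result is not the 68% reported for the whole gene.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(2510248, 2512095)
print(length)                     # 1848
print(protein_length(length))     # 615
print(gc_fraction("ATGACCAGCT"))  # 0.5 (first 10-base block only)
```

Passing the full gene sequence string to `gc_fraction` would give the whole-gene GC fraction for comparison with the listed value.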