Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1076 |
Symbol | |
ID | 3103278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1131896 |
End bp | 1132987 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637170265 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_113551 |
Protein GI | 53804599 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.901696 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGCG TGTACAACAC CGACGATCTT CGCATCTGCG AGATCAAGGA AGTCATTCCG CCCGTCCAGG TTCATGAGGA ATTCCCGATC ACGGACCGGG CCGCACTCAC GACACTGACC GCCCGCCGAG GGATTCACGC AATCCTTTCC AAGGAGGACG ACCGCCTGCT GGTGGTGATC GGGCCCTGTT CGATCCATGA CCCCAAGGCC GCGCTCGAAT ACGGGGAGCG GCTGCTGCCA CTCCGCCAGA AACTGGCGAG ACATCTGGAA ATCGTGATGC GGGTCTATTT CGAGAAGCCG CGAACGACCG TCGGCTGGAA GGGCCTGATC AATGATCCCG ATCTGGACGA GAGTTTCAAC ATCAACAAAG GCTTGCGCCT CGCCCGCAAG CTGTTGCTCG ATCTGAACGA ACTGGGCATG CCCGCGGCCA CCGAGTACCT CGATCTCATC ACCCCGCAGT ATGTCTCCGA CCTGATCGCT TGGGGCGCCA TCGGTGCTCG TACCACGGAG AGCCAGTCTC ACCGTGAACT GGCATCGGGG CTGTCATGTC CGGTTGGATT CAAGAACGCC ACCGACGGCA CGATCAAGGT TGCTGTCGAC GCCATAGGTG CGGCACGGCG GCCACATCAT TTCCTGTCTT TGACCAAGGC CGGTCATTCG GCGATCTTCT CCACGACCGG TAACGCCGAC TGTCACATCA TCCTTCGTGG CGGAGCCCGG CCGAATTACG ACGCGGCCAG CGTCGAAGCG GCGGCCAGGG CGCTGGAAGC CGTCGGCCTG CCGCCCAACA TCATGGTGGA CTGCAGCCAT GCCAACAGCA TGAAGGATTA CCTGAAGCAG CTGCGGGTGG CCGAGGACGT GGCCGAACAG ATAGACGGCG GCGACAGGCG GATCATCGGC TTGATGGTGG AAAGTCACCT CAAGCCGGGC AATCAGAAAC TCCACAAGGG CATGGTTCCC GAATACGGCG TCAGCATCAC CGATGCCTGC ATCGGCTGGG ATGACAGCGT GGCCGTGCTG GAACGGCTCG CCGCCGCGGT GGAGAGCCGG CGCGGCCGGT CGGCAGGCAT CCGGAACGTG CGGGGGGCCT GA
|
Protein sequence | MPSVYNTDDL RICEIKEVIP PVQVHEEFPI TDRAALTTLT ARRGIHAILS KEDDRLLVVI GPCSIHDPKA ALEYGERLLP LRQKLARHLE IVMRVYFEKP RTTVGWKGLI NDPDLDESFN INKGLRLARK LLLDLNELGM PAATEYLDLI TPQYVSDLIA WGAIGARTTE SQSHRELASG LSCPVGFKNA TDGTIKVAVD AIGAARRPHH FLSLTKAGHS AIFSTTGNAD CHIILRGGAR PNYDAASVEA AARALEAVGL PPNIMVDCSH ANSMKDYLKQ LRVAEDVAEQ IDGGDRRIIG LMVESHLKPG NQKLHKGMVP EYGVSITDAC IGWDDSVAVL ERLAAAVESR RGRSAGIRNV RGA
|
| |