Gene Information |
Locus tag | Nmul_A2194 |
Symbol | |
ID | 3786219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2492832 |
End bp | 2493848 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812281 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_412878 |
Protein GI | 82703312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
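The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Nmul_A2194.
length_bp = gene_length(2492832, 2493848)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```

Under these assumptions the computed values agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields.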
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCATAG TCATGAATAA CGGCGCCACC GAAGAGCAGA TCGAAACGGT GGTTGCAAAA ATTCGAAGCT TCGACCTGGA CGCGAACGTT TCGCGTGGCA CCGAGCGTAC CGTAATCGGC GCAATTGGCA ATGAGCGCAA GCTCAGTCCC GAAATGTTCG ATACCCTGAG CGGTGTCGAA TATTCCATGC ATATCGTCAA GCAGTACAAG ATCGTGTCGC GCGAATGGCA CAGGGATAAT TCGATAATCA ATGTGGGAGA TGTGGCGATT GGCGGGGACC AGGTGCAGGT GATCGGAGGT CCCTGTTCAG TGGAGACGCA AGAGCAAATG GATTCGGCGG CCCAACATGT CTCGGATGCC GGATGCCGGC TGATGCGGGG CGGCGCCTTC AAGCCCCGTA CCAGTCCCTA TACCTTCCAG GGCAATGGCG AAGAAGGATT GAAAATGTTC CGCAAGGCTG CGGATAAGCA CAACCTCCGG ATCGTCACGG AATTGATGGA TGCGCGCATG CTGGACACTT TCCTCGAGTA TGACGTGGAT GTGATCCAGA TCGGTACACG CAGCATGCAG AACTTCGAAC TTTTGAAAGA AGTGGGGCGC ATCAACAAGC CCGTGATACT GAAGCGCGGG ATGTCCGCGA CCGTTTCCGA GTGGCTCATG GCCGCGGAAT ACATCGCCGC GGGCGGCAAC CATAACATCA TTTTCTGCGA ACGCGGCATA CGCACCTTTG AAACCGCTTA TCGCAACGTA ATGGATGTAA CCTGCATCCC CGTGCTAAAA AAAGAAACGC ACTTGCCGGT AATCGTCGAT CCCTCCCATG CGGGGGGAAA GGCATGGATG GTGCCCGCCC TGGCGCGCGC GGCAGTTGCA GCGGGAGCGG ATGGCCTGCT GGTAGAGACG CATCCCAATC CATGCGAAGC CTGGTGCGAC GCAGACCAGG CGTTGAATCC CGAGGAATTC CGCGATCTGA TGGGATCGCT GCAAGGAATA GCGGCAGTAA TCGGACGGAG TCTGTGA
|
Protein sequence | MIIVMNNGAT EEQIETVVAK IRSFDLDANV SRGTERTVIG AIGNERKLSP EMFDTLSGVE YSMHIVKQYK IVSREWHRDN SIINVGDVAI GGDQVQVIGG PCSVETQEQM DSAAQHVSDA GCRLMRGGAF KPRTSPYTFQ GNGEEGLKMF RKAADKHNLR IVTELMDARM LDTFLEYDVD VIQIGTRSMQ NFELLKEVGR INKPVILKRG MSATVSEWLM AAEYIAAGGN HNIIFCERGI RTFETAYRNV MDVTCIPVLK KETHLPVIVD PSHAGGKAWM VPALARAAVA AGADGLLVET HPNPCEAWCD ADQALNPEEF RDLMGSLQGI AAVIGRSL
|
| |
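The gene sequence above begins with GTG while the protein begins with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) allows for an alternative start codon. The following is a minimal sketch of that translation and of a GC-content calculation, not the database's own pipeline. The function names are hypothetical, the codon table is built from the standard NCBI amino acid ordering, and the sequence is assumed to be pasted as shown, in space-separated blocks of ten bases.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the amino
# acids of translation table 11, which match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for table 11; each is read as methionine at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete or partial CDS, stopping at the first stop codon."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is always read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGATCATAG TCATGAATAA CGGCGCCACC"
print(translate_cds(prefix))  # MIIVMNNGAT
```

Passing the full Gene sequence field to `translate_cds` and `gc_percent` gives a way to compare the result against the Protein sequence and GC content fields of the record.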