Gene Information |
Locus tag | Namu_3224 |
Symbol | |
ID | 8448838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3554178 |
End bp | 3555566 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042303 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003202544 |
Protein GI | 258653388 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
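The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3554178
end_bp = 3555566

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1389
print(protein_length)   # 462
```

Under those assumptions both results agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields.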
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000000000420086 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000147183 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACTGGA CGATCGACAT CCCGGCGGAC ATCCTGCCGA GTCTGCCCCC GCTGCCGCCG GAGCTGCGGG CCCGGCTGGA CGACGCGCTG TCCCGGCCCG CCGCGCAGCA ACCGGAGTGG CCCGACCCCG AGCAGGTGCT CGCCGTCCGG GCCGTCCTGG AGGCGGTGCC CCCGGTGACG GTGCCCGGCG AGGTCGACAA GCTCGCCGAC CAGCTGGCCG CGGTGGCCCG CGGTGAGGCC TTCCTGCTGC AGGGCGGCGA CTGCGCCGAG ACCTACGTCG ACAACACCGA GCCGCACATC CGCGGCAACA TCCGCACCCT GCTGCAGATG GCCGTCGTGC TGACCTACGG CGCCTCACTG CCGGTGGTCA AGGTGGCCCG CATCGCCGGC CAGTACGCCA AGCCCCGTTC CTCCAACATC GACGCGCTGG GCCTGCCGTC CTACCGCGGC GACATCATCA ACTCGCTGTC CACCACCCCG GAAGCGCGGA TCCCCGATCC GTCCCGGATG GTGCGCGCCT ACGCGAACTC CTCCGCGGCG ATGAACCTGG TCCGCGCGGT CACCGCCACC GGCATGGGCG ACCTGGCCCG GGTCCACGAG TGGAACCAGG AATTCGTCCT GACCTCGCGC GCCGGCGAGC GGTACGAGCG GGTGGCCAAG GAGATCGACC GGGCCATGCG GTTCATGAGT GCGTGCGGCG TGACCTCGCA TTCACTGCAC CAGGTCGACA TCTTCTCCTC GCACGAGGCG CTGCTGCTGG ACTACGAGCG GGCCATGCTG CGGATGGACA CCAGCCACGA CGAGCCCCGG CTCTACGACC TGTCCGGGCA CTTCCTGTGG GTGGGCGAGC GGACCCGGCA GCTGGACGGC GCGCACATCG CGTTTGCCCA GCTGCTGTCC AACCCGATCG GGCTCAAGAT CGGCCCGAGC ACCACCCCGG AGATGGCCGT CGAGTACGTC GAGCGGCTCG ACCCGCGCAA TCAGGCCGGC CGGCTCACGC TGATCAGCCG GATGAGCAAC ACCAAGATCC GCGACGTGCT GCCGCCGATC ATCGAAAAGG TGGAGGCGTC CGGGCACCAG GTCATCTGGC AGTGCGACCC GATGCACGGC AACACCCACG AGTCGCCGAC CGGCTACAAG ACCCGCCACT TCGACCGCAT CGTGGACGAG GTGCAGGGCT TCTTCGAGGT GCACAACGAG CTGGGCACCC ACCCGGGTGG CATCCACGTG GAGCTGACCG GCGAGGACGT CACCGAGTGC CTGGGCGGGG CCCAGGAGAT CTCCGACGAC GACCTGGCCG GCCGCTACGA GACGGCGTGC GACCCGCGGC TGAACACCCA GCAGTCGCTG GAGCTGGCCT TCCTGGTCGC GGAGATGCTG CGCGGCTAG
|
Protein sequence | MNWTIDIPAD ILPSLPPLPP ELRARLDDAL SRPAAQQPEW PDPEQVLAVR AVLEAVPPVT VPGEVDKLAD QLAAVARGEA FLLQGGDCAE TYVDNTEPHI RGNIRTLLQM AVVLTYGASL PVVKVARIAG QYAKPRSSNI DALGLPSYRG DIINSLSTTP EARIPDPSRM VRAYANSSAA MNLVRAVTAT GMGDLARVHE WNQEFVLTSR AGERYERVAK EIDRAMRFMS ACGVTSHSLH QVDIFSSHEA LLLDYERAML RMDTSHDEPR LYDLSGHFLW VGERTRQLDG AHIAFAQLLS NPIGLKIGPS TTPEMAVEYV ERLDPRNQAG RLTLISRMSN TKIRDVLPPI IEKVEASGHQ VIWQCDPMHG NTHESPTGYK TRHFDRIVDE VQGFFEVHNE LGTHPGGIHV ELTGEDVTEC LGGAQEISDD DLAGRYETAC DPRLNTQQSL ELAFLVAEML RG
|
| |
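The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates that relationship on the first 30 bases of the gene sequence. The function name `translate_cds` and the constants are hypothetical helpers written for this example, and the start codon set reflects the NCBI definition of table 11 as understood here; treat it as an assumption to verify against the NCBI genetic code listing.

```python
# Codon-to-residue mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, reading an allowed start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator methionine, even for GTG
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAACTGGA CGATCGACAT CCCGGCGGAC"))  # MNWTIDIPAD
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence to the same function should give a string that can be compared with the full protein sequence.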