Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_1501 |
Symbol | |
ID | 4645394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1588950 |
End bp | 1590005 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639804999 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_952339 |
Protein GI | 120402510 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.66157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.410286 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGG CGCAGACCGC CACCCCGCCC GCGACGTCGG ACCGCCGGAT CCGCAGTTTC GGCGAGATTC CCAGCCCACA CGCGGTGTCG ACCGAATTCC CGTTGGGTGC CCGCCGTGCC GAGCGGGTGG CCCGCGACCG CGACGAGATC GCCGACATCC TCGCGGGGCG GGACGACCGT CTGCTGGTGG TCGTCGGGCC GTGCTCGGTG CATGACCCTG CCGCCGCGCT GGAGTACGCC GGCCGGCTGG TCAAGATCGC CGCCGAGCTC AAGGACAGCC TCAAGATCGT GATGCGGGTG TACTTCGAGA AGCCGCGCAC CACGATCGGT TGGAAAGGCC TGATCAACGA TCCGGGGATG GACGGCACCT TCGACGTCGC GCGGGGCCTG CGCATCGCCC GGCAACTGCT GCTGGACATC ATCGACATCG GGCTCCCGGT GGGGTGTGAA TTCCTCGAGC CGACCAGCCC GCAGTACATC GCCGACGCCG TGGCGTGGGG TGCGATCGGC GCCCGCACCA CCGAATCGCA GGTGCACCGT CAACTTGCTT CGGGCCTGTC GATGCCGGTC GGCTTCAAAA ACGGAACCGA CGGCAACATT CAGGTCGCCG TCGACGGCGC GAAATCCGCT GCCGCCCAAC ATGTGTTCTT CGGCATGGAC GACATGGGCC GCGGCGCCGT GGTGAGCACC GAAGGTAACA GGGACTGCCA TGTCATCCTG CGGGGAGGTA CCGGCGGACC GAACTGGGAC GCCGAGTCGG TGCGCTCGGC GGCCGACAAG CTCGAGAGCG CGGGACTGCC CGGCCGGGTG GTGATCGACT GCAGCCACGC GAATTCCGGT AAGGACCACG TGCGGCAGGC GAGCGTAGCC GCGGAGGTGG CGCAGCTGGT GCGGGACGGC CTTCCGGTCA GCGGAGTCAT GCTGGAGAGC TTCCTGGTCG CCGGGGCACA GGCTCCCGAG GCGCGTCCGC TGACCTACGG CCAGTCGGTG ACCGACAAGT GCATGGATTG GGGTGCAACG GATCTGGTGT TGCGAGAGCT GGCCCGGCGC GGTTAG
|
Protein sequence | MTLAQTATPP ATSDRRIRSF GEIPSPHAVS TEFPLGARRA ERVARDRDEI ADILAGRDDR LLVVVGPCSV HDPAAALEYA GRLVKIAAEL KDSLKIVMRV YFEKPRTTIG WKGLINDPGM DGTFDVARGL RIARQLLLDI IDIGLPVGCE FLEPTSPQYI ADAVAWGAIG ARTTESQVHR QLASGLSMPV GFKNGTDGNI QVAVDGAKSA AAQHVFFGMD DMGRGAVVST EGNRDCHVIL RGGTGGPNWD AESVRSAADK LESAGLPGRV VIDCSHANSG KDHVRQASVA AEVAQLVRDG LPVSGVMLES FLVAGAQAPE ARPLTYGQSV TDKCMDWGAT DLVLRELARR G
|
| |