Gene Information |
Locus tag | Mvan_3540 |
Symbol | |
ID | 4647924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 3765724 |
End bp | 3767130 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639807017 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_954341 |
Protein GI | 120404512 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
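The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only: the constant and function names are hypothetical and not part of the record, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
START_BP = 3765724
END_BP = 3767130
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1407
print(expected_protein_length(GENE_LENGTH_BP))  # 468
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.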
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.161377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.74918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACTGGA CCGTCGACGT ACCCATCGAC CAGCTGCCTG CGCTCCCGCC GCTGCCCGAG GATCTGCGGC AGCGCCTCGA CGCCGCCCTG GCCAGGCCGG CACTGCAGCA ACCGTCCTGG GACGCCGGAC AGGCCGCGGC GATGCGCAAG GTGCTCGAGA GCGTGCCGCC CGTCACGGTC CCGTCGGAGA TCGAACGGTT GAAGTCGCAG CTGGCCGACG TCGCTCGGGG TCAGGCGTTC CTGCTGCAGG GCGGCGACTG CGCCGAAACG TTCGTCGACA ACACCGAGCC GCACATCCGC GCGAACATCC GCACGCTGCT CCAGATGGCC GTGGTGCTGA CCTACGGTGC GAGCATGCCG GTGGTCAAGG TGGCCCGTAT CGCGGGGCAG TACGCCAAGC CCCGCTCCTC GGACACCGAC GCGCTGGGCC TGAAGTCCTA CCGCGGCGAC ATGGTCAACG GCTTCGCCCC GGACGCCGCG GTCCGCGAGC ATGACCCGTC GCGTCTGGTG CGCGCGTACG CCAACGCCAG CGCCGCGATG AACCTGGTGC GGGCGCTGAC GTCGTCGGGG ATGGCCTCAC TGCACCAGGT GCACGACTGG AACAGGGAGT TCGTCCGCAC CTCGCCCGCC GGTGCCCGGT ACGAGGCGCT GGCCGGGGAG ATCGACCGCG GCCTGCGCTT CATGAGCGCG TGCCGGGTCG ACGATCGCAA CCTCGACACC GCGGAGATCT ACGCGAGCCA CGAGGCCCTG GTGCTGGACT ACGAGCGGGC GATGCTGCGC ATGAACATCG CTGACTCCGC CGATCCGGCC TCGGACGGTC CGCCGAAGCT CTACGACCTG TCGGCCCATT ACGTGTGGAT CGGTGAGCGC ACCCGCCAGC TCGACGGCGC CCACGTGGCC TTCGCCGAGG TCATCGCCAA CCCGGTCGGT ATCAAGATCG GCCCGACCAC GTCGCCCGAG CTTGCCGTCG AATACGTCGA GCGGCTCGAC CCGAACAACG AGCCGGGCCG GCTCACCCTG GTCAGCCGGA TGGGCAACCA CAAGGTGCGC GACGTGCTGC CGCCGATCAT CGAGAAGGTG CAGGCCTCGG GGCATCAGGT CGTCTGGCAG TGCGACCCGA TGCACGGCAA CACCCACGAG TCCTCGACCG GCTACAAGAC GCGCCACTTC GACCGCATCG TCGACGAGGT GCAGGGCTTC TTCGAGGTGC ACCACGCGCT CGGCACGCAC CCCGGCGGCA TCCACGTCGA GATCACCGGT GAGAACGTCA CCGAGTGTCT CGGTGGCGCA CAAGACATTT CGGACACCGA CCTGGCCGGC CGCTACGAGA CCGCCTGCGA CCCGCGGCTG AACACGCAGC AGTCGCTGGA GCTGGCGTTC TTGGTCGCGG AGATGCTCCG CGGTTAG
|
Protein sequence | MNWTVDVPID QLPALPPLPE DLRQRLDAAL ARPALQQPSW DAGQAAAMRK VLESVPPVTV PSEIERLKSQ LADVARGQAF LLQGGDCAET FVDNTEPHIR ANIRTLLQMA VVLTYGASMP VVKVARIAGQ YAKPRSSDTD ALGLKSYRGD MVNGFAPDAA VREHDPSRLV RAYANASAAM NLVRALTSSG MASLHQVHDW NREFVRTSPA GARYEALAGE IDRGLRFMSA CRVDDRNLDT AEIYASHEAL VLDYERAMLR MNIADSADPA SDGPPKLYDL SAHYVWIGER TRQLDGAHVA FAEVIANPVG IKIGPTTSPE LAVEYVERLD PNNEPGRLTL VSRMGNHKVR DVLPPIIEKV QASGHQVVWQ CDPMHGNTHE SSTGYKTRHF DRIVDEVQGF FEVHHALGTH PGGIHVEITG ENVTECLGGA QDISDTDLAG RYETACDPRL NTQQSLELAF LVAEMLRG
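The sequences above are displayed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below is a minimal, hypothetical helper (the function names are not from the record) that strips the grouping and computes a GC fraction; it is demonstrated on the first two blocks of the gene sequence only, not the full gene. Note also that the gene begins with GTG while the protein begins with M: under translation table 11, GTG can act as an alternative start codon and is read as methionine at initiation.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "GTGAACTGGA CCGTCGACGT"

print(clean_sequence(first_blocks))  # GTGAACTGGACCGTCGACGT
print(gc_fraction(first_blocks))     # 0.6
```

Applying `gc_fraction` to the complete gene sequence would give the value to compare against the GC content field.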