Gene Mvan_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3540 
Symbol 
ID4647924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3765724 
End bp3767130 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID639807017 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_954341 
Protein GI120404512 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.161377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.74918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGA CCGTCGACGT ACCCATCGAC CAGCTGCCTG CGCTCCCGCC GCTGCCCGAG 
GATCTGCGGC AGCGCCTCGA CGCCGCCCTG GCCAGGCCGG CACTGCAGCA ACCGTCCTGG
GACGCCGGAC AGGCCGCGGC GATGCGCAAG GTGCTCGAGA GCGTGCCGCC CGTCACGGTC
CCGTCGGAGA TCGAACGGTT GAAGTCGCAG CTGGCCGACG TCGCTCGGGG TCAGGCGTTC
CTGCTGCAGG GCGGCGACTG CGCCGAAACG TTCGTCGACA ACACCGAGCC GCACATCCGC
GCGAACATCC GCACGCTGCT CCAGATGGCC GTGGTGCTGA CCTACGGTGC GAGCATGCCG
GTGGTCAAGG TGGCCCGTAT CGCGGGGCAG TACGCCAAGC CCCGCTCCTC GGACACCGAC
GCGCTGGGCC TGAAGTCCTA CCGCGGCGAC ATGGTCAACG GCTTCGCCCC GGACGCCGCG
GTCCGCGAGC ATGACCCGTC GCGTCTGGTG CGCGCGTACG CCAACGCCAG CGCCGCGATG
AACCTGGTGC GGGCGCTGAC GTCGTCGGGG ATGGCCTCAC TGCACCAGGT GCACGACTGG
AACAGGGAGT TCGTCCGCAC CTCGCCCGCC GGTGCCCGGT ACGAGGCGCT GGCCGGGGAG
ATCGACCGCG GCCTGCGCTT CATGAGCGCG TGCCGGGTCG ACGATCGCAA CCTCGACACC
GCGGAGATCT ACGCGAGCCA CGAGGCCCTG GTGCTGGACT ACGAGCGGGC GATGCTGCGC
ATGAACATCG CTGACTCCGC CGATCCGGCC TCGGACGGTC CGCCGAAGCT CTACGACCTG
TCGGCCCATT ACGTGTGGAT CGGTGAGCGC ACCCGCCAGC TCGACGGCGC CCACGTGGCC
TTCGCCGAGG TCATCGCCAA CCCGGTCGGT ATCAAGATCG GCCCGACCAC GTCGCCCGAG
CTTGCCGTCG AATACGTCGA GCGGCTCGAC CCGAACAACG AGCCGGGCCG GCTCACCCTG
GTCAGCCGGA TGGGCAACCA CAAGGTGCGC GACGTGCTGC CGCCGATCAT CGAGAAGGTG
CAGGCCTCGG GGCATCAGGT CGTCTGGCAG TGCGACCCGA TGCACGGCAA CACCCACGAG
TCCTCGACCG GCTACAAGAC GCGCCACTTC GACCGCATCG TCGACGAGGT GCAGGGCTTC
TTCGAGGTGC ACCACGCGCT CGGCACGCAC CCCGGCGGCA TCCACGTCGA GATCACCGGT
GAGAACGTCA CCGAGTGTCT CGGTGGCGCA CAAGACATTT CGGACACCGA CCTGGCCGGC
CGCTACGAGA CCGCCTGCGA CCCGCGGCTG AACACGCAGC AGTCGCTGGA GCTGGCGTTC
TTGGTCGCGG AGATGCTCCG CGGTTAG
 
Protein sequence
MNWTVDVPID QLPALPPLPE DLRQRLDAAL ARPALQQPSW DAGQAAAMRK VLESVPPVTV 
PSEIERLKSQ LADVARGQAF LLQGGDCAET FVDNTEPHIR ANIRTLLQMA VVLTYGASMP
VVKVARIAGQ YAKPRSSDTD ALGLKSYRGD MVNGFAPDAA VREHDPSRLV RAYANASAAM
NLVRALTSSG MASLHQVHDW NREFVRTSPA GARYEALAGE IDRGLRFMSA CRVDDRNLDT
AEIYASHEAL VLDYERAMLR MNIADSADPA SDGPPKLYDL SAHYVWIGER TRQLDGAHVA
FAEVIANPVG IKIGPTTSPE LAVEYVERLD PNNEPGRLTL VSRMGNHKVR DVLPPIIEKV
QASGHQVVWQ CDPMHGNTHE SSTGYKTRHF DRIVDEVQGF FEVHHALGTH PGGIHVEITG
ENVTECLGGA QDISDTDLAG RYETACDPRL NTQQSLELAF LVAEMLRG