Gene Mkms_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3335 
Symbol 
ID4611261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3498266 
End bp3499663 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content69% 
IMG OID639793008 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_939319 
Protein GI119869367 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.6171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.158305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGA CCGTCGACAT CCCCATCGAC CAGCTCCCCG ACCTGCCGCC CCTGCCGGAC 
GAGCTGCGTC ACCGCCTGGA TTCCGCGCTG GCCAAACCGG CGGTCCAGCA GCCCAGCTGG
GACGCCGACG CCGCCAAGGC CATGCGCACG GTCCTCGAGA GCGTTCCGCC GGTCACGGTG
CCCTCGGAGA TCGAGAAGCT CAAGGGTCTG CTCGCCGACG TCGCGCGCGG TGAGGCGTTC
CTGCTGCAGG GCGGGGACTG CGCCGAGACG TTCGTCGACA ACACCGAACC GCACATCCGC
GCCAACATCC GCACCCTGCT GCAGATGGCC GTCGTCCTCA CCTACGGCGC GAGCATGCCG
GTGGTCAAGG TGGCGCGCAT CGCCGGGCAG TACGCCAAAC CGCGCTCGTC GGACATCGAC
GCGCTGGGGC TGAAGTCCTA CCGCGGCGAC ATGATCAACG GGTTCGCCCC GGACGCGGCG
GCCCGCCAGC ACGATCCGTC GCGTCTCGTG CGCGCCTACG CCAACGCCAG CGCCGCGATG
AACCTGGTGC GCGCGCTCAC CTCGTCGGGG ATGGCGGCGC TGCAGGGTGT GCACGACTGG
AACCGCGAAT TCGTGCGCAC GTCGCCGGCC GGCGCCCGTT ACGAGGCGCT CGCCGGGGAG
ATCGACCGGG CGCTGACGTT CATGAGCGCC TGCGGCGTCG ACGACCGCAA CCTGCAGACC
GCCGAGATCT TCGCCAGCCA CGAGGCGCTG GTGCTCGACT ACGAACGAGC GATGCTGCGG
CTCTCGACGG AGTTCCCGGC CGACGATCCG GAGCCGCGGC TCTACGACCT GTCGGCGCAC
TACGTGTGGA TCGGTGAGCG CACCCGCCAG CTCGACGGCG CGCACATCGC GTTCGTGGAA
ACGATTGCCA ACCCGATCGG CATCAAGCTC GGGCCGACCA CCACACCGGA ACTGGCCGTC
GAGTACGTCG AGCGGCTCGA TCCGCACAAC CAGCCGGGCC GGCTGACGCT GGTGACCCGG
ATGGGGAACA GCAAGGTGCG CGACCTGCTG CCGCCGATCA TCGAGAAGGT GCAGGCCAGC
GGGCATCAGG TCATCTGGCA GTGCGATCCG ATGCACGGCA ACACCCACGA GTCCTCGACC
GGTTACAAGA CCCGCCACTT CGACCGCATC GTCGACGAGG TGCAGGGCTT CTTCGAGGTG
CACCGCGCGC TGGGCACCCA TCCGGGCGGC ATCCACGTCG AGATCACCGG TGAGAACGTC
ACCGAATGCC TCGGCGGCGC GCAGGACATC TCCGACACCG ACCTGGCCGG GCGTTACGAG
ACCGCATGCG ATCCGCGGAT GAACACCCAG CAGAGCCTCG AGTTGGCGTT CCTGGTCGCG
GAGATGCTGC GGGACTAG
 
Protein sequence
MNWTVDIPID QLPDLPPLPD ELRHRLDSAL AKPAVQQPSW DADAAKAMRT VLESVPPVTV 
PSEIEKLKGL LADVARGEAF LLQGGDCAET FVDNTEPHIR ANIRTLLQMA VVLTYGASMP
VVKVARIAGQ YAKPRSSDID ALGLKSYRGD MINGFAPDAA ARQHDPSRLV RAYANASAAM
NLVRALTSSG MAALQGVHDW NREFVRTSPA GARYEALAGE IDRALTFMSA CGVDDRNLQT
AEIFASHEAL VLDYERAMLR LSTEFPADDP EPRLYDLSAH YVWIGERTRQ LDGAHIAFVE
TIANPIGIKL GPTTTPELAV EYVERLDPHN QPGRLTLVTR MGNSKVRDLL PPIIEKVQAS
GHQVIWQCDP MHGNTHESST GYKTRHFDRI VDEVQGFFEV HRALGTHPGG IHVEITGENV
TECLGGAQDI SDTDLAGRYE TACDPRMNTQ QSLELAFLVA EMLRD