Gene Mkms_2401 details

Gene Information

Locus tag: Mkms_2401
Symbol: aroB
ID: 4613224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2518703
End bp: 2519785
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 70%
IMG OID: 639792070
Product: 3-dehydroquinate synthase
Protein accession: YP_938389
Protein GI: 119868437
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
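
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch; the variable names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2518703
end_bp = 2519785
gene_length_bp = 1083
protein_length_aa = 360

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon (3 bp) that encodes no amino acid.
coding_codons = (gene_length_bp - 3) // 3

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```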


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0377427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGAGC CGGTCACCGT CGACGTACTG GTCGACCCGC CCTACCCGGT GATCATCGGC 
ACCGGACTGC TCGGCGAACT CGGCCGGCTG CTCGAGGGTA GGCACAAGGT GGCCATCCTG
CATCAGCCGA CGCTCTCGGT GACCGCCGAA GCGGTGCGAA GCCACTTGGC CGACAAGGGA
ATCGATGCCC ACCGCATCGA GATCCCGGAC GCCGAAGCCG GTAAGGACCT GCCGGTGGTG
GGGTTCATCT GGGAGGTGCT CGGCCGGATC GGGGTGGGGC GCAAGGACGC GATCGTCAGC
CTCGGCGGGG GAGCGGCCAC CGACGTCGCC GGATTCGCCG CGGCGACCTG GTTGCGCGGT
GTCGACATCG TGCACGTCCC GACCACGCTG CTCGGGATGG TCGACGCGGC GGTCGGCGGT
AAGACCGGCA TCAACACCGA CGCGGGTAAG AACCTCGTCG GCGCCTTCCA TCAGCCCGCC
GCCGTGCTGA TCGACCTCGC GACCCTGGAG ACGTTGCCGC GCAACGAGAT CGTCGCCGGT
ATGGCCGAGG TCGTCAAAGC CGGGTTCATC GCCGATCCGC ACATCCTCGA CCTCATCGAG
GCCGATCCGG AAGCCGCCCT CGACCCGTCC AAAGATGTTC TGCCGGAACT GATTCGACGT
GCGGTCGCGG TCAAGGCGGA GGTGGTCGCG GCCGACGAGA AGGAATCCGC GCTGCGCGAG
ATCCTCAACT ACGGGCACAC GCTGGCCCAC GCGATCGAAC GCCGCGAGCG CTACCAGTGG
CGCCACGGCG CGGCGGTGTC GGTCGGCCTG GTGTTCGCCG CCGAACTCGG CCGCCTGGCG
GGCCGACTCG ACGACCAGAC GGCCGACCGG CACCGGTCGG TGCTGGAAGC GCTGGGGCTG
CCGGTGAGCT ATGACCCCGA CGCGCTGCCG AAACTCCTGG AGTACATGGC GGGCGACAAG
AAGACCCGCT CGGGTGTGCT GCGGTTCGTG GTGCTCGACG GGCTGGCCAA ACCCGGCCGG
CTCGAAGGCC CCGACCCGTC GCTGCTCGCC GCGGCCTACT CGGTGGTGGG AGGGACCCGA
TGA
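
The GC content reported for this gene is the share of G and C bases in the nucleotide sequence. A minimal Python sketch of that calculation follows; `gc_content` is a hypothetical helper name, and the function assumes the sequence may contain the spaces and line breaks used in the listing above.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)

# First 10-base block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_content("GTGAGCGAGC"))  # 0.7
```

Passing the full gene sequence to the same function gives the gene-wide value.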
 
Protein sequence
MSEPVTVDVL VDPPYPVIIG TGLLGELGRL LEGRHKVAIL HQPTLSVTAE AVRSHLADKG 
IDAHRIEIPD AEAGKDLPVV GFIWEVLGRI GVGRKDAIVS LGGGAATDVA GFAAATWLRG
VDIVHVPTTL LGMVDAAVGG KTGINTDAGK NLVGAFHQPA AVLIDLATLE TLPRNEIVAG
MAEVVKAGFI ADPHILDLIE ADPEAALDPS KDVLPELIRR AVAVKAEVVA ADEKESALRE
ILNYGHTLAH AIERRERYQW RHGAAVSVGL VFAAELGRLA GRLDDQTADR HRSVLEALGL
PVSYDPDALP KLLEYMAGDK KTRSGVLRFV VLDGLAKPGR LEGPDPSLLA AAYSVVGGTR
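
The protein sequence follows from the gene sequence under translation table 11, in which an initiating GTG codon is read as methionine rather than valine. The sketch below illustrates this in plain Python; `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the start codon set is taken here to be the one NCBI lists for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons accepted for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("GTGAGCGAGC CGGTCACCGT CGACGTACTG"))  # MSEPVTVDVL
```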