Gene Mmcs_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2354 
SymbolaroB 
ID4111187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2499719 
End bp2500801 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content70% 
IMG OID638031479 
Product3-dehydroquinate synthase 
Protein accessionYP_639518 
Protein GI108799321 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGAGC CGGTCACCGT CGACGTACTG GTCGACCCGC CCTACCCGGT GATCATCGGC 
ACCGGACTGC TCGGCGAACT CGGCCGGCTG CTCGAGGGTA GGCACAAGGT GGCCATCCTG
CATCAGCCGA CGCTCTCGGT GACCGCCGAA GCGGTGCGAA GCCACTTGGC CGACAAGGGA
ATCGATGCCC ACCGCATCGA GATCCCGGAC GCCGAAGCCG GTAAGGACCT GCCGGTGGTG
GGGTTCATCT GGGAGGTGCT CGGCCGGATC GGGGTGGGGC GCAAGGACGC GATCGTCAGC
CTCGGCGGGG GAGCGGCCAC CGACGTCGCC GGATTCGCCG CGGCGACCTG GTTGCGCGGT
GTCGACATCG TGCACGTCCC GACCACGCTG CTCGGGATGG TCGACGCGGC GGTCGGCGGT
AAGACCGGCA TCAACACCGA CGCGGGTAAG AACCTCGTCG GCGCCTTCCA TCAGCCCGCC
GCCGTGCTGA TCGACCTCGC GACCCTGGAG ACGTTGCCGC GCAACGAGAT CGTCGCCGGT
ATGGCCGAGG TCGTCAAAGC CGGGTTCATC GCCGATCCGC ACATCCTCGA CCTCATCGAG
GCCGATCCGG AAGCCGCCCT CGACCCGTCC AAAGATGTTC TGCCGGAACT GATTCGACGT
GCGGTCGCGG TCAAGGCGGA GGTGGTCGCG GCCGACGAGA AGGAATCCGC GCTGCGCGAG
ATCCTCAACT ACGGGCACAC GCTGGCCCAC GCGATCGAAC GCCGCGAGCG CTACCAGTGG
CGCCACGGCG CGGCGGTGTC GGTCGGCCTG GTGTTCGCCG CCGAACTCGG CCGCCTGGCG
GGCCGACTCG ACGACCAGAC GGCCGACCGG CACCGGTCGG TGCTGGAAGC GCTGGGGCTG
CCGGTGAGCT ATGACCCCGA CGCGCTGCCG AAACTCCTGG AGTACATGGC GGGCGACAAG
AAGACCCGCT CGGGTGTGCT GCGGTTCGTG GTGCTCGACG GGCTGGCCAA ACCCGGCCGG
CTCGAAGGCC CCGACCCGTC GCTGCTCGCC GCGGCCTACT CGGTGGTGGG AGGGACCCGA
TGA
 
Protein sequence
MSEPVTVDVL VDPPYPVIIG TGLLGELGRL LEGRHKVAIL HQPTLSVTAE AVRSHLADKG 
IDAHRIEIPD AEAGKDLPVV GFIWEVLGRI GVGRKDAIVS LGGGAATDVA GFAAATWLRG
VDIVHVPTTL LGMVDAAVGG KTGINTDAGK NLVGAFHQPA AVLIDLATLE TLPRNEIVAG
MAEVVKAGFI ADPHILDLIE ADPEAALDPS KDVLPELIRR AVAVKAEVVA ADEKESALRE
ILNYGHTLAH AIERRERYQW RHGAAVSVGL VFAAELGRLA GRLDDQTADR HRSVLEALGL
PVSYDPDALP KLLEYMAGDK KTRSGVLRFV VLDGLAKPGR LEGPDPSLLA AAYSVVGGTR