Gene Amir_5253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5253 
SymbolaroB 
ID8329455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6250219 
End bp6251325 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content74% 
IMG OID644945692 
Product3-dehydroquinate synthase 
Protein accessionYP_003102920 
Protein GI256379260 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.528317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGC CGGTGCGCAT CCGCGTGGCC GCGGAGCGGC CGTACGAGGT CATCGTCGGG 
CGTGGGTTGC TCGGGGACCT GGTCGAACTG CTGCGCGGCA CCTCGAAGGC GGCGATCGTG
CACACCGCCG TGCTCGCCGA GACGGCGGAC GCGGTGCTCG AGGAGTTGCG CGGGGCCGGG
GTCGACGCGC ACCGGGTCGA GGTGCCCGAC GCCGAGGACG GCAAGGACCT GCGGGTCGCC
GGGTACTGCT GGGACGTGTT CGGGCAGATC GGGCTCGGGC GGCAGGACGT CGTCGTCGGG
CTCGGCGGCG GGGCGGTCAC CGATCTCGCC GGGTTCGTCG CGTCCACCTG GATGCGCGGG
GTGCGGCTGA TCAACGTGCC GACCACGCTC CTCGGCATGG TCGACGCCGC CGTGGGCGGC
AAGACCGGCA TCAACACCGA CGCGGGCAAG AACCTCGTCG GCACCTTCTA CGAGCCGACC
GCCGTCCTGG CCGACCTCAC CACCCTGGAG ACCCTGCCGC GCAACGAGCT CGTCGCGGGC
ATGGCCGAGG TGGTCAAGGG CGGCTTCATC GCCGACCCGG CGATCCTCGA CCTCATCGAG
GCCGACCCGG CCGCCGCGCT CGACCCGTCC GGCGACGTGC TCGCCGAGCT GGTCCGCCGC
AAGATCCAGG TCAAGGCCGA CGTGGTGTCC AGCGACCTGC GCGAGTCGAA CCTGCGCGAG
ATCCTCAACT ACGGCCACAC CCTCGGCCAC GCCCTGGAGC GCCGCGAGCG CTACCGCTGG
CGCCACGGCG CGGCCATCAG CGTCGGCCTG GTCTTCGCCG CCGAGCTCGC CCGCCTGGCG
GGCAGGCTGG ACGACGCCAC CGCCGACCGC CACCGCAGCG TCCTCACCTC GCTCGGCCTC
CCCGTGGCCT ACGACCCGGA CGCCCTGCCG CAGCTGCTGG AGGGGATGCG CTCGGACAAG
AAGAACCGCT CGGGCGTGCT CCGCTTCGTC GTGCTCGACG GCCTGGCCAA GCCGGGCAGG
CTCGAAGGCC CCGACCCGTC GCTGATCGCC GCCGCCTACT CGGCCGTCGC GGCCGAGCCG
AGGACCGGCG GGAGCATCCT GCTGTGA
 
Protein sequence
MGEPVRIRVA AERPYEVIVG RGLLGDLVEL LRGTSKAAIV HTAVLAETAD AVLEELRGAG 
VDAHRVEVPD AEDGKDLRVA GYCWDVFGQI GLGRQDVVVG LGGGAVTDLA GFVASTWMRG
VRLINVPTTL LGMVDAAVGG KTGINTDAGK NLVGTFYEPT AVLADLTTLE TLPRNELVAG
MAEVVKGGFI ADPAILDLIE ADPAAALDPS GDVLAELVRR KIQVKADVVS SDLRESNLRE
ILNYGHTLGH ALERRERYRW RHGAAISVGL VFAAELARLA GRLDDATADR HRSVLTSLGL
PVAYDPDALP QLLEGMRSDK KNRSGVLRFV VLDGLAKPGR LEGPDPSLIA AAYSAVAAEP
RTGGSILL