Gene ECD_03241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03241 
SymbolaroB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3377776 
End bp3378864 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content54% 
IMG OID 
Product3-dehydroquinate synthase 
Protein accessionACT45044 
Protein GI253979374 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00972152 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA CGTAGTTACC CAATTACCAT CGCATCTGGT 
TTGTTTAATG AACCAGCTTC ATTCTTACCG CTGAAATCGG GCGAGCAGGT CATGTTGGTC
ACCAACGAAA CCCTGGCTCC TCTGTATCTC GATAAGGTCC GCGGCGTACT TGAACAGGCG
GGTGTTAACG TCGATAGCGT TATCCTCCCT GACGGCGAGC AGTATAAAAG CCTGGCTGTA
CTCGATACCG TCTTTACGGC GTTGTTACAA AAGCCGCATG GTCGCGATAC TACGCTGGTG
GCGCTTGGCG GCGGCGTAGT GGGCGATCTG ACCGGCTTCG CGGCGGCGAG TTATCAGCGC
GGTGTTCGTT TCATTCAAGT CCCGACGACG TTACTGTCGC AGGTCGATTC CTCCGTTGGC
GGCAAAACTG CGGTCAACCA TCCCCTCGGT AAAAACATGA TTGGCGCGTT CTACCAGCCT
GCTTCAGTGG TGGTGGATCT CGACTGTCTG AAAACGCTTC CCCCGCGTGA GTTAGCGTCG
GGGCTGGCAG AAGTCATCAA ATACGGCATT ATTCTTGACG GTGCGTTTTT TAACTGGCTG
GAAGAGAATC TGGATGCGTT GTTGCGTCTG GACGGTCCGG CAATGGCGTA CTGTATTCGC
CGTTGTTGTG AACTGAAGGC AGAAGTTGTC GCCGCCGACG AGCGCGAAAC CGGGTTACGT
GCTTTACTGA ATCTGGGACA CACCTTTGGT CATGCCATTG AAGCTGAAAT GGGGTATGGC
AATTGGTTAC ATGGTGAAGC GGTCGCTGCG GGTATGGTGA TGGCGGCGCG GACGTCGGAA
CGTCTCGGGC AGTTTAGTTC TGCCGAAACG CAGCGTATTA TAACCCTGCT CAAGCGGGCT
GGGTTACCGG TCAATGGGCC GCGCGAAATG TCCGCGCAGG CGTATTTACC GCATATGCTG
CGTGACAAGA AAGTCCTTGC GGGAGAGATA CGCTTAATTC TTCCGTTGGC AATTGGTAAG
AGTGAAGTTC GCAGCGGCGT TTCGCACGAG CTTGTTCTTA ACGCCATTGC CGATTGTCAA
TCAGCGTAA
 
Protein sequence
MERIVVTLGE RSYPITIASG LFNEPASFLP LKSGEQVMLV TNETLAPLYL DKVRGVLEQA 
GVNVDSVILP DGEQYKSLAV LDTVFTALLQ KPHGRDTTLV ALGGGVVGDL TGFAAASYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLDCL KTLPPRELAS
GLAEVIKYGI ILDGAFFNWL EENLDALLRL DGPAMAYCIR RCCELKAEVV AADERETGLR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GMVMAARTSE RLGQFSSAET QRIITLLKRA
GLPVNGPREM SAQAYLPHML RDKKVLAGEI RLILPLAIGK SEVRSGVSHE LVLNAIADCQ
SA