Gene EcHS_A3585 details

Gene Information

Locus tag: EcHS_A3585
Symbol: aroB
ID: 5595507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3561755
End bp: 3562843
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 54%
IMG OID: 640922702
Product: 3-dehydroquinate synthase
Protein accession: YP_001460183
Protein GI: 157162865
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
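
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3561755, 3562843)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.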


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000415284
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA CGTAGTTACC CAATTACCAT CGCATCTGGT 
TTGTTTAATG AACCAGCTTC ATTCTTACCG CTGAAATCGG GCGAGCAGGT CATGTTGGTC
ACCAACGAAA CCCTGGCTCC TCTGTATCTC GATAAGGTCC GCGGCGTACT TGAACAGGCG
GGTGTTAACG TCGATAGCGT TATCCTCCCT GACGGCGAGC AGTATAAAAG CCTGGCTGTA
CTCGATACCG TCTTTACGGC GTTGTTACAA AAGCCGCATG GTCGCGATAC TACGCTGGTG
GCGCTTGGCG GCGGCGTAGT GGGCGATCTG ACCGGCTTCG CGGCGGCGAG TTATCAGCGC
GGTGTTCGTT TCATTCAAGT CCCGACGACG TTACTGTCGC AGGTCGATTC CTCCGTTGGC
GGCAAAACTG CGGTCAACCA TCCCCTCGGT AAAAACATGA TTGGCGCGTT CTACCAGCCT
GCTTCAGTGG TGGTGGATCT CGACTGTCTG AAAACGCTTC CCCCGCGTGA GTTAGCGTCG
GGGCTGGCAG AAGTCATCAA ATACGGCATT ATTCTTGACG GTGCGTTTTT TAACTGGCTG
GAAGAGAATC TGGATGCGTT GTTGCGTCTG GACGGTCCGG CAATGGCGTA CTGTATTCGC
CGTTGTTGTG AACTGAAGGC AGAAGTTGTC GCCGCCGACG AGCGCGAAAC CGGGTTACGT
GCTTTACTGA ATCTGGGACA CACCTTTGGT CATGCCATTG AAGCTGAAAT GGGGTATGGC
AATTGGTTAC ATGGTGAAGC GGTCGCTGCG GGTATGGTGA TGGCGGCGCG GACGTCGGAA
CGTCTCGGGC AGTTTAGTTC TGCCGAAACG CAGCGTATTA TAACCCTGCT CACGCGGGCT
GGGTTACCGG TCAATGGGCC GCGCGAAATG TCCGCGCAGG CGTATTTACC GCATATGCTG
CGTGACAAGA AAGTCCTTGC GGGAGAGATA CGCTTAATTC TTCCGTTGGC AATTGGTAAG
AGTGAAGTTC GCAGCGGCGT TTCGCACGAG CTTGTTCTTA ACGCCATTGC CGATTGTCAA
TCAGCGTAA
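
The gene sequence is laid out in blocks of 10 bases, 6 blocks per row, so the whitespace has to be removed before any calculation. As a minimal sketch, a GC fraction like the one in the GC content field could be computed as shown here; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as displayed (6 blocks of 10 bases).
first_row = "ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA CGTAGTTACC CAATTACCAT CGCATCTGGT"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1089 bp sequence instead of `first_row` would give the value to compare with the GC content field.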
 
Protein sequence
MERIVVTLGE RSYPITIASG LFNEPASFLP LKSGEQVMLV TNETLAPLYL DKVRGVLEQA 
GVNVDSVILP DGEQYKSLAV LDTVFTALLQ KPHGRDTTLV ALGGGVVGDL TGFAAASYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLDCL KTLPPRELAS
GLAEVIKYGI ILDGAFFNWL EENLDALLRL DGPAMAYCIR RCCELKAEVV AADERETGLR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GMVMAARTSE RLGQFSSAET QRIITLLTRA
GLPVNGPREM SAQAYLPHML RDKKVLAGEI RLILPLAIGK SEVRSGVSHE LVLNAIADCQ
SA
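
The protein sequence can be checked against the gene sequence by translating it with translation table 11, the table named in the Gene Information section. The sketch below builds the codon table from the compact NCBI-style residue string (bases ordered T, C, A, G); table 11 assigns the same residues as the standard code and differs in which codons may serve as starts, which does not matter here because the gene begins with ATG. The function name and sample inputs are illustrative.

```python
BASES = "TCAG"
# Residues for the 64 codons in TCAG order (translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence, as displayed.
print(translate("ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA"))  # MERIVVTLGE
print(translate("TCAGCGTAA"))                         # SA
```

These two fragments reproduce the first ten and last two residues of the protein sequence; translating the full gene sequence the same way should yield all 362 residues.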