Gene B21_03193 details


Gene Information

Locus tag: B21_03193
Symbol: aroB
ID: 8113415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3376988
End bp: 3378076
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 54%
IMG OID: 644849373
Product: hypothetical protein
Protein accession: YP_003000946
Protein GI: 251786642
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
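
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3376988, 3378076)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```

Both results agree with the Gene Length and Protein Length fields listed in this record.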


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00984736
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA CGTAGTTACC CAATTACCAT CGCATCTGGT 
TTGTTTAATG AACCAGCTTC ATTCTTACCG CTGAAATCGG GCGAGCAGGT CATGTTGGTC
ACCAACGAAA CCCTGGCTCC TCTGTATCTC GATAAGGTCC GCGGCGTACT TGAACAGGCG
GGTGTTAACG TCGATAGCGT TATCCTCCCT GACGGCGAGC AGTATAAAAG CCTGGCTGTA
CTCGATACCG TCTTTACGGC GTTGTTACAA AAGCCGCATG GTCGCGATAC TACGCTGGTG
GCGCTTGGCG GCGGCGTAGT GGGCGATCTG ACCGGCTTCG CGGCGGCGAG TTATCAGCGC
GGTGTTCGTT TCATTCAAGT CCCGACGACG TTACTGTCGC AGGTCGATTC CTCCGTTGGC
GGCAAAACTG CGGTCAACCA TCCCCTCGGT AAAAACATGA TTGGCGCGTT CTACCAGCCT
GCTTCAGTGG TGGTGGATCT CGACTGTCTG AAAACGCTTC CCCCGCGTGA GTTAGCGTCG
GGGCTGGCAG AAGTCATCAA ATACGGCATT ATTCTTGACG GTGCGTTTTT TAACTGGCTG
GAAGAGAATC TGGATGCGTT GTTGCGTCTG GACGGTCCGG CAATGGCGTA CTGTATTCGC
CGTTGTTGTG AACTGAAGGC AGAAGTTGTC GCCGCCGACG AGCGCGAAAC CGGGTTACGT
GCTTTACTGA ATCTGGGACA CACCTTTGGT CATGCCATTG AAGCTGAAAT GGGGTATGGC
AATTGGTTAC ATGGTGAAGC GGTCGCTGCG GGTATGGTGA TGGCGGCGCG GACGTCGGAA
CGTCTCGGGC AGTTTAGTTC TGCCGAAACG CAGCGTATTA TAACCCTGCT CAAGCGGGCT
GGGTTACCGG TCAATGGGCC GCGCGAAATG TCCGCGCAGG CGTATTTACC GCATATGCTG
CGTGACAAGA AAGTCCTTGC GGGAGAGATA CGCTTAATTC TTCCGTTGGC AATTGGTAAG
AGTGAAGTTC GCAGCGGCGT TTCGCACGAG CTTGTTCTTA ACGCCATTGC CGATTGTCAA
TCAGCGTAA
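
A GC content figure like the one in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene sequence; how this record rounds the value is not stated. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as a small demonstration.
print(gc_content("ATGGAGAGGA TTGTCGTTAC"))  # 45.0
```

Passing the complete gene sequence to `gc_content` gives a value that can be compared against the listed GC content.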
 
Protein sequence
MERIVVTLGE RSYPITIASG LFNEPASFLP LKSGEQVMLV TNETLAPLYL DKVRGVLEQA 
GVNVDSVILP DGEQYKSLAV LDTVFTALLQ KPHGRDTTLV ALGGGVVGDL TGFAAASYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLDCL KTLPPRELAS
GLAEVIKYGI ILDGAFFNWL EENLDALLRL DGPAMAYCIR RCCELKAEVV AADERETGLR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GMVMAARTSE RLGQFSSAET QRIITLLKRA
GLPVNGPREM SAQAYLPHML RDKKVLAGEI RLILPLAIGK SEVRSGVSHE LVLNAIADCQ
SA
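
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the forward strand is given as shown and using the codon assignments of translation table 11, which are the same as the standard code for elongation. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon and drop one trailing stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 9 bases of the gene sequence above.
print(translate_cds("ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA"))  # MERIVVTLGE
print(translate_cds("TCAGCGTAA"))                         # SA
```

The two outputs match the first ten and last two residues of the protein sequence listed in this record.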