Gene EcolC_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1939 
Symbol 
ID6068553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2142036 
End bp2142902 
Gene Length867 bp 
Protein Length288 aa 
Translation table11 
GC content48% 
IMG OID641601350 
Productquinate/shikimate dehydrogenase 
Protein accessionYP_001724912 
Protein GI170019958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000737953 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGTTA CCGCAAAATA CGAATTGATT GGGTTGATGG CTTATCCTAT CCGCCACAGT 
TTATCGCCTG AAATGCAGAA TAAAGCCTTA GAAAAAGCGG GATTGCCATT TACCTATATG
GCCTTCGAAG TGGATAACGA TAGCTTTCCC GGAGCAATTG AAGGATTAAA AGCTCTAAAA
ATGCGCGGAA CTGGTGTATC AATGCCGAAC AAACAACTGG CGTGTGAATA TGTTGATGAA
TTAACGCCAG CAGCCAAACT GGTGGGTGCC ATCAACACCA TCGTTAATGA TGATGGCTAC
CTGCGCGGCT ATAACACCGA CGGCACGGGC CATATTCGAG CCATTAAAGA GAGCGGTTTT
GATATCAAAG GCAAAACAAT GGTGCTGTTA GGGGCCGGAG GTGCCTCAAC GGCGATTGGC
GCGCAGGGGG CAATTGAAGG TTTAAAAGAA ATTAAACTCT TTAACCGTCG GGATGAGTTC
TTCGATAAAG CCCTCGCCTT CGCGCAGCGG GTTAACGAAA ACACCGATTG TGTCGTCACG
GTTACCGATC TCGCCGATCA GCAAGCCTTT GCTGAAGCCC TGGCTTCCGC CGACATTTTA
ACCAATGGCA CAAAAGTGGG TATGAAACCC CTTGAGAATG AATCATTGGT TAATGATATC
AGTCTGTTAC ATCCGGGACT GCTGGTCACT GAATGCGTGT ATAACCCGCA TATGACGAAG
TTATTGCAGC AGGCGCAACA AGCTGGTTGC AAAACGATTG ATGGATACGG CATGTTGTTG
TGGCAAGGGG CTGAACAGTT CACGTTATGG ACTGGCAAAG ATTTCCCTCT GGAATATGTT
AAACAGGTCA TGGGGTTCGG TGCCTGA
 
Protein sequence
MDVTAKYELI GLMAYPIRHS LSPEMQNKAL EKAGLPFTYM AFEVDNDSFP GAIEGLKALK 
MRGTGVSMPN KQLACEYVDE LTPAAKLVGA INTIVNDDGY LRGYNTDGTG HIRAIKESGF
DIKGKTMVLL GAGGASTAIG AQGAIEGLKE IKLFNRRDEF FDKALAFAQR VNENTDCVVT
VTDLADQQAF AEALASADIL TNGTKVGMKP LENESLVNDI SLLHPGLLVT ECVYNPHMTK
LLQQAQQAGC KTIDGYGMLL WQGAEQFTLW TGKDFPLEYV KQVMGFGA