Gene EcSMS35_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0516 
SymbolhtpG 
ID6147205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp523433 
End bp525307 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content53% 
IMG OID641615410 
Productheat shock protein 90 
Protein accessionYP_001742617 
Protein GI170683488 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0326] Molecular chaperone, HSP90 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0117874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGAC AAGAAACTCG TGGTTTTCAG TCAGAAGTGA AACAGCTTCT GCACCTGATG 
ATCCATTCTC TCTATTCCAA TAAAGAAATC TTCCTGCGTG AGCTTATCTC TAACGCCTCC
GATGCGGCGG ACAAGCTGCG TTTCCGCGCG CTCTCTAACC CGGACCTGTA CGAAGGTGAT
GGCGAACTGC GCGTTCGTGT CTCTTTCGAT AAAGACAAGC GCACGCTGAC CATCTCCGAT
AACGGCGTGG GGATGACCCG CGACGAAGTG ATTGACCATC TGGGGACTAT CGCTAAATCC
GGCACCAAAT CATTCCTCGA ATCCCTGGGT TCTGACCAGG CGAAAGACAG CCAGCTGATC
GGTCAGTTTG GTGTTGGTTT CTACTCTGCG TTTATCGTGG CCGACAAAGT GACCGTGCGT
ACTCGCGCGG CAGGCGAAAA ACCAGAAAAT GGCGTCTTCT GGGAATCGGC TGGCGAAGGT
GAATACACCG TTGCCGATAT CACCAAAGAA GATCGTGGTA CTGAAATCAC CCTGCATCTG
CGTGAAGGCG AAGACGAGTT CCTCGATGAC TGGCGCGTGC GTTCCATCAT CAGCAAATAC
TCCGACCATA TCGCGCTGCC GGTAGAGATC GAAAAACGCG AAGAGAAAGA CGGCGAAACC
GTTATCTCCT GGGAGAAAAT CAACAAAGCG CAGGCGCTGT GGACTCGTAA CAAGTCGGAA
ATCACCGATG AAGAGTACAA AGAGTTCTAC AAACACATCG CCCACGACTT TAACGATCCG
CTGACCTGGA GCCACAACCG CGTTGAAGGT AAGCAGGAGT ACACCAGCCT GCTGTACATT
CCGTCCCAGG CTCCGTGGGA TATGTGGAAC CGCGATCATA AACACGGACT GAAACTGTAC
GTCCAGCGTG TGTTCATCAT GGACGACGCA GAACAGTTCA TGCCGAACTA TCTGCGCTTC
GTGCGTGGTC TGATTGACTC CAGCGATCTG CCGCTGAACG TTTCCCGTGA AATCCTCCAG
GACAGCACGG TAACGCGTAA CCTGCGCAAT GCGCTGACCA AGCGTGTGCT GCAAATGCTG
GAAAAACTGG CGAAAGACGA CGCGGAAAAA TACCAGACCT TCTGGCAACA GTTTGGCCTG
GTACTGAAAG AAGGTCCGGC GGAAGATTTC GCTAACCAGG AAGCGATCGC CAAATTGCTG
CGTTTTGCTT CTACCCATAC CGATTCTTCT GCGCAGACCG TATCTCTGGA AGACTACGTT
TCCCGCATGA AAGAAGGGCA GGAGAAAATC TACTACATCA CCGCAGACAG CTATGCGGCA
GCGAAGAGCA GCCCGCACCT GGAACTGCTG CGTAAGAAAG GCATCGAAGT TCTGCTGCTT
TCCGACCGCA TCGATGAGTG GATGATGAAC TATCTGACTG AGTTCGACGG TAAACCGTTC
CAGTCTGTGT CTAAAGTTGA CGAGTCGCTG GAAAAACTGG CTGACGAAGT TGATGAGAGC
GCGAAAGAAG CGGAGAAAGC ACTGACTCCG TTCATCGACC GTGTGAAAGC CCTGCTCGGC
GAGCGCGTGA AAGATGTCCG TCTGACTCAC CGTCTGACCG ATACGCCAGC GATTGTCTCT
ACCGACGCGG ACGAAATGAG CACCCAGATG GCGAAACTGT TTGCCGCAGC GGGCCAGAAA
GTGCCGGAAG TGAAATACAT CTTCGAACTG AACCCGGATC ACGTACTGGT GAAACGTGCG
GCAGATACTG AAGATGAAGC CAAGTTCAGC GAGTGGGTAG AACTGCTGCT GGATCAGGCG
CTGCTGGCAG AACGCGGCAC GCTGGAAGAT CCGAACCTGT TTATTCGTCG TATGAACCAG
CTGCTGGTTT CCTGA
 
Protein sequence
MKGQETRGFQ SEVKQLLHLM IHSLYSNKEI FLRELISNAS DAADKLRFRA LSNPDLYEGD 
GELRVRVSFD KDKRTLTISD NGVGMTRDEV IDHLGTIAKS GTKSFLESLG SDQAKDSQLI
GQFGVGFYSA FIVADKVTVR TRAAGEKPEN GVFWESAGEG EYTVADITKE DRGTEITLHL
REGEDEFLDD WRVRSIISKY SDHIALPVEI EKREEKDGET VISWEKINKA QALWTRNKSE
ITDEEYKEFY KHIAHDFNDP LTWSHNRVEG KQEYTSLLYI PSQAPWDMWN RDHKHGLKLY
VQRVFIMDDA EQFMPNYLRF VRGLIDSSDL PLNVSREILQ DSTVTRNLRN ALTKRVLQML
EKLAKDDAEK YQTFWQQFGL VLKEGPAEDF ANQEAIAKLL RFASTHTDSS AQTVSLEDYV
SRMKEGQEKI YYITADSYAA AKSSPHLELL RKKGIEVLLL SDRIDEWMMN YLTEFDGKPF
QSVSKVDESL EKLADEVDES AKEAEKALTP FIDRVKALLG ERVKDVRLTH RLTDTPAIVS
TDADEMSTQM AKLFAAAGQK VPEVKYIFEL NPDHVLVKRA ADTEDEAKFS EWVELLLDQA
LLAERGTLED PNLFIRRMNQ LLVS