Gene EcSMS35_4284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4284 
Symbol 
ID6145104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4383843 
End bp4385678 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content51% 
IMG OID641619105 
Producthypothetical protein 
Protein accessionYP_001746229 
Protein GI170681573 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0326] Molecular chaperone, HSP90 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAT TACAGTTACC TGGTAGTTCA TATTCTACAG AAGTTAATTT AAACGGCTTA 
ATTGAGGTGC TCAGTAAGCA TCTTTACTCC ACTCCCGTGG TTGCCGTGCG CGAGCTGGTG
CAGAACGGCC ATGATGCGAT CGTTCGCCGC AGGATTGAGC AGCCCGATGC ACCAAAGGAT
AACGCGATTC GTGTGGTGGC AGACGTGGCG AAGTCCACTA TCACTATTAG CGATACTGGC
GCTGGACTGA CAGAAAGTGA AATTCACGGC TTCCTGGCGA CAGTAGGCGT GGGTTATACC
CGAATGTTGC GCCAGCAGGA TGACAACACC GGTTTAATTG GTATGTTCGG CCTCGGTTTT
TTGTCGGCCT TTGTGTTGGC GAAAGAGGTC ACGGTGTTGA CCACATCCTG GCAAACGCCG
GATCAGAGCT GGAAATACCA CTCTACCGAC GGGCAAAAAT ATACCGTTAC GCCGCATCAG
TCCTCGGAAA CGGGTACGCA GGTGATTCTG ACGCTCAAAG AAGAGTACAG CCATCTGGCG
AGTAACAATT TGCTGAACCG CGTTCTTTCC CGCTACTGTA TATTGCTGCA CGAACCGGTC
TATGTCGGCG ATGCCAGCGA GCCGGTAAAT AAACTTCAAC CACCGTGGCG TGAAGTTGCC
CCCGAAGGCG TAACCATGCA CCGCGCGCTG GTACAGCGTA AAAATCTCGC CTTTGCCGCC
CAGTTTGAAT CCTCCTTCGA ACCGATTTGC ACCATTCCGG TGGTGCCCGT GGGGATGAGC
GACGCGGTTG GGATTTTATG GATTCAGGAT GGCGCAACCT ACGGCACCAG CGATAACCGC
AACCTGTCGC TGTTTTTGCG CGGTATGTTA CTGGATGATG AAGCGCGTGA GTTGTTACCT
CCCTGGGCCG GATTTATTGG CGGCGTGATT GAGTCATCGA AACTAACGCC TACGGCGAGC
CGGGAAGATC TCCAGCGGGA CGAAACCTGG GTTGCGGTGC AGGAGGCGTT AAAAGAGGCG
CTGATTTCTG GTTTGTCCGA TCTCGCACAA AATCAGCCAG AAATCTGGCG GCGTGTATTA
ATGCGCCACA ACGAAGCGTT GCTCGGTGCG GCATTATGTG ATGACCGTCT GTTTGATTTG
CTCAAAGATC GCTTGCAGGT GCCAACGTCA AAAGGGGCGT TGCTGGCGAA GGATTTACGC
GTTAATAACA GCATTCATAT TCTGTTAAGC CGCGACGGCG GTTTTGAAGA GATGTTGTTC
CACATTCTGC AACGGCCCGT TGCCCGTGGC GATCGCTATG CCGTCGTGCC ATTTTTACGT
CGCTGGGCGC TGTTATATCA CTGCCGGATT GTCGAAGTCG GTACGCAAAC AGGTAATGAG
CAGTTGTTCA GCCTGGCGGA ATTACCCGAA GAGCAGGTAG CTTATCTGGA AGAGCATCTC
TGCGATGGCG AGCAATTAAT TATCTCCCGC TTCGAACCCG CCGTTTTACC GTTAGTGGTT
ACGCCAGACC GCGAAGCAGA ATTAAAACAA ATTCTCGAAC AGGATGACGC AGATAAACGC
ATCAGCACCG CAGCGTTAAT GCTGGCGCGG CAATTTACTT CACAAATCCA AAAAACGAAA
ACCTCAAGTT TATACATCAA CCTTAATAAC CCTTGCATCA TGCAACTGGT GACGGCATTA
CAACACCAGC AACAGCCCGC AGCGGCATTA CGCTTATTAA AATCGCTGAA AGTGATTTTG
TGCTCCAGCG GTAATAAAGA ACAGCAGTGG GATTTACACC AGGCACTGGA AGATTTTACT
CAGGTTATTC CTGTCTTAAT TAATCAAGGA AAATAA
 
Protein sequence
MSTLQLPGSS YSTEVNLNGL IEVLSKHLYS TPVVAVRELV QNGHDAIVRR RIEQPDAPKD 
NAIRVVADVA KSTITISDTG AGLTESEIHG FLATVGVGYT RMLRQQDDNT GLIGMFGLGF
LSAFVLAKEV TVLTTSWQTP DQSWKYHSTD GQKYTVTPHQ SSETGTQVIL TLKEEYSHLA
SNNLLNRVLS RYCILLHEPV YVGDASEPVN KLQPPWREVA PEGVTMHRAL VQRKNLAFAA
QFESSFEPIC TIPVVPVGMS DAVGILWIQD GATYGTSDNR NLSLFLRGML LDDEARELLP
PWAGFIGGVI ESSKLTPTAS REDLQRDETW VAVQEALKEA LISGLSDLAQ NQPEIWRRVL
MRHNEALLGA ALCDDRLFDL LKDRLQVPTS KGALLAKDLR VNNSIHILLS RDGGFEEMLF
HILQRPVARG DRYAVVPFLR RWALLYHCRI VEVGTQTGNE QLFSLAELPE EQVAYLEEHL
CDGEQLIISR FEPAVLPLVV TPDREAELKQ ILEQDDADKR ISTAALMLAR QFTSQIQKTK
TSSLYINLNN PCIMQLVTAL QHQQQPAAAL RLLKSLKVIL CSSGNKEQQW DLHQALEDFT
QVIPVLINQG K