Gene Information |
Locus tag | EcSMS35_4284 |
Symbol | |
ID | 6145104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4383843 |
End bp | 4385678 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619105 |
Product | hypothetical protein |
Protein accession | YP_001746229 |
Protein GI | 170681573 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0326] Molecular chaperone, HSP90 family |
TIGRFAM ID | |
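The coordinate and length fields above are mutually consistent, which a short check can make explicit. The following is a minimal sketch, assuming 1-based inclusive coordinates and a complete, unspliced CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4383843, 4385678)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length and Protein Length.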
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAT TACAGTTACC TGGTAGTTCA TATTCTACAG AAGTTAATTT AAACGGCTTA ATTGAGGTGC TCAGTAAGCA TCTTTACTCC ACTCCCGTGG TTGCCGTGCG CGAGCTGGTG CAGAACGGCC ATGATGCGAT CGTTCGCCGC AGGATTGAGC AGCCCGATGC ACCAAAGGAT AACGCGATTC GTGTGGTGGC AGACGTGGCG AAGTCCACTA TCACTATTAG CGATACTGGC GCTGGACTGA CAGAAAGTGA AATTCACGGC TTCCTGGCGA CAGTAGGCGT GGGTTATACC CGAATGTTGC GCCAGCAGGA TGACAACACC GGTTTAATTG GTATGTTCGG CCTCGGTTTT TTGTCGGCCT TTGTGTTGGC GAAAGAGGTC ACGGTGTTGA CCACATCCTG GCAAACGCCG GATCAGAGCT GGAAATACCA CTCTACCGAC GGGCAAAAAT ATACCGTTAC GCCGCATCAG TCCTCGGAAA CGGGTACGCA GGTGATTCTG ACGCTCAAAG AAGAGTACAG CCATCTGGCG AGTAACAATT TGCTGAACCG CGTTCTTTCC CGCTACTGTA TATTGCTGCA CGAACCGGTC TATGTCGGCG ATGCCAGCGA GCCGGTAAAT AAACTTCAAC CACCGTGGCG TGAAGTTGCC CCCGAAGGCG TAACCATGCA CCGCGCGCTG GTACAGCGTA AAAATCTCGC CTTTGCCGCC CAGTTTGAAT CCTCCTTCGA ACCGATTTGC ACCATTCCGG TGGTGCCCGT GGGGATGAGC GACGCGGTTG GGATTTTATG GATTCAGGAT GGCGCAACCT ACGGCACCAG CGATAACCGC AACCTGTCGC TGTTTTTGCG CGGTATGTTA CTGGATGATG AAGCGCGTGA GTTGTTACCT CCCTGGGCCG GATTTATTGG CGGCGTGATT GAGTCATCGA AACTAACGCC TACGGCGAGC CGGGAAGATC TCCAGCGGGA CGAAACCTGG GTTGCGGTGC AGGAGGCGTT AAAAGAGGCG CTGATTTCTG GTTTGTCCGA TCTCGCACAA AATCAGCCAG AAATCTGGCG GCGTGTATTA ATGCGCCACA ACGAAGCGTT GCTCGGTGCG GCATTATGTG ATGACCGTCT GTTTGATTTG CTCAAAGATC GCTTGCAGGT GCCAACGTCA AAAGGGGCGT TGCTGGCGAA GGATTTACGC GTTAATAACA GCATTCATAT TCTGTTAAGC CGCGACGGCG GTTTTGAAGA GATGTTGTTC CACATTCTGC AACGGCCCGT TGCCCGTGGC GATCGCTATG CCGTCGTGCC ATTTTTACGT CGCTGGGCGC TGTTATATCA CTGCCGGATT GTCGAAGTCG GTACGCAAAC AGGTAATGAG CAGTTGTTCA GCCTGGCGGA ATTACCCGAA GAGCAGGTAG CTTATCTGGA AGAGCATCTC TGCGATGGCG AGCAATTAAT TATCTCCCGC TTCGAACCCG CCGTTTTACC GTTAGTGGTT ACGCCAGACC GCGAAGCAGA ATTAAAACAA ATTCTCGAAC AGGATGACGC AGATAAACGC ATCAGCACCG CAGCGTTAAT GCTGGCGCGG CAATTTACTT CACAAATCCA AAAAACGAAA ACCTCAAGTT TATACATCAA CCTTAATAAC CCTTGCATCA TGCAACTGGT GACGGCATTA CAACACCAGC AACAGCCCGC AGCGGCATTA CGCTTATTAA AATCGCTGAA AGTGATTTTG TGCTCCAGCG GTAATAAAGA ACAGCAGTGG GATTTACACC AGGCACTGGA AGATTTTACT 
CAGGTTATTC CTGTCTTAAT TAATCAAGGA AAATAA
|
Protein sequence | MSTLQLPGSS YSTEVNLNGL IEVLSKHLYS TPVVAVRELV QNGHDAIVRR RIEQPDAPKD NAIRVVADVA KSTITISDTG AGLTESEIHG FLATVGVGYT RMLRQQDDNT GLIGMFGLGF LSAFVLAKEV TVLTTSWQTP DQSWKYHSTD GQKYTVTPHQ SSETGTQVIL TLKEEYSHLA SNNLLNRVLS RYCILLHEPV YVGDASEPVN KLQPPWREVA PEGVTMHRAL VQRKNLAFAA QFESSFEPIC TIPVVPVGMS DAVGILWIQD GATYGTSDNR NLSLFLRGML LDDEARELLP PWAGFIGGVI ESSKLTPTAS REDLQRDETW VAVQEALKEA LISGLSDLAQ NQPEIWRRVL MRHNEALLGA ALCDDRLFDL LKDRLQVPTS KGALLAKDLR VNNSIHILLS RDGGFEEMLF HILQRPVARG DRYAVVPFLR RWALLYHCRI VEVGTQTGNE QLFSLAELPE EQVAYLEEHL CDGEQLIISR FEPAVLPLVV TPDREAELKQ ILEQDDADKR ISTAALMLAR QFTSQIQKTK TSSLYINLNN PCIMQLVTAL QHQQQPAAAL RLLKSLKVIL CSSGNKEQQW DLHQALEDFT QVIPVLINQG K
|
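The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). The names `clean_sequence`, `gc_fraction` and `translate` are hypothetical helpers, not part of any database API.

```python
# Codon table in NCBI order: first, second and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumption: the sequence starts with ATG, as this record does. Alternative
    start codons (for example GTG or TTG) are not converted to M here.
    A base other than A, C, G or T raises ValueError.
    """
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGAGTACAT TACAGTTACC TGGTAGTTCA"))
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of the record.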