Gene Information |
Locus tag | EcSMS35_0990 |
Symbol | |
ID | 6146757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1003964 |
End bp | 1005910 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615877 |
Product | hypothetical protein |
Protein accession | YP_001743069 |
Protein GI | 170681285 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
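The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is counted in the gene length but not in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(1003964, 1005910)
print(length)                  # 1947
print(protein_length(length))  # 648
```

Both values agree with the Gene Length and Protein Length rows of this record.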
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.169795 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACAA ATATAAAAGT ATTTACATCG ACAGATGAAT TGACCACTCT CGGGCGTGAA CTGGGCAAAG GCGGCGAAGG TGCGGTTTAT GATATCGAGG AGTTTGTCGA TAGCGTCGCC AAGATTTATC ACACGCCGCC ACCCGCCTTA AAACAGGACA AACTTGCCTT TATGGCTGCG ACAGCTGACG CGCAATTGTT GAATTATGTC GCCTGGCCGC AGGCAACGCT TCACGGTGGA CGAGGCGGAA AAGTAATCGG TTTTATGATG CCAAAAGTTT CTGGTAAAGA ACCGATTCAT ATGATCTATA GCCCGGCACA TCGTCGTCAG AGTTACCCTC ATTGTGCGTG GGATTTTCTC CTCTATGTTG CGCGCAATAT TGCTTCATCT TTTGCTACGG TTCACGAGCA CGGGCACGTT GTGGGTGACG TAAACCAGAA CAGCTTTATG GTAGGCCGTG ACAGCAAAGT GGTGTTGATT GATAGCGACT CCTTTCAGAT TAACGCCAAT GGAACGCTGC ATTTATGCGA AGTGGGCGTG TCGCATTTTA CGTCGCCAGA GCTGCAAACC TTGTCGTCAT TTGTTGGCTT TGAACGCACC GCGAATCACG ATAATTTTGG CCTTGCGTTG CTGATTTTTC ACGTCTTGTT TGGTGGTCGG CATCCTTATT CCGGTGTACC GCTTATCTCT GATGCGGGTA ATGCGCTGGA GACGGATATT GCCCATTTCC GTTATGCCTA CGCGTCAGAT AATCAGCGAC GTGGTTTAAA ACCGCCGCCA CGATCTATTC CGCTGTCGAT GTTACCGGGC GATGTTGAAG CCATGTTTCA GCAGGCATTC ACGGAAAGTG GCGTGGCAAC CGGGCGTCCG ACGGCAAAAG CGTGGGTAGC GGCACTTGAT TCTCTACGCC AACAATTAAA GAAATGTACC GTTTCGGCAA TGCATGTTTA TCCCGGTCAT TTGGCTGACT GCCCGTGGTG TGCTCTGGAT AATCAAGGCG TTATCTATTT TATTGATCTC GGCGAAGAGG TCATTACCAC CAGCGGTGAT TTTGTGCTGG CGAAAGTCTG GGCGATGGTG ATGGCGTCAG TAGCACCGCC AGCATTGCAA CTGCCATTAC CCGATCATTT CCAACCGACT GGCAGGCCGC TTCCTTTAGG CCTGTTAAGG CGTGAATACA TCATTCTGAT TGAGATCGCA CTGTCAGCGT TATCGCTGTT GCTTTGCGGC CTTCAGACAG AACCACGTTA TATTATTTTG GTTCCTGTGC TGTCGGCTAT CTGGATTATT GGCAGCCTGA CAAGCAAAGC GTACAAAGCA GAAATCCAGC AACGAAGAGA GGCTTTTAAT CGCGCAAAAA TGGACTATGA CCATTTAGTC AGCCAGATCC AACAGTTGGG CGGGCTGGAA GGTTTTATCG CCAAACGGAC GATGCTCGAA AAAATGAAGG ACGAAATTCT TGGGTTACCG GAAGAAGAAA AGCGCGATCT GGCAGCACTT CAGGACACCG CAAGGGAACG GCAGAAGCAG AAGTTTCTGG AGGGATTTTT TATTGATGTT GCCTCTATTC CCGGCGTTGG CCCTGCGCGT AAAGCGGCGT TACGGTCCTT TGGTATTGAA ACGGCGGCAG ACGTTACCCG ACGGAGCGTT AAGCAAGTAA AAGGTTTTGG TGATCATCTG ACCCAGGCGG TTATCGACTG GAAAGCGAGT TGCGAACGCC GTTTTGTTTT CAGGCCGAAC GAAGCGGTAA CGCCTGCAGA CAGACAAGCG GTACTGACTA AAATGGCCGC CAAACGACAT 
CGGCTGGAAT CGGCGTTGAC TGTCGGTGCG ACAGAGTTGC AGCGATTCCG CCTTCATGCT CCAGCACGGA CCATGCCGTT GATGGAGCCG CTACGTCAGG CGGCAGAAAA ACTGGCTCAG GCGCAGGCTG ATTTAAGCCG CTGCTGA
|
Protein sequence | MKTNIKVFTS TDELTTLGRE LGKGGEGAVY DIEEFVDSVA KIYHTPPPAL KQDKLAFMAA TADAQLLNYV AWPQATLHGG RGGKVIGFMM PKVSGKEPIH MIYSPAHRRQ SYPHCAWDFL LYVARNIASS FATVHEHGHV VGDVNQNSFM VGRDSKVVLI DSDSFQINAN GTLHLCEVGV SHFTSPELQT LSSFVGFERT ANHDNFGLAL LIFHVLFGGR HPYSGVPLIS DAGNALETDI AHFRYAYASD NQRRGLKPPP RSIPLSMLPG DVEAMFQQAF TESGVATGRP TAKAWVAALD SLRQQLKKCT VSAMHVYPGH LADCPWCALD NQGVIYFIDL GEEVITTSGD FVLAKVWAMV MASVAPPALQ LPLPDHFQPT GRPLPLGLLR REYIILIEIA LSALSLLLCG LQTEPRYIIL VPVLSAIWII GSLTSKAYKA EIQQRREAFN RAKMDYDHLV SQIQQLGGLE GFIAKRTMLE KMKDEILGLP EEEKRDLAAL QDTARERQKQ KFLEGFFIDV ASIPGVGPAR KAALRSFGIE TAADVTRRSV KQVKGFGDHL TQAVIDWKAS CERRFVFRPN EAVTPADRQA VLTKMAAKRH RLESALTVGA TELQRFRLHA PARTMPLMEP LRQAAEKLAQ AQADLSRC
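The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits), and it strips the spaces that the record uses to group the sequence into blocks of ten. The `translate` helper and `CODON_TABLE` name are hypothetical and written only for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block-of-ten spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAGACAA ATATAAAAGT ATTTACATCG"))  # MKTNIKVFTS
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole protein, provided the sequence is copied without loss.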