Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1961 |
Symbol | |
ID | 6145482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1984471 |
End bp | 1986003 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616837 |
Product | SpoVR family protein |
Protein accession | YP_001744013 |
Protein GI | 170681824 |
COG category | [S] Function unknown |
COG ID | [COG2719] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00693869 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA TCGATTCTAT GAATAAGGAC ACCACACGTT TGAGCGATGG ACCCGACTGG ACGTTCGACC TGCTGGATGT TTATCTGGCA GAGATAGACC GGGTGGCGAA ACTCTACCGG CTGGATACCT ACCCGCACCA GATTGAAGTG ATAACCTCAG AACAGATGAT GGATGCCTAC TCCAGCGTCG GCATGCCAAT TAACTATCCG CACTGGTCAT TCGGTAAAAA GTTTATCGAG ACTGAACGGC TGTATAAGCA CGGTCAGCAA GGACTGGCCT ATGAAATCGT CATTAACTCT AACCCGTGTA TCGCTTACCT GATGGAAGAA AACACCATTA CCATGCAAGC GCTGGTGATG GCACACGCCT GCTATGGGCA TAACTCTTTC TTCAAAAACA ATTACTTATT CCGTAGCTGG ACCGACGCCA GTTCGATTGT CGATTATCTG ATTTTTGCCC GTAAATATAT TACCGAGTGC GAAGAGCGCT ATGGTGTGGA TGAAGTAGAA CGGCTGCTGG ACTCGTGCCA TGCGCTGATG AACTACGGCG TGGACCGGTA TAAACGCCCA CAAAAAATCT CGCTGCAAGA GGAGAAAGCC CGGCAGAAAA GCCGCGAAGA GTACTTGCAA AGTCAGGTCA ATATGCTCTG GCGTACCCTG CCGAAGCGCG AGGAAGAGAA AACGGTTGCT GAAGCGCGCC GCTATCCGTC CGAGCCACAA GAAAACCTGC TCTATTTTAT GGAGAAAAAT GCGCCACTGC TGGAATCATG GCAGCGTGAA ATTCTGCGTA TTGTGCGTAA GGTGAGCCAG TATTTTTATC CGCAAAAACA GACTCAGGTG ATGAACGAAG GCTGGGCGAC CTTCTGGCAC TACACCATCC TTAACCATCT GTATGATGAA GGGAAAGTAA CGGAACGTTT TATGCTGGAG TTTTTGCACA GCCACACCAA TGTGGTCTTC CAGCCGCCCT ATAACAGCCC GTGGTACAGC GGCATCAACC CGTATGCCCT CGGGTTCGCC ATGTTCCAGG ATATTAAACG GATTTGTCAG TCGCCAACGG AAGAAGACAA ATACTGGTTC CCGGATATCG CCGGTTCTGA CTGGCTGGAC ACATTACATT TTGCGATGCG TGATTTCAAA GATGAGAGCT TTATCAGTCA GTTCCTGTCA CCGAAAGTGA TGCGTGATTT CCGCTTCTTC ACCGTGCTGG ACGACGATCG GCATAATTAT CTGGAGATTT CCGCTATTCA TAATGAAGAA GGTTATCGGG AGATCCGTAA CCGGTTATCG TCGCAATATA ATTTAAGTAA TCTGGAGCCG AATATTCAGA TCTGGAACGT GGATTTGCGC GGCGACCGTT CGCTGACGCT GCGTTACATT CCACATAATC GCGCACCGCT GGATCGGGGG CGCAAAGAAG TACTGAAGCA TGTGCATCGC CTGTGGGGAT TTGATGTGAT GCTGGAACAG CAAAACGAAG ACGGCAGCGT CGAGTTGCTG GAACGTTGCC CGCCGAGAAT GGGAAATCTG TAA
|
Protein sequence | MATIDSMNKD TTRLSDGPDW TFDLLDVYLA EIDRVAKLYR LDTYPHQIEV ITSEQMMDAY SSVGMPINYP HWSFGKKFIE TERLYKHGQQ GLAYEIVINS NPCIAYLMEE NTITMQALVM AHACYGHNSF FKNNYLFRSW TDASSIVDYL IFARKYITEC EERYGVDEVE RLLDSCHALM NYGVDRYKRP QKISLQEEKA RQKSREEYLQ SQVNMLWRTL PKREEEKTVA EARRYPSEPQ ENLLYFMEKN APLLESWQRE ILRIVRKVSQ YFYPQKQTQV MNEGWATFWH YTILNHLYDE GKVTERFMLE FLHSHTNVVF QPPYNSPWYS GINPYALGFA MFQDIKRICQ SPTEEDKYWF PDIAGSDWLD TLHFAMRDFK DESFISQFLS PKVMRDFRFF TVLDDDRHNY LEISAIHNEE GYREIRNRLS SQYNLSNLEP NIQIWNVDLR GDRSLTLRYI PHNRAPLDRG RKEVLKHVHR LWGFDVMLEQ QNEDGSVELL ERCPPRMGNL
|
| |