Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1291 |
Symbol | |
ID | 5594109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1284148 |
End bp | 1285680 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920448 |
Product | SpoVR family protein |
Protein accession | YP_001458009 |
Protein GI | 157160691 |
COG category | [S] Function unknown |
COG ID | [COG2719] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000000000558794 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGA TCGATTCTAT GAATAAGGAC ACCACACGTT TGAGCGATGG ACCCGACTGG ACGTTCGACC TGCTGGATGT TTATCTGGCA GAGATAGACC GGGTGGCGAA ACTCTACCGG CTGGATACCT ACCCGCACCA GATTGAAGTG ATAACCTCAG AACAGATGAT GGATGCCTAC TCCAGCGTCG GCATGCCAAT TAACTATCCG CACTGGTCAT TCGGTAAAAA GTTTATCGAG ACTGAACGGC TGTATAAGCA CGGTCAGCAA GGACTGGCCT ATGAAATCGT CATTAACTCT AACCCGTGTA TCGCTTACCT GATGGAAGAG AACACCATTA CCATGCAAGC GCTGGTGATG GCTCATGCCT GCTATGGGCA TAACTCTTTC TTTAAAAACA ATTACTTATT CCGTAGCTGG ACCGACGCCA GTTCGATTGT CGATTATCTG ATTTTTGCCC GTAAATATAT TACCGAGTGC GAAGAGCGTT ATGGCGTTGA TGAAGTAGAA CGGCTTCTGG ACTCGTGCCA CGCGCTGATG AACTACGGCG TGGACCGGTA CAAACGCCCG CAAAAAATCT CGCTGCAAGA AGAGAAAGCC CGGCAGAAAA GTCGCGAAGA GTATCTGCAA AGTCAGGTCA ATATGCTCTG GCGTACCCTG CCGAAGCGCG AGGAAGAGAA AACGGTTGCT GAAGCGCGCC GCTATCCGTC CGAACCACAA GAAAACCTGC TCTATTTTAT GGAGAAAAAT GCGCCACTGC TGGAATCATG GCAGCGTGAA ATCCTGCGTA TTGTGCGTAA GGTGAGCCAG TATTTTTATC CGCAAAAACA GACTCAGGTG ATGAACGAAG GCTGGGCGAC CTTCTGGCAC TACACCATCC TTAACCATCT GTATGATGAA GGGAAAGTAA CGGAACGTTT TATGCTGGAG TTTTTGCACA GCCACACCAA TGTGGTCTTC CAGCCCCCCT ATAACAGCCC GTGGTACAGC GGCATCAACC CGTATGCCCT CGGGTTCGCC ATGTTCCAGG ATATTAAACG GATTTGTCAG TCGCCAACGG AAGAAGACAA ATACTGGTTC CCGGATATCG CCGGTTCCGA CTGGCTGGAA ACGCTGCATT TCGCGATGCG TGATTTCAAA GATGAGAGTT TTATCAGCCA GTTCCTGTCA CCGAAAGTGA TGCGTGATTT CCGCTTCTTC ACCGTGCTGG ATGACGATCG GCATAATTAT CTGGAGATTT CCGCTATTCA TAATGAAGAA GGTTATCGGG AGATCCGTAA CCGGTTATCG TCGCAATATA ACTTAAGTAA TCTGGAGCCG AATATTCAGA TCTGGAACGT GGATTTGCGC GGCGACCGTT CGCTGACGCT GCGTTATATT CCACATAATC GCGCACCGCT GGATCGGGGG CGCAAAGAAG TCCTGAAGCA TGTGCATCGC CTGTGGGGAT TTGATGTGAT GCTCGAACAG CAAAACGAAG ACGGCAGCAT CGAGTTGCTG GAACGTTGCC CGCCAAGAAT GGGAAATCTG TAA
|
Protein sequence | MATIDSMNKD TTRLSDGPDW TFDLLDVYLA EIDRVAKLYR LDTYPHQIEV ITSEQMMDAY SSVGMPINYP HWSFGKKFIE TERLYKHGQQ GLAYEIVINS NPCIAYLMEE NTITMQALVM AHACYGHNSF FKNNYLFRSW TDASSIVDYL IFARKYITEC EERYGVDEVE RLLDSCHALM NYGVDRYKRP QKISLQEEKA RQKSREEYLQ SQVNMLWRTL PKREEEKTVA EARRYPSEPQ ENLLYFMEKN APLLESWQRE ILRIVRKVSQ YFYPQKQTQV MNEGWATFWH YTILNHLYDE GKVTERFMLE FLHSHTNVVF QPPYNSPWYS GINPYALGFA MFQDIKRICQ SPTEEDKYWF PDIAGSDWLE TLHFAMRDFK DESFISQFLS PKVMRDFRFF TVLDDDRHNY LEISAIHNEE GYREIRNRLS SQYNLSNLEP NIQIWNVDLR GDRSLTLRYI PHNRAPLDRG RKEVLKHVHR LWGFDVMLEQ QNEDGSIELL ERCPPRMGNL
|
| |