Gene Information |
Locus tag | EcHS_A4513 |
Symbol | |
ID | 5593018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4515915 |
End bp | 4517111 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640923609 |
Product | hypothetical protein |
Protein accession | YP_001461050 |
Protein GI | 157163732 |
COG category | [S] Function unknown |
COG ID | [COG4269] Predicted membrane protein |
TIGRFAM ID | |
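The length fields in this record follow from its coordinates. Assuming Start bp and End bp are 1-based and inclusive (which is consistent with the values listed), the gene length is End minus Start plus one, and for an unspliced CDS the protein length is one third of that minus the stop codon. A minimal Python sketch of that check; the function names are illustrative only and are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4515915, 4517111)
print(length, protein_length(length))  # 1197 398
```

Both results agree with the Gene Length and Protein Length rows of the record.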
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.00333203 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTCAAG TTATTAATGA AATGGATGTT CCGTCCCATT CGTTTGTTTT TCATGGTACA GGTGAGAGAT ATTTTCTTAT TTGTGTGGTG AATGTGTTGT TAACGATTAT AACGCTAGGT ATCTATTTGC CATGGGCATT AATGAAATGT AAGCGTTATC TCTATGCTAA TATGGAAGTT AACGGACAAC GATTTTCTTA TGGAATTACT GGTGGGAATG TTTTTGTTAG TTGTCTTGTT TTTGTTTTTT GCTATTTCGC AATCTTAATG ACAGTGTCAG CAGATATGCC ACTTGTTGGC TGTGTTTTGA CTTTGTTACT GTTGGTTTTG CTTATATTTA TGGCAGCAAA AGGACTGCGT TATCAGGCCT TGATGACCAG TCTCAACGGC GTAAGATTTA GTTTTAATTG CTCTATGAAA GGGTTCTGGT GGGTAACCTT TTTCTTGCCG ATTTTAATGG CCATTGGGAT GGGGACTGTT TTCTTTATCT CGACAAAGAT GCTACATGCC AATAGTTCAA GTAGTGTTAT TATATCTGTG GTTCTGATGG CAATAGTTGG TATTGTTTCC ATTGGTATTT TTAATGGTAC TTTATATAGC CTGGTAATGA GTTTTCTCTG GAGCAATACC AGTTTCGGTA TACATCGTTT CAAGGTGAAA TTAGATACTA CGTATTGTAT AAAATATGCC ATTCTCGCAT TTTTAGCTTT ATTACCTTTT CTCGCTGTTG CTGGTTATAT TATCTTCGAT CAAATATTAA ATGCATATGA TAGTTCTGTG TATGCAAATG ATGATATTGA GAATTTACAG CAATTTATGG AAATGCAACG TAAAATGATA ATCGCGCAGT TAATCTATTA TTTTGGGATT GCTGTTAGCA CCAGTTATTT AACGGTGTCG TTGCGAAATC ATTTTATGAG CAACCTGTCA CTGAATGATG GGCGTATTCG TTTTCGCTCA ACTTTAACGT ACCACGGTAT GCTTTATCGC ATGTGTGCGT TGGTGGTGAT ATCCGGGATT ACGGGCGGTC TGGCTTATCC ACTGCTGAAA TTATGGATGA TTGACTGGCA GGCAAAAAAT ACGTATTTGC TGGGCGATTT GGATGACCTT CCTTTAATCA ATAAAGAAGA ACAACCAGAT AAAGGCTTCT TAGCCAGGAT TTCACGGGGA ATTATGCCTT CTTTACCATT TCTGTAA
Protein sequence | MAQVINEMDV PSHSFVFHGT GERYFLICVV NVLLTIITLG IYLPWALMKC KRYLYANMEV NGQRFSYGIT GGNVFVSCLV FVFCYFAILM TVSADMPLVG CVLTLLLLVL LIFMAAKGLR YQALMTSLNG VRFSFNCSMK GFWWVTFFLP ILMAIGMGTV FFISTKMLHA NSSSSVIISV VLMAIVGIVS IGIFNGTLYS LVMSFLWSNT SFGIHRFKVK LDTTYCIKYA ILAFLALLPF LAVAGYIIFD QILNAYDSSV YANDDIENLQ QFMEMQRKMI IAQLIYYFGI AVSTSYLTVS LRNHFMSNLS LNDGRIRFRS TLTYHGMLYR MCALVVISGI TGGLAYPLLK LWMIDWQAKN TYLLGDLDDL PLINKEEQPD KGFLARISRG IMPSLPFL
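The gene and protein sequences in this record are printed in space-separated blocks of ten. As a rough sketch (not the database's own tooling), the Python below strips that grouping, translates a coding sequence, and computes GC content. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which codons may initiate translation), and all helper names are hypothetical. The example uses only the first 30 bases of the gene sequence.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not recoded to M here;
    the gene in this record starts with ATG, so that case does not arise.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


prefix = "ATGGCTCAAG TTATTAATGA AATGGATGTT"  # first 30 bases of the gene sequence
print(translate(prefix))  # MAQVINEMDV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content rows to be checked the same way.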