Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_02049 |
Symbol | yehM |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 2100557 |
End bp | 2102836 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | hypothetical protein |
Protein accession | ACT43873 |
Protein GI | 253978203 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC CGTTAATTGT CGGCATCCGG CATCATAGTC CGGCCTGCGC CCGGCTGGTG AAATCGTTAA TCGAAAGCCA GCGGCCACGA TACGTGTTGA TTGAAGGCCC GGCTGATTTT AATGACCGGG TAGACGAACT GTTTTTAGCC CACCAGCTTC CGGTAGCTAT TTACAGTTAT TGCCAGTATC AGGACGGTGC AGCCCCCGGG CGTGGTGCCT GGACGCCATT TGCTGAATTT TCGCCGGAGT GGCAGGCGCT ACAAGCCGCA CGTCGCATTC AGGCACAAAC TTACTTCATC GATTTGCCTT GCTGGGCGCA GAGTGAAGAA GAGGACGATT CGCCTGATAC GCAAGATGAA AGCCAGGCCT TACTGCTGCG TGCCACCCGC ATGGATAACA GCGATACCCT GTGGGATCAC TTGTTCGAAG ATGAAAGCCA GCAAACTGCA TTACCCTCTG CGCTGGCGCA CTATTTTGCC CAACTGCGGG GCGACGCCTC CGGCGATGCG CTCAATCGTC AGCGCGAAGC CTTTATGGCC CGCTGGATTG GATGGGCGAT GCAGCAAAAT AATGGCGACG TGTTAGTTGT CTGCGGTGGC TGGCACGCTC CGGCACTGGC AAAGATGTGG CGCGAATGCC CTCAGAAAAT TAACAAGCCA GAATTGCCCT CGCTGGCAGA TGCCGTTACA GGTTGTTATC TCACACCCTA CAGTGAAAAG CGCCTTGATG TGCTGGCAGG ATACCTTTCA GGAATGCCTG CCCCGGTATG GCAAAACTGG TGCTGGCAGT GGGGCTTGCA GAAGGCCGGT GAACAACTGC TAAAAACTAT CCTTACCCGT TTGCGCCAGC ACAAATTGCC CGCTTCTACC GCGGATATGG CTGCCGCTCA TCTGCATGCG ATGGCGCTGG CACAGTTGCG CGGTCATACA CTACCGTTAC GCACTGACTG GCTGGATGCC ATAGCAGGCT CGCTGATTAA AGAAGCCCTG AACGCGCCGT TGCCGTGGAG CTATCGCGGC GTTATTCATC CCGATACCGA TCCGATTCTG CTAACGTTGA TAGACAAATT AGCGGGTGAC GGATTCGGTA AACTTGCCCC TTCTACACCA CAACCGCCTC TGCCAAAAGA TGTCACCTGC GAACTGGAAC GTACCGCAAT CTCTCTTCCG GCGGAGCTTA CCTTAAATCG CTTTAACCCC AATGGACTAG CGCAAAGTCA GGTGTTACAT CGGCTGGCAA TACTGGAGAT CCCAGGGATT GTACGCCAGC AGGGAAGCAC ACTGACACTT GCAGGCAACG GTGAAGAACG CTGGAAATTA ACCCGCCCGC TTAGCCAGCA TGCGGCATTG ATTGAGGCCG CCTGCTTTGG TGCCACACTC CAGGAAGCCG CACGCCATAA ATTAGAAGCC GATATGCTGG ACGCGGGTGG AATCGGCAGT ATCACCACAT GTCTTAGCCA GGCGGCGTTA GCGGGTCTGG CGTCCTTCAG TCAACAATTA CTGGAGCAAC TCACATTATT AATCGCCCAG GAAAATCAAT TTGCCGAAAT GGGCCAGGCG CTGGAAGTGC TATATGCCTT ATGGCGGCTG GATGAAATTA GCGGTATGCA AGGCGCGCAG ATATTACAAA CGACGTTATG CGCGGCTATC GATCGCACGC TGTGGCTGTG TGAATCTAAC GGCAGGCCGG ATGAAAAGGA GTTTCACGCT CACCTGCATA GCTGGCAAGC GCTTTGCCAT ATTCTGCGCG ATCTACATAG CGGCGTTAAT TTATCCGGCG TTTCGCTTTC TGCGGCGGTA GCCTTACTGG AGCGACGCAG TCAGGCGATT CATGCCCCGG CGCTGGATCG CGGCGCGGTT CTTGGCGCAC TAATGCGTCT GGAACATCCC AACGCCAGTG CCGAAGCGGC GCTGACGATG CTGGCGCAGT TATCCCCGGC ACAATCCGGC GAGGCGCTGC ACGGTTTGCT GGCATTAGCC CGTCATCAAC TGGCCTGTCA GCCGGCATTT ATCGCCGGTT TCAGCAGTCA TTTAAATCAA CTAAGTGATG CCGATTTTAC CAATGCCCTG CCCGATTTAC GCGCGGCAAT GGCCTGGCTA CCGCCACGAG AACGCGGGAC GCTGGCGCAT CAGGTGCTTG AGCATTATCA ACTGGCGCAA CTTCCCGTTT CGGCACTGCA AATGCCGTTG CATTGTCCAC CGCAAGCCAT TGCACATCAT CAACAACTCG AACAGCAGGC ACTGGCATCG CTGCAACACT GGGGAGTTTT CCATGTCTGA
|
Protein sequence | MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELFLA HQLPVAIYSY CQYQDGAAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE EDDSPDTQDE SQALLLRATR MDNSDTLWDH LFEDESQQTA LPSALAHYFA QLRGDASGDA LNRQREAFMA RWIGWAMQQN NGDVLVVCGG WHAPALAKMW RECPQKINKP ELPSLADAVT GCYLTPYSEK RLDVLAGYLS GMPAPVWQNW CWQWGLQKAG EQLLKTILTR LRQHKLPAST ADMAAAHLHA MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDKLAGD GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFNP NGLAQSQVLH RLAILEIPGI VRQQGSTLTL AGNGEERWKL TRPLSQHAAL IEAACFGATL QEAARHKLEA DMLDAGGIGS ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWRL DEISGMQGAQ ILQTTLCAAI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LSGVSLSAAV ALLERRSQAI HAPALDRGAV LGALMRLEHP NASAEAALTM LAQLSPAQSG EALHGLLALA RHQLACQPAF IAGFSSHLNQ LSDADFTNAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ LPVSALQMPL HCPPQAIAHH QQLEQQALAS LQHWGVFHV
|
| |