Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1538 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 1668885 |
End bp | 1671164 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | ACX39207 |
Protein GI | 260448785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.163739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC CGTTAATTGT CGGCATCCGG CATCATAGTC CGGCCTGCGC CCGGCTGGTG AAATCGTTAA TCGAAAGCCA GCGGCCACGA TACGTGTTGA TTGAAGGCCC GGCTGATTTT AATGACCGGG TAGACGAACT GTTTTTAGCC CACCAGCTTC CGGTAGCTAT TTACAGTTAT TGCCAGTATC AGGACGGTGC AGCCCCCGGG CGTGGTGCCT GGACGCCATT TGCTGAATTT TCGCCGGAGT GGCAGGCGCT ACAAGCCGCA CGTCGCATTC AGGCACAAAC TTACTTCATC GATTTGCCTT GCTGGGCGCA GAGTGAAGAA GAGGACGATT CGCCTGATAC GCAAGATGAA AGCCAGGCCT TACTGCTGCG TGCCACCCGC ATGGATAACA GCGATACCCT GTGGGATCAC TTGTTCGAAG ATGAAAGCCA GCAAACTGCA TTACCCTCTG CGCTGGCGCA CTATTTTGCC CAACTGCGGG GCGACGCCTC CGGCGATGCG CTCAATCGTC AGCGCGAAGC CTTTATGGCC CGCTGGATTG GATGGGCGAT GCAGCAAAAT AATGGCGACG TGTTAGTTGT CTGCGGTGGC TGGCACGCTC CGGCACTGGC AAAGATGTGG CGCGAATGCC CTCAGAAAAT TAACAAGCCA GAATTGCCCT CGCTGGCAGA TGCCGTTACA GGTTGTTATC TCACACCCTA CAGTGAAAAG CGCCTTGATG TGCTGGCAGG ATACCTTTCA GGAATGCCTG CCCCGGTATG GCAAAACTGG TGCTGGCAGT GGGGCTTGCA GAAGGCCGGT GAACAACTGC TAAAAACTAT CCTTACCCGT TTGCGCCAGC ACAAATTGCC CGCTTCTACC GCGGATATGG CTGCCGCTCA TCTGCATGCG ATGGCGCTGG CACAGTTGCG CGGTCATACA CTACCGTTAC GCACTGACTG GCTGGATGCC ATAGCAGGCT CGCTGATTAA AGAAGCCCTG AACGCGCCGT TGCCGTGGAG CTATCGCGGC GTTATTCATC CCGATACCGA TCCGATTCTG CTAACGTTGA TAGACACATT AGCGGGTGAC GGATTCGGTA AACTTGCCCC TTCTACGCCA CAACCGCCTC TGCCAAAAGA TGTCACCTGC GAACTGGAAC GTACCGCAAT CTCCCTTCCG GCGGAGCTTA CCTTAAATCG CTTTACCCCC GATGGGCTGG CGCAAAGTCA GGTGTTACAT CGGCTGGCAA TACTGGAGAT CCCTGGGATT GTACGCCAGC AGGGAAGTAC ACTGACACTT GCAGGCAACG GTGAAGAACG CTGGAAATTA ACCCGCCCGC TTAGCCAGCA TGCGGCATTG ATTGAGGCCG CCTGTTTTGG TGCCACACTC CAGGAAGCCG CACGCAATAA ATTAGAAGCC GATATGCTGG ACGCGGGCGG AATCGGCAGT ATCACCACAT GTCTTAGCCA GGCGGCGTTA GCGGGTCTGG CGTCCTTCAG TCAACAATTA CTGGAGCAAC TCACACTATT AATCGCCCAG GAAAATCAAT TTGCCGAAAT GGGCCAGGCG CTGGAAGTGC TTTATGCCTT ATGGCGGCTG GATGAAATTA GCGGTATGCA AGGCGCGCAG ATATTACAAA CGACGTTATG CGCGACTATC GATCGCACGC TGTGGCTGTG TGAATCTAAC GGCAGACCGG ATGAAAAGGA GTTTCACGCT CACCTGCATA GCTGGCAAGC GCTTTGCCAT ATTCTGCGCG ATCTACATAG CGGCGTTAAT TTACCCGGCG TTTCTCTTTC TGCGGCGGTA GCCTTACTGG AGCGACGCAG TCAGGCAATT CATGCCCCGG CGCTGGATCG CGGCGCGGCT CTTGGCGCAC TAATGCGTCT GGAACATCCC AACGCCAGTG CCGAAGCGGC GCTGACGATG CTGGCGCAGT TATCCCCGGC ACAATCTGGT GAGGCGCTGC ACGGTTTGCT GGCGCTGGCC CGCCATCAAC TGGCCTGTCA GCCGGCATTT ATCGCCGGTT TCAGCAGTCA TTTAAATCAA CTGAGTGAAG CCGATTTTAT TAACGCCCTG CCCGATTTAC GCGCGGCGAT GGCCTGGCTA CCACCACGAG AACGCGGGAC GCTGGCGCAT CAGGTGCTTG AGCATTATCA ACTGGCGCAA CTTCCCGTTT CGGCGCTGCA AATGCCGTTG CATTGTCCAC CACAGGCCAT TGCACATCAT CAACAACTCG AACAGCAGGC ACTGGCATCG CTGCAAAACT GGGGAGTTTT CCATGTCTGA
|
Protein sequence | MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELFLA HQLPVAIYSY CQYQDGAAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE EDDSPDTQDE SQALLLRATR MDNSDTLWDH LFEDESQQTA LPSALAHYFA QLRGDASGDA LNRQREAFMA RWIGWAMQQN NGDVLVVCGG WHAPALAKMW RECPQKINKP ELPSLADAVT GCYLTPYSEK RLDVLAGYLS GMPAPVWQNW CWQWGLQKAG EQLLKTILTR LRQHKLPAST ADMAAAHLHA MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDTLAGD GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFTP DGLAQSQVLH RLAILEIPGI VRQQGSTLTL AGNGEERWKL TRPLSQHAAL IEAACFGATL QEAARNKLEA DMLDAGGIGS ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWRL DEISGMQGAQ ILQTTLCATI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LPGVSLSAAV ALLERRSQAI HAPALDRGAA LGALMRLEHP NASAEAALTM LAQLSPAQSG EALHGLLALA RHQLACQPAF IAGFSSHLNQ LSEADFINAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ LPVSALQMPL HCPPQAIAHH QQLEQQALAS LQNWGVFHV
|
| |