Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3212 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3455202 |
End bp | 3458348 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | exonuclease SbcC |
Protein accession | ACX40838 |
Protein GI | 260450416 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.223053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CTATTACCGG CCCAACAGGT GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC CGAATGTCTG GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTACCGTG CATTCTGGAG CCAGAATCGG GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC ACGTTGCTCA CGCCGGAACA AGTGCAATCG CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTAATTAC CGCGCAGCAG CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCAGGACG AATTGCAGCA AGAAGCCAGC CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG GCGGCGCTTA GTCTGGCACA ACCGGCACGA AATCTTCGTC CACACTGGGA ACGCATCGCA GAACACAGCG CGGCGCTGGC GCATATTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA CAGAGCACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG TGGAACAACG AACCGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG GCGATCACGT TGACGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG CAACGCCCAC TGCGTCAGCA CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACGC AAGAACAGAC GCAACGTAAC GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA CAGGCGGGTC AGCCTTGCCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA AAGCTCGGTG AAGAAGGTGC GACGCTACGT GGGCAACTGG ACGCCATAAC AAAGCAGCTT CAGCGTGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACTGGACGA TATTCAACCG TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA CAACGCCAGC AACTACTTTT AACGACATTG ACGGGTTATG CACTGACATT GCCACAGGAA GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTATTGGA AAACTGGCGG CAGGTACATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACTCTGGTC ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGA CGGGTTGGCT CTCACTGTGA CGGTGGAGCA GATTCAGCAA GAGTTAGCGC AAACTCACCA AAAGTTGCGT GAAAACACCA CGAGTCAAGG CGAGATTCGC CAGCAGCTGA AGCAGGATGC AGATAACCGT CAGCAACAAC AAACCTTAAT GCAGCAAATT GCTCAAATGA CGCAGCAGGT TGAGGACTGG GGATATCTGA ATTCGCTAAT AGGTTCCAAA GAGGGCGATA AATTCCGCAA GTTTGCCCAG GGGCTGACGC TGGATAATTT AGTCCATCTC GCTAATCAGC AACTTACCCG GCTGCACGGG CGCTATCTGT TACAGCGCAA AGCCAGCGAG GCGCTGGAAG TCGAGGTTGT TGATACCTGG CAGGCAGATG CGGTACGCGA TACCCGTACC CTTTCCGGCG GCGAAAGTTT CCTCGTTAGT CTGGCGCTGG CGCTGGCGCT TTCGGATCTG GTCAGCCATA AAACACGTAT TGACTCGCTG TTCCTTGATG AAGGTTTTGG CACGCTGGAT AGCGAAACGC TGGATACCGC CCTTGATGCG CTGGATGCCC TGAACGCCAG TGGCAAAACC ATCGGTGTGA TTAGCCACGT AGAAGCGATG AAAGAGCGTA TTCCGGTGCA GATCAAAGTG AAAAAGATCA ACGGCCTGGG CTACAGCAAA CTGGAAAGTA CGTTTGCAGT GAAATAA
|
Protein sequence | MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE IYGQISAMVF EQHKSARTEL EKLQAQASGV TLLTPEQVQS LTASLQVLTD EEKQLITAQQ QEQQSLNWLT RQDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA EHSAALAHIR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ WNNEPAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VATALAQHAE QRPLRQHLVA LHGQIVPQQK RLAQLQVAIQ NVTQEQTQRN AALNEMRQRY KEKTQQLADV KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK KLGEEGATLR GQLDAITKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPLDDIQP WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQLLLTTL TGYALTLPQE DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDDGLA LTVTVEQIQQ ELAQTHQKLR ENTTSQGEIR QQLKQDADNR QQQQTLMQQI AQMTQQVEDW GYLNSLIGSK EGDKFRKFAQ GLTLDNLVHL ANQQLTRLHG RYLLQRKASE ALEVEVVDTW QADAVRDTRT LSGGESFLVS LALALALSDL VSHKTRIDSL FLDEGFGTLD SETLDTALDA LDALNASGKT IGVISHVEAM KERIPVQIKV KKINGLGYSK LESTFAVK
|
| |