Gene EcDH1_3212 details


Gene Information

Locus tag: EcDH1_3212
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3455202
End bp: 3458348
Gene Length: 3147 bp
Protein Length: 1048 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: exonuclease SbcC
Protein accession: ACX40838
Protein GI: 260450416
COG category:
COG ID:
TIGRFAM ID:
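
The coordinates and lengths listed above are mutually consistent. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, since the record does not state its coordinate convention); the variable names are illustrative only:

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 3455202
end_bp = 3458348

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 3147 1048
```

Both results match the "Gene Length" and "Protein Length" fields.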


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.223053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT 
GATTTCACCC GCGAGCCGTT CGCCAGCAAC GGGCTGTTTG CTATTACCGG CCCAACAGGT
GCGGGGAAAA CCACCCTGCT GGACGCCATT TGTCTGGCGC TGTATCACGA AACTCCGCGT
CTCTCTAACG TTTCACAATC GCAAAATGAT CTCATGACCC GCGATACCGC CGAATGTCTG
GCGGAGGTGG AGTTTGAAGT GAAAGGTGAA GCGTACCGTG CATTCTGGAG CCAGAATCGG
GCGCGTAACC AACCCGACGG TAATTTGCAG GTGCCACGCG TAGAGCTGGC GCGCTGCGCC
GACGGCAAAA TTCTCGCCGA CAAAGTGAAA GATAAGCTGG AACTGACAGC GACGTTAACC
GGGCTGGATT ACGGGCGCTT CACCCGTTCG ATGCTGCTTT CGCAGGGGCA ATTTGCTGCC
TTCCTGAATG CCAAACCCAA AGAACGCGCG GAATTGCTCG AGGAGTTAAC CGGCACTGAA
ATCTACGGGC AAATCTCGGC GATGGTTTTT GAGCAGCACA AATCGGCCCG CACAGAGCTG
GAGAAGCTGC AAGCGCAGGC CAGCGGCGTC ACGTTGCTCA CGCCGGAACA AGTGCAATCG
CTGACAGCGA GTTTGCAGGT ACTTACTGAC GAAGAAAAAC AGTTAATTAC CGCGCAGCAG
CAAGAACAAC AATCGCTAAA CTGGTTAACG CGTCAGGACG AATTGCAGCA AGAAGCCAGC
CGCCGTCAGC AGGCCTTGCA ACAGGCGTTA GCCGAAGAAG AAAAAGCGCA ACCTCAACTG
GCGGCGCTTA GTCTGGCACA ACCGGCACGA AATCTTCGTC CACACTGGGA ACGCATCGCA
GAACACAGCG CGGCGCTGGC GCATATTCGC CAGCAGATTG AAGAAGTAAA TACTCGCTTA
CAGAGCACAA TGGCGCTTCG CGCGAGCATT CGCCACCACG CGGCGAAGCA GTCAGCAGAA
TTACAGCAGC AGCAACAAAG CCTGAATACC TGGTTACAGG AACACGACCG CTTCCGTCAG
TGGAACAACG AACCGGCGGG TTGGCGTGCG CAGTTCTCCC AACAAACCAG CGATCGCGAG
CATCTGCGGC AATGGCAGCA ACAGTTAACC CATGCTGAGC AAAAACTTAA TGCGCTTGCG
GCGATCACGT TGACGTTAAC CGCCGATGAA GTTGCTACCG CCCTGGCGCA ACATGCTGAG
CAACGCCCAC TGCGTCAGCA CCTGGTCGCG CTGCATGGAC AGATTGTTCC CCAACAAAAA
CGTCTGGCGC AGTTACAGGT CGCTATCCAG AATGTCACGC AAGAACAGAC GCAACGTAAC
GCCGCACTTA ACGAAATGCG CCAGCGTTAT AAAGAAAAGA CGCAGCAACT TGCCGATGTG
AAAACCATTT GCGAGCAGGA AGCGCGCATC AAAACGCTGG AAGCTCAACG TGCACAGTTA
CAGGCGGGTC AGCCTTGCCC ACTTTGTGGT TCCACCAGCC ACCCGGCGGT CGAGGCGTAT
CAGGCGCTGG AGCCTGGCGT TAATCAGTCT CGATTACTGG CGCTGGAAAA CGAAGTTAAA
AAGCTCGGTG AAGAAGGTGC GACGCTACGT GGGCAACTGG ACGCCATAAC AAAGCAGCTT
CAGCGTGATG AAAACGAAGC GCAAAGCCTC CGACAAGATG AGCAAGCACT TACTCAACAA
TGGCAAGCCG TCACGGCCAG CCTCAATATC ACCTTGCAGC CACTGGACGA TATTCAACCG
TGGCTGGATG CACAAGATGA GCACGAACGC CAGCTGCGGT TACTCAGCCA ACGGCATGAA
TTACAAGGGC AGATTGCCGC GCATAATCAG CAAATTATCC AGTATCAACA GCAAATTGAA
CAACGCCAGC AACTACTTTT AACGACATTG ACGGGTTATG CACTGACATT GCCACAGGAA
GATGAAGAAG AGAGCTGGTT GGCGACACGT CAGCAAGAAG CGCAGAGCTG GCAGCAACGC
CAGAACGAAT TAACCGCGCT GCAAAACCGT ATTCAGCAGC TGACGCCGAT TCTGGAAACG
TTGCCGCAAA GTGATGAACT CCCGCACTGC GAAGAAACTG TGGTATTGGA AAACTGGCGG
CAGGTACATG AACAATGTCT CGCATTACAC AGCCAGCAGC AGACGTTACA GCAACAGGAT
GTTCTGGCGG CGCAAAGTCT GCAAAAAGCC CAGGCGCAGT TTGACACCGC GCTACAGGCC
AGCGTCTTTG ACGATCAGCA GGCGTTCCTT GCGGCGCTAA TGGATGAACA AACACTAACG
CAGCTGGAAC AGCTCAAGCA GAATCTGGAA AACCAGCGCC GTCAGGCGCA AACTCTGGTC
ACTCAGACAG CAGAAACGCT GGCACAGCAT CAACAACACC GACCTGACGA CGGGTTGGCT
CTCACTGTGA CGGTGGAGCA GATTCAGCAA GAGTTAGCGC AAACTCACCA AAAGTTGCGT
GAAAACACCA CGAGTCAAGG CGAGATTCGC CAGCAGCTGA AGCAGGATGC AGATAACCGT
CAGCAACAAC AAACCTTAAT GCAGCAAATT GCTCAAATGA CGCAGCAGGT TGAGGACTGG
GGATATCTGA ATTCGCTAAT AGGTTCCAAA GAGGGCGATA AATTCCGCAA GTTTGCCCAG
GGGCTGACGC TGGATAATTT AGTCCATCTC GCTAATCAGC AACTTACCCG GCTGCACGGG
CGCTATCTGT TACAGCGCAA AGCCAGCGAG GCGCTGGAAG TCGAGGTTGT TGATACCTGG
CAGGCAGATG CGGTACGCGA TACCCGTACC CTTTCCGGCG GCGAAAGTTT CCTCGTTAGT
CTGGCGCTGG CGCTGGCGCT TTCGGATCTG GTCAGCCATA AAACACGTAT TGACTCGCTG
TTCCTTGATG AAGGTTTTGG CACGCTGGAT AGCGAAACGC TGGATACCGC CCTTGATGCG
CTGGATGCCC TGAACGCCAG TGGCAAAACC ATCGGTGTGA TTAGCCACGT AGAAGCGATG
AAAGAGCGTA TTCCGGTGCA GATCAAAGTG AAAAAGATCA ACGGCCTGGG CTACAGCAAA
CTGGAAAGTA CGTTTGCAGT GAAATAA
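
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any calculation. A rough sketch of recomputing a GC percentage from such text follows; `gc_content` is a hypothetical helper, not part of the source database, and the record's own "GC content" value may have been derived or rounded differently:

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)

# First line of the gene sequence above, pasted with its block spacing.
first_line = "ATGAAAATTC TCAGCCTGCG CCTGAAAAAC CTGAACTCAT TAAAAGGCGA ATGGAAGATT"
print(round(gc_content(first_line), 1))
```

Passing the full sequence text instead of `first_line` gives the figure for the whole gene.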
 
Protein sequence
MKILSLRLKN LNSLKGEWKI DFTREPFASN GLFAITGPTG AGKTTLLDAI CLALYHETPR 
LSNVSQSQND LMTRDTAECL AEVEFEVKGE AYRAFWSQNR ARNQPDGNLQ VPRVELARCA
DGKILADKVK DKLELTATLT GLDYGRFTRS MLLSQGQFAA FLNAKPKERA ELLEELTGTE
IYGQISAMVF EQHKSARTEL EKLQAQASGV TLLTPEQVQS LTASLQVLTD EEKQLITAQQ
QEQQSLNWLT RQDELQQEAS RRQQALQQAL AEEEKAQPQL AALSLAQPAR NLRPHWERIA
EHSAALAHIR QQIEEVNTRL QSTMALRASI RHHAAKQSAE LQQQQQSLNT WLQEHDRFRQ
WNNEPAGWRA QFSQQTSDRE HLRQWQQQLT HAEQKLNALA AITLTLTADE VATALAQHAE
QRPLRQHLVA LHGQIVPQQK RLAQLQVAIQ NVTQEQTQRN AALNEMRQRY KEKTQQLADV
KTICEQEARI KTLEAQRAQL QAGQPCPLCG STSHPAVEAY QALEPGVNQS RLLALENEVK
KLGEEGATLR GQLDAITKQL QRDENEAQSL RQDEQALTQQ WQAVTASLNI TLQPLDDIQP
WLDAQDEHER QLRLLSQRHE LQGQIAAHNQ QIIQYQQQIE QRQQLLLTTL TGYALTLPQE
DEEESWLATR QQEAQSWQQR QNELTALQNR IQQLTPILET LPQSDELPHC EETVVLENWR
QVHEQCLALH SQQQTLQQQD VLAAQSLQKA QAQFDTALQA SVFDDQQAFL AALMDEQTLT
QLEQLKQNLE NQRRQAQTLV TQTAETLAQH QQHRPDDGLA LTVTVEQIQQ ELAQTHQKLR
ENTTSQGEIR QQLKQDADNR QQQQTLMQQI AQMTQQVEDW GYLNSLIGSK EGDKFRKFAQ
GLTLDNLVHL ANQQLTRLHG RYLLQRKASE ALEVEVVDTW QADAVRDTRT LSGGESFLVS
LALALALSDL VSHKTRIDSL FLDEGFGTLD SETLDTALDA LDALNASGKT IGVISHVEAM
KERIPVQIKV KKINGLGYSK LESTFAVK
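
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()   # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence above.
print(translate("ATGAAAATTC TCAGCCTGCG CCTGAAAAAC"))  # MKILSLRLKN
```

The output matches the first ten residues of the protein sequence, and the final codons `GTG AAA TAA` translate to `VK` followed by a stop, matching its last two residues.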