Gene EcDH1_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0115 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp117447 
End bp121580 
Gene Length4134 bp 
Protein Length1377 aa 
Translation table11 
GC content60% 
IMG OID 
ProductYD repeat protein 
Protein accessionACX37810 
Protein GI260447388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGAA AACCGGCAGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT 
CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC
GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT
GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT
TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG
GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC
CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTGTGG
CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG
GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG
GGGCCGTGGT GGCTGCTCGG TTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT
GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG
ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT
GCCTGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG
CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC
ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG
GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCCCGCGGC
GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT
AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT
TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT
CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGG ACCGCCGTGA AGTGCTGCAC
ACGCAGGGCG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC
ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG
ACAACAGAGT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATAAC CACGCCGGAT
GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAACCAGT TAACGTCAGC CACCGGGCCT
GACGGGCTGG AATTGCGCCG GGAATATGAT GAATTGGGCC GTCTGATTCA GGAAACTGCC
CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA
ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG
AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG
ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG
TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CATCGCCGGT
GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG
GGAAAGGCCG TCCGTACCAC GCAGGGCGGG CTAACGCGCA GTATGGAATA CGATGCTGCC
GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA
CTTGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC
CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGTCTGG TCACCCACTG GCACTATGAC
GAAGCAGACC GCCTCACGCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCG GTGGCAGTAT
GACGAACGTG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGGCGGTG
CATTACAGGT ATGATGAGAA AGGCCGGCTG ACCGGTGAGC GTCAGACGGT GCATCACCCG
CAGACGGAAG CACTGCTCTG GCAGCATGAG ACCAGACATG CGTACAACGC GCAGGGGCTG
GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA CGGCAGCGGT
TACCTGGCAG GCATGAAACT CGGCGACACA CCGCTGGTGG AGTACACCCG CGACCGCCTG
CACCGGGAAA CGCTGCGCAG CTTCGGCCGT TATGAACTCA CCACCGCTTA TACCCCTGCC
GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCTGTCTG ACCGCGATTA CACCTGGAAC
GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC
ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG
TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC
CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC
CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT
GATGAGCGGA CTCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG
ACACAATATG AAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG
GTGGCAAAAC GGGTGTGGCG GCGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG
AAACCGCAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACAAT ACAGAACGAC
AGAACCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGAGTTGAA
ACCGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC GCTTCAGCAG
TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC
CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA
TCGTGCGGCC TGACCGTGGA GCAGATGCAA AACCAGATGG ACCCGGTGTA CACGCCGGCG
CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCCCT TATCAGCAAG
GAAGGGACAA CAGAATGGTG CGCAGAATAC GATGAATGGG GCAACCTGCT GAATGAAGAG
AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC
GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGGTA TATCACTCAG
GATCCGATTG GGCTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCAGTTACG
AATACAGATC CTCTGGGGTT AGAAGTTTTT CCTAGACCAT TCCCCTTGCC AATTCCATGG
CCCAAAAGCC CTGCACAGCA GCAAGCAGAT GATAATGCTG CAAAAGCATT GACAAAATGG
TGGAACGATA CAGCATCACA AAGAATATTT GACTCTCTAA TATTGAATAA TCCGGGACTA
GCATTAGATA TAACAATGAT AGCTTCTCGT GGAAATGTTG CAGACACAGG GATAACTGAT
CGTGTCAATG ACATAATAAA TGACAGATTC TGGAGTGATG GGAAAAAACC CGACAGATGT
GACGTACTTC AGGAACTAAT TGATTGTGGT GATATTAGTG CTAAAGATGC AAAAAGCACA
CAGAAAGCCT GGAATTGTCG TCACTCCAGA CAGTCAAACG ATAAAAAAAG ATAG
 
Protein sequence
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG 
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG
AWRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV
TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HNQLTSATGP
DGLELRREYD ELGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL
SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG
DLTAVIAPDG SRNGTQYDAW GKAVRTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV
LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV KGETAERWQY
DERGWLTDIS HISEGHRVAV HYRYDEKGRL TGERQTVHHP QTEALLWQHE TRHAYNAQGL
ANRCIPDSLP AVEWLTYGSG YLAGMKLGDT PLVEYTRDRL HRETLRSFGR YELTTAYTPA
GQLQSQHLNS LLSDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP
YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD
DERTHRYHYD SQHRLVHYTR TQYEEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR
KPQVTWYGWD GDRLTTIQND RTRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ
SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA
RKIHLYHCDH RGLPLALISK EGTTEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES
GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPVT NTDPLGLEVF PRPFPLPIPW
PKSPAQQQAD DNAAKALTKW WNDTASQRIF DSLILNNPGL ALDITMIASR GNVADTGITD
RVNDIINDRF WSDGKKPDRC DVLQELIDCG DISAKDAKST QKAWNCRHSR QSNDKKR