Gene EcDH1_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1119 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1201017 
End bp1204298 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionACX38793 
Protein GI260448371 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG 
CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT
TATCCCTACG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG
GTGTGGCTGG AAAACGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGGGTG
CATCGCGCGT GGGATAAAGT GAAACAGCGC GATTTTGTTT ATCACAATGA AGTCATTAAA
CCTGCGCTGG TGGGGCTGCT GGGGCCGTGG ATTTCCGGTG GGATTGAGTT TAACTGGCCG
CAACATCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGAAGAC
GGTGCACAGA CGGTGTGGGT AGGCGAAACG GAGCCGATGC ATGGTTTACA GGTGATGACA
GGTTTCACCC TGCGCCCTGA CCGGGCGGCG CTGGAAATCG CCAGCCGCGT CTATAACGGG
AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGGGAAGGG
CATCAGAGCG TCTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC
TCCGCTTTCC CCATCGCTAC CGGCACTTAC TACAAAGTGG ACTACTCCGC TGGAGTGGAC
ATTTCTCGCT ATAAAAATGT GCCTGTTCCA ACCTCATATA TGGCCGAAAA ATCGCAGTAC
GATTTTGTCG GCGCGTGGTG TCACGATGAA GATGGCGGTT TGCTGCACGT TGCCAACCAC
CATATTGCGC CAGGTAAAAA ACAGTGGAGT TGGGGACACA GTGAATTTGG CCAGGCGTGG
GATAAGAGTC TGACCGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC
GATAACCAGC CCGATTTTAC CTGGCTTGAT GCTTACGAAG AGAAGCGTTT TGAGCAGTAT
TTCCTGCCTT ATCATTCTTT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA
CTCCAGCGTA GTAAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA
TACCGCCTGG CGATCCGCGA AATCGGTAAA TGCAACGCGT TACTTGATGA TGCCGTGGCA
CTGATGCCTG CGACCGCCAT CCAGGGCGTG TTGCACGGTA TCAATCCTGA AAGGCTGACC
ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCAGGAACA TCAGCCGCAA
GCGTTGCCGT TACCGGACGT CGCCAAAGCG CCACTGGCAG CACAAGACAT TACCAGTACA
GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG
TTCGATTACT ACCTGCGCGG CGTGGCGCTG GACCCGCTGG ATTATCGCTG TAACCTGGCG
CTGGCGATGC TGGAATATAA CCGCGCAGAT TTCCCGCAAG CGGTGGCGTA TGCCACTCAG
GCTCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC GAGTTTGATT
CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGT
GCGGTCTGGA GCGGCAACAG CAAAGCCGGT GGCTATTATG GCCTGGCACG ACTGGCGGCG
CGTAATGGTA ACTTCGACGC TGGTCTGGAT TTTTGCCAAC AAAGTCTTCG CGCCTGCCCA
ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC
AACGCGCGTG TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG
TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCTCTCG CGCAGTGGCG CGGTCTGTGT
CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCC
ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTACCTG
CAAGCCAGCT TGCTGCCGAA AGCCGAACGT GGCGAACTGG TCGCAAAAGC CATTGATGTC
TTCCCGCAGT TTGTCCGTTT CCCGAATACG CTGGAAGAAG TGGCGGCGCT GGAGAGTATT
GAAGAGTGCT GGTTTGCTCG CCATTTACTA GCTTGCTTCT ACTACAACAA ACGTAGCTAC
AACAAAGCCA TTGCCTTTTG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC
TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT
TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT
TTGCTGGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT
CTGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC
ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GCAAATTCCA CCCGTGGGAA
GGCGGGGAAG GGAAGGTCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG
CATCTTGATG CCAGACAGCC GCAGCAGGCC TGCGAACTGC TTCATGCCGC GCTGCATTAT
CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG
CAGGCGATAT GCGCCAACGC GCAGGGCGAT GAAACTGAAG CGACGCGTTG TTTACGTCTG
GCGGCGACCG GCGATCGCAC TATTAACATT CACAGTTATT ACAACGATCA GCCGGTTGAT
TATCTCTTCT GGCAAGGGAT GGCGCTGCGA CTGCTGGGCG AACAACAAAC CGCACAGCAA
CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT CGAAGCGGAT
TTCTTTGCCG TCTCGCAACC TGACTTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT
AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAA
TATGAGTCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CTTGGCCGAA AGCGGCATTA
TTCACCACCG TGATGCCTTT TATTTTTAAC CGCGTTCACT AA
 
Protein sequence
MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA 
VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP
QHHRPTTFMP VDFTLEAHED GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG
NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD
ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW
DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK
LQRSKRGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LMPATAIQGV LHGINPERLT
IELSDADGNI VLSYQEHQPQ ALPLPDVAKA PLAAQDITST DEAWFIGQHL EQYHHASRSP
FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI
RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGNFDAGLD FCQQSLRACP
TNQEVLCLHN LLLVLSGRQD NARVQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC
QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV
FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY NKAIAFWQRC VEMSPEFADG
WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN
LEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKVTSQF ILNQLLRAWQ
HLDARQPQQA CELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEATRCLRL
AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQTAQQ LFSEMKQWAQ EMAKTSIEAD
FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAQ YESARAELTA INPAWPKAAL
FTTVMPFIFN RVH