Gene EcDH1_3061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3061 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3290165 
End bp3293137 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionACX40689 
Protein GI260450267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA 
TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC
GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAAG CGCTGAAGGC ACAGAAAAAT
AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT
ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG
CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT
CTGGCGGCTA TTCCGGTTGA AGTGAAAAGC GTTACGACAG TTGAAGAACT GCTTGCCCAG
CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC
CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA
TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA
TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA
GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCA
CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG
GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCACTATTT
ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCTAACCCC
GTTCAGGCGT TGGCGAATTA TACGGTACAG TTTGCCGACA ACCGCCAGTA TGTTGTTGGC
GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC
ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC
AAGGCTGAAG CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC
CGCCTGGATC AACTAACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT
TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC TTTAATGGCG
CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT
TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT
GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC
GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT
GCATGGCTTC AGGCCGAACA ACGACAACCG AGCGCCTGGC AACATCGTGC GGTAGCCTAT
CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT
CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT
GGTGCGGCTC GCGATCGCTG GCTACAACAG GCAGAAAAAC GTGGACTGGG AAGCAATGCC
CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA
ATTTATCGCC AACGTCATAA TGTCCCGGCC GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA
CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT
GATATCGCAC AGTCGCGGGA AATGCTCGAA CCGGCGCATA AAGGGCTTCC GGACGATCCG
GCACTGATCC GACAACTGGC CTACGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG
CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC
CCAGAACAAA ATCAACAACG CTTCAATTTC CGCCGTTTGC ATGAGGAGGT CGGTCGCCGC
TGGACGTTCA GTTTCGATTC TTCCATCGGC TTGCGTTCCG GCGCAATGAG TACCGCTAAC
AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGACA ACTGGAAGCC
GAGTACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC
GTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC
ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCATCGCCGT CGAACAGCAG
TTGCCGCTGA ACGGCCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC
TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA
AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT
TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC
GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGAGTCGGG
GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT
CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC
AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
 
Protein sequence
MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN 
NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS
LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA
SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA
LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP
VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN
KAEALRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA
RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA
AAWNRLAKCY RDTLPGVALY AWLQAEQRQP SAWQHRAVAY QAYQVEDYAT ALAAWQKISL
HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEKRGLGSNA LYWWLHAQRY IPGQPELALN
DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG
DIAQSREMLE PAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT
PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA
EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFIAVEQQ
LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD
YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS
LGVEYQHTFK AINQRNGERN NAFLTIGVHW