Gene EcDH1_0955 details


Gene Information

Locus tag: EcDH1_0955
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1024109
End bp: 1026670
Gene Length: 2562 bp
Protein Length: 853 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: DNA mismatch repair protein MutS
Protein accession: ACX38638
Protein GI: 260448216
COG category:
COG ID:
TIGRFAM ID:
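
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1024109, 1026670)
print(length)                  # 2562
print(protein_length(length))  # 853
```

Both values agree with the Gene Length and Protein Length fields of this record.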


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCAA TAGAAAATTT CGACGCCCAT ACGCCCATGA TGCAGCAGTA TCTCAGGCTG 
AAAGCCCAGC ATCCCGAGAT CCTGCTGTTT TACCGGATGG GTGATTTTTA TGAACTGTTT
TATGACGACG CAAAACGCGC GTCGCAACTG CTGGATATTT CACTGACCAA ACGCGGTGCT
TCGGCGGGAG AGCCGATCCC GATGGCGGGG ATTCCCTACC ATGCGGTGGA AAACTATCTC
GCCAAACTGG TGAATCAGGG AGAGTCCGTT GCCATCTGCG AACAAATTGG CGATCCGGCG
ACCAGCAAAG GTCCGGTTGA GCGCAAAGTT GTGCGTATCG TTACGCCAGG CACCATCAGC
GATGAAGCCC TGTTGCAGGA GCGTCAGGAC AACCTGCTGG CGGCTATCTG GCAGGACAGC
AAAGGTTTCG GCTACGCGAC GCTGGATATC AGTTCCGGGC GTTTTCGCCT GAGCGAACCG
GCTGACCGCG AAACGATGGC GGCAGAACTG CAACGCACTA ATCCTGCGGA ACTGCTGTAT
GCAGAAGATT TTGCTGAAAT GTCGTTAATT GAAGGCCGTC GCGGCCTGCG CCGTCGCCCG
CTGTGGGAGT TTGAAATCGA CACCGCGCGC CAGCAGTTGA ATCTGCAATT TGGGACCCGC
GATCTGGTCG GTTTTGGCGT CGAGAACGCG CCGCGCGGAC TTTGTGCTGC CGGTTGTCTG
TTGCAGTATG CGAAAGATAC CCAACGTACG ACTCTGCCGC ATATTCGTTC CATCACCATG
GAACGTGAGC AGGACAGCAT CATTATGGAT GCCGCGACGC GTCGTAATCT GGAAATCACC
CAGAACCTGG CGGGTGGTGC GGAAAATACG CTGGCTTCTG TGCTCGACTG CACCGTCACG
CCGATGGGCA GCCGTATGCT GAAACGCTGG CTGCATATGC CAGTGCGCGA TACCCGCGTG
TTGCTTGAGC GCCAGCAAAC TATTGGCGCA TTGCAGGATT TCACCGCCGG GCTACAGCCG
GTACTGCGTC AGGTCGGCGA CCTGGAACGT ATTCTGGCAC GTCTGGCTTT ACGAACTGCT
CGCCCACGCG ATCTGGCCCG TATGCGCCAC GCTTTCCAGC AACTGCCGGA GCTGCGTGCG
CAGTTAGAAA CTGTCGATAG TGCACCGGTA CAGGCGCTAC GTGAGAAGAT GGGCGAGTTT
GCCGAGCTGC GCGATCTGCT GGAGCGAGCA ATCATCGACA CACCGCCGGT GCTGGTACGC
GACGGTGGTG TTATCGCATC GGGCTATAAC GAAGAGCTGG ATGAGTGGCG CGCGCTGGCT
GACGGCGCGA CCGATTATCT GGAGCGTCTG GAAGTCCGCG AGCGTGAACG TACCGGCCTG
GACACGCTGA AAGTTGGCTT TAATGCGGTG CACGGCTACT ACATTCAAAT CAGCCGTGGG
CAAAGCCATC TGGCACCCAT CAACTACATG CGTCGCCAGA CGCTGAAAAA CGCCGAGCGC
TACATCATTC CAGAGCTAAA AGAGTACGAA GATAAAGTTC TCACCTCAAA AGGCAAAGCA
CTGGCACTGG AAAAACAGCT TTATGAAGAG CTGTTCGACC TGCTGTTGCC GCATCTGGAA
GCGTTGCAAC AGAGCGCGAG CGCGCTGGCG GAACTCGACG TGCTGGTTAA CCTGGCGGAA
CGGGCCTATA CCCTGAACTA CACCTGCCCG ACCTTCATTG ATAAACCGGG CATTCGCATT
ACCGAAGGTC GCCATCCGGT AGTTGAACAA GTACTGAATG AGCCATTTAT CGCCAACCCG
CTGAATCTGT CGCCGCAGCG CCGCATGTTG ATCATCACCG GTCCGAACAT GGGCGGTAAA
AGTACCTATA TGCGCCAGAC CGCACTGATT GCGCTGATGG CCTACATCGG CAGCTATGTA
CCGGCACAAA AAGTCGAGAT TGGACCTATC GATCGCATCT TTACCCGCGT AGGCGCGGCA
GATGACCTGG CGTCCGGGCG CTCAACCTTT ATGGTGGAGA TGACTGAAAC CGCCAATATT
TTACATAACG CCACCGAATA CAGTCTGGTG TTAATGGATG AGATCGGGCG TGGAACGTCC
ACCTACGATG GTCTGTCGCT GGCGTGGGCG TGCGCGGAAA ATCTGGCGAA TAAGATTAAG
GCATTGACGT TATTTGCTAC CCACTATTTC GAGCTGACCC AGTTACCGGA GAAAATGGAA
GGCGTCGCTA ACGTGCATCT CGATGCACTG GAGCACGGCG ACACCATTGC CTTTATGCAC
AGCGTGCAGG ATGGCGCGGC GAGCAAAAGC TACGGCCTGG CGGTTGCAGC TCTGGCAGGC
GTGCCAAAAG AGGTTATTAA GCGCGCACGG CAAAAGCTGC GTGAGCTGGA AAGCATTTCG
CCGAACGCCG CCGCTACGCA AGTGGATGGT ACGCAAATGT CTTTGCTGTC AGTACCAGAA
GAAACTTCGC CTGCGGTCGA AGCTCTGGAA AATCTTGATC CGGATTCACT CACCCCGCGT
CAGGCGCTGG AGTGGATTTA TCGCTTGAAG AGCCTGGTGT AA
 
Protein sequence
MSAIENFDAH TPMMQQYLRL KAQHPEILLF YRMGDFYELF YDDAKRASQL LDISLTKRGA 
SAGEPIPMAG IPYHAVENYL AKLVNQGESV AICEQIGDPA TSKGPVERKV VRIVTPGTIS
DEALLQERQD NLLAAIWQDS KGFGYATLDI SSGRFRLSEP ADRETMAAEL QRTNPAELLY
AEDFAEMSLI EGRRGLRRRP LWEFEIDTAR QQLNLQFGTR DLVGFGVENA PRGLCAAGCL
LQYAKDTQRT TLPHIRSITM EREQDSIIMD AATRRNLEIT QNLAGGAENT LASVLDCTVT
PMGSRMLKRW LHMPVRDTRV LLERQQTIGA LQDFTAGLQP VLRQVGDLER ILARLALRTA
RPRDLARMRH AFQQLPELRA QLETVDSAPV QALREKMGEF AELRDLLERA IIDTPPVLVR
DGGVIASGYN EELDEWRALA DGATDYLERL EVRERERTGL DTLKVGFNAV HGYYIQISRG
QSHLAPINYM RRQTLKNAER YIIPELKEYE DKVLTSKGKA LALEKQLYEE LFDLLLPHLE
ALQQSASALA ELDVLVNLAE RAYTLNYTCP TFIDKPGIRI TEGRHPVVEQ VLNEPFIANP
LNLSPQRRML IITGPNMGGK STYMRQTALI ALMAYIGSYV PAQKVEIGPI DRIFTRVGAA
DDLASGRSTF MVEMTETANI LHNATEYSLV LMDEIGRGTS TYDGLSLAWA CAENLANKIK
ALTLFATHYF ELTQLPEKME GVANVHLDAL EHGDTIAFMH SVQDGAASKS YGLAVAALAG
VPKEVIKRAR QKLRELESIS PNAAATQVDG TQMSLLSVPE ETSPAVEALE NLDPDSLTPR
QALEWIYRLK SLV
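
The sequences above are laid out in blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its codons translated. It assumes the amino acid assignments of translation table 11 (identical to the standard code; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon order used by the NCBI genetic code tables: T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = (
    "ATGAGTGCAA TAGAAAATTT CGACGCCCAT "
    "ACGCCCATGA TGCAGCAGTA TCTCAGGCTG"
)
print(translate(first_line))  # MSAIENFDAHTPMMQQYLRL
```

The translated prefix matches the first 20 residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.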