Gene ECH_0884 details


Gene Information

Locus tag: ECH_0884
Symbol: mutL
ID: 3927913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 906936
End bp: 908957
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 30%
IMG OID: 637902001
Product: DNA mismatch repair protein
Protein accession: YP_507679
Protein GI: 88658331
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
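
The coordinate and length fields above are consistent with each other. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
START_BP = 906936
END_BP = 908957


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Under those assumptions the results agree with the Gene Length and Protein Length fields listed in the record.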


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.277202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATTA TACTATTAGA TCCTAGAACA ATTAATAGAA TTGCTGCAGG GGAAGTAATA 
GAATGTCCAG CTAGTGTGGT TAAAGAATTA GTTGAAAATT CAATAGATGC TAAAGCTACT
GCCATAAGTA TTACAATAGA ACGTGGAGGA CGTAATTTAA TAATTGTTAG TGATAATGGT
ATTGGAATAA AAAAAGAAGA TATGGAAATT GCATTTGCTC GTCATGCAAC GTCTAAGCTT
CCTGATGGTG ATTTAACAAA AGTTAGATCC TTGGGGTTTC GAGGAGAAGG ATTAACTTCT
ATTGCAGCTG TTGGAAAAGT AAAAATGGTT TCAAAATATA GAGATTCTGA TACTGCATGG
TTAATGGTAT TTGAAGGTGG AGAAAAAACA CAAGAATTGA CACCAGATGC ACTTTCTTGT
GGTACTTATA TTGAAGTGAG AGATTTGTTT TTTGCTACAC CTAATAGGTT AAAGTTTCTT
AGGACAGAAA AAGCAGAGGT TCAATCTATT ATTGATATGA TGAACAAACT AGCTATGGTA
AATCATAATG TAATGTTTTC ATTATTTGTT GATAATAAAC AGGTATTTAA ATATTTAACA
CAACAATCAA ATATTGATAG ATTATCTGAA ATAAAAACTT TGGGAATGGA ATTTTGTAAA
AATTCTTTAC CAGTGAATGT AAAAGAAGAG CAGATTCAAT TATCAGGTTA TATTGGATCT
CCTACATTAA GTCGTGGTAA GTCAAGTCTT ATATATACTT TTGTTAATAG TCGACCTGTT
TATGATAATT TACTGATAGG TGCAGTTAGA TATGCTTATA GTGATTTTAT AGAAAAGGAT
AAATATCCAG TTGTTGTATT ATATCTTGAT ATTCCATGTG ATCAAGTTGA CGCTAATGTT
CATCCGAATA AATCTGAGGT AAGATTTCAA GATAAAAAGT TAGTATATAG AACTGTAGTT
AATGCAATTA AAGAAGTGTT ATCGATCAAC CTAAATACTA AATTAAAGTC TATAAGTGAA
TTTGAAAATG ATCATTTTGT ACATGCTAGT ATGGTAAATT CAAGAAACAT AGGTAATAGC
GTTTCTTCCG AGTTTTTTAA ATGTTTTCAA AATAGAAAAC CATTACTTAA CAATGACGTG
CAAAAATATA GTTCTAAAAA TGTAGAAACA GATGACCAAT CTTTGTTAGA TACTAATGTC
TCCTTTTGTA CAGATTCAAA AATGATAACG AATAAATTAA AAGAAGAGAG AGTTTATGAA
AATTCTAGAG AGCATATTAA TAAGGGAGAT TCTAAAATAG AGGTTAGTAA TTTTGATATA
TTAGGAGAGA AAAAAAATTT TGTTAATTTA GCTAATAATC TTCTACAGGA GTCACCTAGT
ATAGATAGTG GTAAGTTTAA TACTAGTAAA AAAGTACCAA GTGATTCATT AATTGATACT
TATCCATTAG GCTATGCTTT ATGTCAAATA CACAGTAGAT ATATCATCTC TCAAACACAG
GATTCTATTG TTATTATTGA TCAGCATGCA GCTCATGAGA GATTAACTTA TGAATATATG
AAACAAGTTA TGGCAAAAGA GGGGATAAAG CGTCAGATAC TATTGATACC TGAGATTATT
GAAATGAATA ACCATCTTGA TTTGGAGTTA CTTGTTGAAT ATAAGGAAAA GTTATTAAAA
CTTGGATTAC TCATTGAACC ACTTGGTAAT TTATCGGTAA TAGTAAGGGA AGTTCCAGCA
CTTTTTGGAA GTTTTGATGT TAAATCGCTT ATTATTAATA TAGTTGATAG TATTATGGAA
GTAGGTGATA CTTTATTCTT AGATGATAAG ATTAAGGATA TATGTGGGAC TATAGCATGT
TATAGTTCTA TTAGAAGTGG TAGAAAATTA AAGATTGAAG AAATGAATGC TATTTTAAGG
AATATGGAAA ATACTGCACA TTCTGGACAA TGTAATCATG GTAGGCCAAC TTATGTAGAG
CTAAATTTAG TTGAGATAGA TAGGCTTTTT TCAAGAAGAT AG
 
Protein sequence
MSIILLDPRT INRIAAGEVI ECPASVVKEL VENSIDAKAT AISITIERGG RNLIIVSDNG 
IGIKKEDMEI AFARHATSKL PDGDLTKVRS LGFRGEGLTS IAAVGKVKMV SKYRDSDTAW
LMVFEGGEKT QELTPDALSC GTYIEVRDLF FATPNRLKFL RTEKAEVQSI IDMMNKLAMV
NHNVMFSLFV DNKQVFKYLT QQSNIDRLSE IKTLGMEFCK NSLPVNVKEE QIQLSGYIGS
PTLSRGKSSL IYTFVNSRPV YDNLLIGAVR YAYSDFIEKD KYPVVVLYLD IPCDQVDANV
HPNKSEVRFQ DKKLVYRTVV NAIKEVLSIN LNTKLKSISE FENDHFVHAS MVNSRNIGNS
VSSEFFKCFQ NRKPLLNNDV QKYSSKNVET DDQSLLDTNV SFCTDSKMIT NKLKEERVYE
NSREHINKGD SKIEVSNFDI LGEKKNFVNL ANNLLQESPS IDSGKFNTSK KVPSDSLIDT
YPLGYALCQI HSRYIISQTQ DSIVIIDQHA AHERLTYEYM KQVMAKEGIK RQILLIPEII
EMNNHLDLEL LVEYKEKLLK LGLLIEPLGN LSVIVREVPA LFGSFDVKSL IINIVDSIME
VGDTLFLDDK IKDICGTIAC YSSIRSGRKL KIEEMNAILR NMENTAHSGQ CNHGRPTYVE
LNLVEIDRLF SRR
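
The protein sequence can be checked against the gene sequence by translating it with the genetic code named in the record (translation table 11). The sketch below is a minimal illustration, not the tool that produced this record: the helper names are hypothetical, the amino-acid string is the standard assignment in T, C, A, G codon order, and the input is assumed to be the coding strand with the space-separated 10-base blocks shown above.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (first base varies slowest).
# Translation table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display format."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTCAATTA TACTATTAGA TCCTAGAACA"))  # MSIILLDPRT
```

Passing the full gene sequence to translate should reproduce the protein sequence, and gc_content gives a figure to compare with the GC content field.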