Gene Emin_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0214 
Symbol 
ID6263201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp228921 
End bp231476 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content42% 
IMG OID642610677 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001875113 
Protein GI187250631 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.250019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAT CCAAACAGCT TACCCCTTTA ATGAGACAGT ATAATGACAT TAAAAGTAAA 
AACACCGACT TAATAGTGTT TTTCCGTTTG GGGGATTTTT ACGAAATGTT TGATTCCGAC
GCGCGCGAAG CAAGCCGTAT TTTAGGTATA GCCTTAACAC AGAGAGGCGG CGTGCCTATG
TGCGGGGTAC CCTACCACGC GGCGGCCAAT TATATAAGCA AAATTATTAA AGAAGGTAGA
AAAGTAGGCA TTTGCGAGCA GGTAGGCACG GGGGAAGGAA AAAGCAAACT TTTTGAACGT
AAAATAGTGC GTGTTATAAC GCCGGGCACT GTTATTGAGG ATAATATGCT TCAAAGCAAT
GTAAGCAATT ACCTCGTTTG TGTTTTTACT GATAAAAAAG GCTGGGGAGC CGCCGCTATA
GACGTATCCA CGGGTGAGTT TTGGGTAACC GAACAGGCGG ACGATATTTC TTTAAACGGA
CTTTCCTGCA TGCTGGCTGC TTTAAATCCT GCGGAAATAC TTTTAGACAA AATTACTTTA
GACAGAGTAA AATCATCAAT GATGATACCG GGCAGCGTAG CCACAACGAT AGTGCCCAGG
GAAGAATCTT CCCAAATCCC TTCCAACTGG CCTTCGCAAA GCGTATGGGC AGGCAAAAAA
ACAGCCTTAA CATGCGCTCT TACGGCAATA AAATACATTA ATGTTAACGA GCCGGGTTTT
AAAGATTTGC TTATTCCTTT TTACAAAGAA ATTTCTGATT ATCTTGCGCT TGATGAAAAC
GCCGTACGTA CGCTTGAACT TGTAAGCCAA AACGGCGCGC GCAAAGGCAG TCTTTGGCAT
TTATTGGATT TTACTGTAAC TCCTATGGGT GGCAGAACGC TTAAAAATTG GATTTTAAAC
CCGCTGTTAA ACCTTGCAGA TATTGAAAAA AGGCAAAATT GCGTAAGCAA TTTTTATGAC
AACCCTTTGG CCTATGAGGA ACTTAAAGTT ATTCTAGCCG ATATCAGTGA CATTGAACGC
ATTATGAGCA GGGTAGGCAC AGGTAACGCG GGGCCCAGGG ATTTGGCGGG GCTTGCGCGT
TCTCTTGCGG TGCACGGGGC TTTAAAAAAC TGGTTTGACA AATACGGGGC TGTGGTTCCT
TATTTAAAAG AAAACATTTT ATCTAAAATC ACGGTTATTG AAGATTTAGC TAACCTTTTA
AACTCCGCTA TAGAACCTAA CCCGCCCATT AAAATATCCG ACGGCGGTAT AATAAAACAA
GGTTTTAATG CGGAGCTTGA CGATTTAAGA AACACAAAAA ACAACAGCAG CAAAACCCTC
GCCGAACTTT GCGAGCGTGA AAAAGCGAAA ACCGGCATAA GCACTTTAAA AGTGGGGTTT
AACTCGGTAT TCGGTTATTA TATTGAAGTA AGTAAAGGCC AGTCCGGCAA AGTGCCTTTC
AGTTATACCA GAAAACAAAC TTTAACCAAC GCCGAACGTT TTATTACCGA AGAACTTAAA
GAAATTGAAG ATAAAATTTT ACACGCGGAA GAAAAGATTT TGCGTTTGGA AACAAGCCTT
TTTGACAGCG TGCGTAAACA TTTGGCCGAA CATATCGGCG TTATGCGCTC GTTTGCAAAA
GCGATAGCTG AGTTAGACGT TTACTCTAAC CTGGCGCACT GCGCCAAAGT TTATAAATTT
ACAAAACCTG TTATTGATGA AAGCAATATT TTAAAAGCCG CTGATTTAAG ACACCCCGTT
GTTGAGGCCT GTTTACCTCT TGGAAGTTTT GTTCCTAATG ATATCGACCT TGGGGGCGAA
ACGCAGATAT CCGTTATAAC GGGGCCCAAT ATGGGCGGTA AAAGCGTTTA TTTAAAACAG
GCGGCGGTGT TAGTTATACT CGCGCAGATG GGTAGCTTTG TGCCCGCGGC GTCAGCGCAT
GTGGGTATTG TTGATAAAAT TATGACCCGT ATAGGCGCGC AGGACGCTAT TGCCATGGGG
CAAAGCACGT TTATGGTTGA AATGAAGGAA ACGTCGCACA TTTTAGCTTC CTGTACGCCA
AAAAGTTTAA TTCTGTTAGA TGAAGTCGGC AGAGGCACAA GCACTTTTGA CGGCATTTCA
ATAGCGTGGG CCATAACAGA ATTTTTATAT AAACCGCACG GCGGCGGCGC CAAAGTGCTT
TTTGCCACGC ATTATTTTGA ACTTACGGAT TTGGAAAATA AATATAAAGG CATAAAAAAC
TTTCACGCCG AAGTACAGGA ATATAAAGAC GCGGACGGGC AAAGTAAAAT AGCTTTCCTT
TATAAAATTA AAGAAGGCGC GGGGGATAAA TCCTACGGCA TACATGTCGG GGAGCTTGCG
GGCCTGCCGG CTACTGTTAT CGTGCGCGCT AAAAAAGTTA TTAAAGATTT GGAAGCTAAA
AAAGGAACAA GCGTATCAAA AAAAGAAGAC GATATTGTGG GGGATTTATT TTCCAGCCCG
ATAGTTGAGG AAATTAAGCT TGTTAACACT GACGCTGTTA CTCCGATGCA GGCTTTACAA
ATGATACTTG AGTGGAAAAA AAGAATTAAC AGTTGA
 
Protein sequence
MQESKQLTPL MRQYNDIKSK NTDLIVFFRL GDFYEMFDSD AREASRILGI ALTQRGGVPM 
CGVPYHAAAN YISKIIKEGR KVGICEQVGT GEGKSKLFER KIVRVITPGT VIEDNMLQSN
VSNYLVCVFT DKKGWGAAAI DVSTGEFWVT EQADDISLNG LSCMLAALNP AEILLDKITL
DRVKSSMMIP GSVATTIVPR EESSQIPSNW PSQSVWAGKK TALTCALTAI KYINVNEPGF
KDLLIPFYKE ISDYLALDEN AVRTLELVSQ NGARKGSLWH LLDFTVTPMG GRTLKNWILN
PLLNLADIEK RQNCVSNFYD NPLAYEELKV ILADISDIER IMSRVGTGNA GPRDLAGLAR
SLAVHGALKN WFDKYGAVVP YLKENILSKI TVIEDLANLL NSAIEPNPPI KISDGGIIKQ
GFNAELDDLR NTKNNSSKTL AELCEREKAK TGISTLKVGF NSVFGYYIEV SKGQSGKVPF
SYTRKQTLTN AERFITEELK EIEDKILHAE EKILRLETSL FDSVRKHLAE HIGVMRSFAK
AIAELDVYSN LAHCAKVYKF TKPVIDESNI LKAADLRHPV VEACLPLGSF VPNDIDLGGE
TQISVITGPN MGGKSVYLKQ AAVLVILAQM GSFVPAASAH VGIVDKIMTR IGAQDAIAMG
QSTFMVEMKE TSHILASCTP KSLILLDEVG RGTSTFDGIS IAWAITEFLY KPHGGGAKVL
FATHYFELTD LENKYKGIKN FHAEVQEYKD ADGQSKIAFL YKIKEGAGDK SYGIHVGELA
GLPATVIVRA KKVIKDLEAK KGTSVSKKED DIVGDLFSSP IVEEIKLVNT DAVTPMQALQ
MILEWKKRIN S