Gene Acid345_3219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3219 
Symbol 
ID4070431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3810624 
End bp3812567 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content60% 
IMG OID637985240 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_592294 
Protein GI94970246 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.428383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.251683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCA TCCACGTTCT CTCCGAACAT GTCGCCAACA AAATCGCCGC CGGCGAGGTG 
GTCGAACGCC CCGCGTCGGT AGTGAAGGAG TTGATCGAGA ACTCGCTCGA TGCCGGCGCC
AAGCGCATTC GCGTGCACGT AGAAGCTGGT GGCAAGAAAC TGATCCACAT TGTGGACGAC
GGCATCGGCA TGTTTCGCGA CGACGCTATG CTTGCGTTCG AACGTCACGC GACTTCGAAA
CTGAAAAATC CCGAGGACCT GCTCAGCATC TCGACACTCG GCTTCCGCGG TGAAGCACTC
CCGTCCATCG CATCGGTTGC GCGCGTCCGC CTTGAAACTC GCGCCAACGA AGAACCAAGC
GGCACCGTCC TCGAGATCGC CGGCGGCAAG ATTCTGAAGA TCGAAGAAGC CGGCCTACCC
CTCGGTACCT CCATCGCAAT AAAAGACCTC TTTTTCAATA CACCCGCCCG CAAGAAATTT
CTGAAAAGCG AATCCACCGA GCTCTCCCAC ATCGCGTCGC TGGTCACCCA TTACGCGCTC
GCGCATCCCG AGATGCATTG GGAGCTGCAC TCCGCGACCA ACGCACTGCT CATTGCTCCG
CCAGTCGCCA CGCAGAGCGA ACGCATCTAC CAGGTCTTCG GCAACGAGAC GCTCGACCAA
CTCATCCCGC TCGCCGCGCA AATCAAACTC GAACGCATCG GGCTGCCTAA GCCACCGCCG
TGGCTACGCA AAAACGAAGA CGACGAGGAA GAACAAACAG TCGAGCCCGG TGAAGTTCGA
CTCCACGGAT TCATCTCGAA GCCCGAAATC CAGAAGCTCA ATCGCAACTC GATCTTCGTC
TTCGTCAACG GCCGTCTCAT TCGCGACCGC CTCGTCCAGC ACGCACTCAC CGAGGCCTAT
CGCAACATCC TGCCACCCAC GCTCTTCCCT GTCGTTCTCC TCTTTCTCGA AATGCCCTAC
ACGGAAGTCG ACGTCAACGT GCATCCGTCG AAGACCGAAG TCCGCTTCCG CCAGCAGTCG
CTCGTCCATG ACTTCGTGCG CGACTCGGTC CGCGCGGCAT TGTCGAAAGC GCGCCCAATC
CCACAGTTCA TTTCCGAGAT TCACGCCCAA CCGAAGGCTT CGCCGTCACT TACACCCGGC
GCACAGACCG CGCCTGCCTT CGCGTTGGAA GCGCAGGAAG AACCCGTGCT GCCCGAGCGC
CTTCAATTCG GTGGCGACGC CATCTCCGTC GAGGCCAACG CAGCTGTGCC CGTCGCCCGC
TTCGGTGCGC AGACCTTCGG TTCTCACGTC GCGCAGCAAC AAACGGTTCC CGAGACCAGT
GGCTGCGACT ACGAACTCCC CGATCTACCG GCCGCTGACG CTCCGCTCGC ATCGCTCAGG
CCCCTCGGCC AGATTCGCGA ATCCTTCATT CTTGCTACGA GCAACGAAGG CCTCTGGATC
ATCGACCAGC ACGTCGCCCA CGAGCGCGTA CTCTTCGAGA AGGTCCTAAA GCAACGAGCA
GCCGCCTCGG TTGAAACCCA GCAATTACTC ATGCCTCTGA TCGTCGAACT CACTCCCGGC
CAGCAAGCCG TCTTCACCGA GATCGCCGAA GAGCTTCACC AGAACGGCTT CGAAGTCGAA
CCCTTTGGCT CAAGGACTTT CGCCGTCAAA GCAGCACCCG CAGGAATCCG CGCCGAAGAT
ATCGAAAAGA CCCTGAGCGA AGTCCTCGAT TCCTTCGAAC GCGAGCAGCA GGCCTTCAAC
CTTGAGCACG CGCAAAGCCG CATCGCCGCC ACCATTGCCT GCCACGCAGC CATCAAGGTC
AACATGCCGC TCACGCAGGA TAAAATGGAG TGGCTCCTCG CCGAGCTCGC CAAAACCGAG
CACCCCATGA CTTGTCCGCA CGGACGCCCC ATCGTTCTTC GCTATTCTGT CAAGGACATT
CAGAAGGCCT TTAAGCGCAT CTGA
 
Protein sequence
MGRIHVLSEH VANKIAAGEV VERPASVVKE LIENSLDAGA KRIRVHVEAG GKKLIHIVDD 
GIGMFRDDAM LAFERHATSK LKNPEDLLSI STLGFRGEAL PSIASVARVR LETRANEEPS
GTVLEIAGGK ILKIEEAGLP LGTSIAIKDL FFNTPARKKF LKSESTELSH IASLVTHYAL
AHPEMHWELH SATNALLIAP PVATQSERIY QVFGNETLDQ LIPLAAQIKL ERIGLPKPPP
WLRKNEDDEE EQTVEPGEVR LHGFISKPEI QKLNRNSIFV FVNGRLIRDR LVQHALTEAY
RNILPPTLFP VVLLFLEMPY TEVDVNVHPS KTEVRFRQQS LVHDFVRDSV RAALSKARPI
PQFISEIHAQ PKASPSLTPG AQTAPAFALE AQEEPVLPER LQFGGDAISV EANAAVPVAR
FGAQTFGSHV AQQQTVPETS GCDYELPDLP AADAPLASLR PLGQIRESFI LATSNEGLWI
IDQHVAHERV LFEKVLKQRA AASVETQQLL MPLIVELTPG QQAVFTEIAE ELHQNGFEVE
PFGSRTFAVK AAPAGIRAED IEKTLSEVLD SFEREQQAFN LEHAQSRIAA TIACHAAIKV
NMPLTQDKME WLLAELAKTE HPMTCPHGRP IVLRYSVKDI QKAFKRI