Gene Oter_4044 details


Gene Information

Locus tag: Oter_4044
Symbol:
ID: 6204835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Opitutus terrae PB90-1
Kingdom: Bacteria
Replicon accession: NC_010571
Strand:
Start bp: 5244531
End bp: 5246417
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 68%
IMG OID: 641693712
Product: DNA mismatch repair protein MutL
Protein accession: YP_001820918
Protein GI: 182415852
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
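
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes a single terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5244531, 5246417)
print(length)                              # 1887
print(expected_protein_length_aa(length))  # 628
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.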


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0816003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.310643
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAG TCCGGATCCT GTCCGATCGC GTCGCGAACC AGATCGCCGC GGGCGAAGTC 
ATCGAACGTC CCGCCGCGGT CGTGAAGGAA CTCGTGGAAA ACGCGCTCGA CGCCGGCGCG
ACGCGGATCG AGGTGGAATT CCGGCACGGC GGCCGGTCGT TGATGCGCGT CGAGGACAAC
GGCTCCGGCA TGTCGCGCGA CGACGCGCTG CTCGCGCTCG AACGGCATGC GACCAGCAAG
ATCAGCGAGG CCGCCGATCT CGACCGGCTG GGGAGCTACG GCTTCCGCGG CGAGGCGCTG
CCGTCGATCG CGAGCGTGTC GCGATTCGAG CTGCAGACGC GCGAGGCCGG CCAGAACGTG
GGCACCGAGG TGCTGGTGAG CGGCGGCAAG CTGGTGCACG TGCGCGACTG TGGTCGGCCC
GTCGGCACGC GGATCGAGGT GGCACAGCTG TTCAACTCGG TGCCCGCGCG GCGGAAATTT
CTCAAGAGCG ACCAGACCGA GGCGGCCCAT ATCGTGCAAT GCGTGCGGCT GTATGCCCTG
GCCTGTCCGG GAACGGCTTT CTCTCTCATC GAGGACGGAC GCGTGATTTT CCGCTCGCCG
GAATGCCCTA CACTCGCAGA GCGGATTGCG GAAATTTTCG GTCGGCAGAC CGCCGAGTCG
CTCGTGCCGA TCGAATCGGT GGAATCCGGC ATGCGGCTCG GCGGGCTGAT CGGCCGGCCA
GGTGTGGGCC GCGGCACGCG GCATGAGATG ATCGTGTTCG TGAACCAGCG GCCAGTCGAC
AGCCGGACGC TGAACTATGC GCTGATCGAG AGCTATTACG AGTCCGTGCC GAAGGGGCGC
TATCCGCTGG CGTTCGTGTT TTTCGAGTGT GATCCCGCGG CGGTCGACGT GAACGTGCAT
CCGGCAAAGC GCGAGGTGCG GTTCCGCAAC GAGCCCGCGG TCCGCAGCTT TGTGATCCGG
TCGGTGCTGC AGCGGCTGAG GGAGATTGCC GACCATCGAT CCGACTTCGC CCAGCCTTCG
GCGGACAACA TGCCCAAGCC CGAGTCGCCA GGCGCGCCGG CGGCGCACGG GCGGAAGGAC
GACGCGCCTG CAGCGCATGC CGAGGGTAGG GCGGCGACTC CGCTCGCCGC CGGGAACCTG
ATTGTAACGG CGCGCTTCGG CGCGGAGTCC ACGCCGTACC TCGAGAAATC CGGGGCGATC
GCGGGTGCGC GGCCGGCGGG GGTGTTGCCG CCCGCGGTGC CGCGAATACC GGCCGCGCCA
ATGCCGGTGA ATGCCGGAGC CGCCGCCGTA CCGGCGCCGC TCAAGCCCGC TTCGCCCTCG
TGGCGGTTTG TCGGACTGGC GCACGGCAAC TACGCGCTGT TCGAGACGAC CGCGGGTCTG
ATCCTGCTGG ATCGCCGGGC GGCGCACGAG CGCGTCTGGT TCGAGCGGCT GCAGGAACAG
TTTCGCTCCG GCGCGGTGCC GAGCCAGCGG CTGCTGCTGC CGGTGCCGGT GGAACTCGAT
CCGATCGCCG CGGCGTTGCT GCTGGACCGA GTGCAGTTTC TCAACGCGCA CGGGTTCGAG
ATCGCGGAGT TTGGCCGAAA TTTTTTCCGC ATCGAGGCGG TGCCGGCGTG GATGGAGCCC
GCGGATGCCG AGCCGTTCCT GCGCGATCTG CTCGGGGCAT TCCGCGAGGG CCACTGGCCC
GATCGCGACG CCAACCTCGC GCGGGAGGAA CTGGCCCGAC TCGCCTCGGT CAAAGCGGTC
CGCCTGCCCG CCGTCACGGG CGAGCAGGAG CTCCGGGCCT TGGTCACGCA CTTGTTCGCC
ACGCGTACGC CCATGACCAA TCCAGCCGGC CGACCGACCT ACATTGAGCT GAATCACGCG
GAGCTGGCGC GGCGGTTCCA AAAATGA
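
The gene sequence above is displayed in space-separated blocks of ten bases. As a minimal sketch, the blocks can be joined into one contiguous string and the GC percentage computed from it, for comparison with the GC content field of the record. The helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = (
    "ATGGCCAAAG TCCGGATCCT GTCCGATCGC "
    "GTCGCGAACC AGATCGCCGC GGGCGAAGTC"
)

seq = join_blocks(first_row)
print(len(seq))                # 60
print(seq.startswith("ATG"))   # True
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should give a value close to the reported GC content, assuming that field is computed over the whole gene and rounded.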
 
Protein sequence
MAKVRILSDR VANQIAAGEV IERPAAVVKE LVENALDAGA TRIEVEFRHG GRSLMRVEDN 
GSGMSRDDAL LALERHATSK ISEAADLDRL GSYGFRGEAL PSIASVSRFE LQTREAGQNV
GTEVLVSGGK LVHVRDCGRP VGTRIEVAQL FNSVPARRKF LKSDQTEAAH IVQCVRLYAL
ACPGTAFSLI EDGRVIFRSP ECPTLAERIA EIFGRQTAES LVPIESVESG MRLGGLIGRP
GVGRGTRHEM IVFVNQRPVD SRTLNYALIE SYYESVPKGR YPLAFVFFEC DPAAVDVNVH
PAKREVRFRN EPAVRSFVIR SVLQRLREIA DHRSDFAQPS ADNMPKPESP GAPAAHGRKD
DAPAAHAEGR AATPLAAGNL IVTARFGAES TPYLEKSGAI AGARPAGVLP PAVPRIPAAP
MPVNAGAAAV PAPLKPASPS WRFVGLAHGN YALFETTAGL ILLDRRAAHE RVWFERLQEQ
FRSGAVPSQR LLLPVPVELD PIAAALLLDR VQFLNAHGFE IAEFGRNFFR IEAVPAWMEP
ADAEPFLRDL LGAFREGHWP DRDANLAREE LARLASVKAV RLPAVTGEQE LRALVTHLFA
TRTPMTNPAG RPTYIELNHA ELARRFQK