Gene RPD_4032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4032 
SymbolmutL 
ID4024549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4480602 
End bp4482395 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content70% 
IMG OID637964235 
ProductDNA mismatch repair protein 
Protein accessionYP_571152 
Protein GI91978493 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.733731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTTC GCCAGCTTCC TGAAACCATC GTCAACCGCA TCGCCGCCGG CGAAGTGGTC 
GAGCGGCCGG CGAGCGTGGT CAAGGAGCTG GTCGAGAACG CGATCGACGC CGGTGCTTGT
CGCGTCGACG TTTTCAGCGA CGGCGGCGGC CGGCGGAAGA TCGTGATCGC CGACGACGGC
GGCGGCATGA CGCAGGCGGA TCTCGCGCTC GCGGTCGATC GCCATGCCAC CTCCAAGCTC
GACGACGAGG ATCTGCTGGC GATCCGCACG CTGGGGTTTC GCGGCGAGGC GCTGCCCTCG
ATCGGCGCGG TCGCGCGGCT CAGCCTCACC ACCCGCCACG CCGCCGAACC GCACGCCTGG
ACGCTCAGCG TCGAGGGCGG CGCCAAATCG CCGATCTCGC CGGCCGCGCT GTCGCAGGGC
ACGCGCGTCG AGGTCGCCGA CCTGTTTTTC GCCACGCCGG CGCGGCTGAA ATTCCTCAAG
ACCGACCGCA CCGAGGCAGA GGCGATCCGC GAAGTGGTGC GTCGCCTGGC GATGGCGCGG
CCGGACATCG CCTTCACGCT CGCGGGCGAA GAGCGCGCGC CGGTGACCTG GGCGGCGGCG
CTGCCCGGCG CGCCCGGCCG GCTGACACGG CTCGGCGACA TCCTCGGCGC GGATTTTCGC
GCCAATGCGA TCGAGGTCGG TTCGGAGCGC GAAGGCGTCG CGGTCGAGGG CTTCGCCGCG
TCACCGTCGC TGACCCGCGC CAATGCGCTC GGCCAATATC TGTTCGTCAA CGGCCGCCCG
GTGCGCGACA AGCTGATCCT GGGCGCGGTG CGCGCGGCCT ATGCCGACTA TCTGCCGCGC
GACCGCCATC CGGTGGTGGC GCTGTTCGTC ACGCTGGATT CGCGCGAAGT CGACGCCAAT
GTGCATCCGG CGAAAACCGA AGTGCGGTTC CGCAACGCCG GTCTCGTCCG CGCGCTGATC
GTCCACGCGT TGAAGGAGGG GCTAGCGCGC GAGGGCCGCC GCACGGCCGC CAACAGCGCC
GGCGCGGCGA TCTCGAATTT TCGTCCGGCG TCGATGCCGC CCGGCAATTG GGACTGGCGC
AGCTCGCCGT CTTATCCGGT CGGTGGCGGA TCGAGTGCAG CGCCGTCCTT CGGCGAGCGT
CCGCAGGCCG CGTTCGACGT CGGCGGGCCG AGCGCCGACA TCCGACCGCA CGAGGCCGCG
CCGGAGCTGC TCGACCGGCC GCTCGGCGCG GCGCGGACCC AGATCCACGA GACCTACATC
GTGTCGCAGA CCCGCGACGG GCTGATTGTG GTGGATCAGC ACGCCGCGCA TGAGCGGATC
GTCTATGAAC GGCTGAAAGC GTCGCTCGAC GCCAACGGCG TGCAGCGGCA GATCCTGCTG
ATCCCGGACA TCGTCGAGAT GGACGAGGCG ACGGTCGAGC GACTGGTCGC GCGCGCCGAG
GAGCTTTCGA AGTTCGGCCT CGTGGTCGAG AGCTTCGGCC CCGGCGCGGT GGCGGTGCGC
GAGACGCCGT CGCTGCTCGG CAAGGTCAAT GCGGCGTCGC TTTTGCGCGA CCTCGCAGAG
CACATGGCGG AGTGGGACGA GGCGCTGCCG CTGGAGCGGC GGCTGATGCA TGTCGCCGCC
ACCATGGCCT GCCACGGCTC GGTCCGCGCC GGCCGCGTGC TCAAGCCCGA GGAGATGAAC
GCGCTTTTGC GCGAGATGGA AGCCACCCCG AATTCCGGCC AGTGCAACCA CGGCCGTCCG
ACCTATGTCG AACTGACGCT GGCGGACATC GAGAAGCTGT TCGGGCGCAG ATAG
 
Protein sequence
MPVRQLPETI VNRIAAGEVV ERPASVVKEL VENAIDAGAC RVDVFSDGGG RRKIVIADDG 
GGMTQADLAL AVDRHATSKL DDEDLLAIRT LGFRGEALPS IGAVARLSLT TRHAAEPHAW
TLSVEGGAKS PISPAALSQG TRVEVADLFF ATPARLKFLK TDRTEAEAIR EVVRRLAMAR
PDIAFTLAGE ERAPVTWAAA LPGAPGRLTR LGDILGADFR ANAIEVGSER EGVAVEGFAA
SPSLTRANAL GQYLFVNGRP VRDKLILGAV RAAYADYLPR DRHPVVALFV TLDSREVDAN
VHPAKTEVRF RNAGLVRALI VHALKEGLAR EGRRTAANSA GAAISNFRPA SMPPGNWDWR
SSPSYPVGGG SSAAPSFGER PQAAFDVGGP SADIRPHEAA PELLDRPLGA ARTQIHETYI
VSQTRDGLIV VDQHAAHERI VYERLKASLD ANGVQRQILL IPDIVEMDEA TVERLVARAE
ELSKFGLVVE SFGPGAVAVR ETPSLLGKVN AASLLRDLAE HMAEWDEALP LERRLMHVAA
TMACHGSVRA GRVLKPEEMN ALLREMEATP NSGQCNHGRP TYVELTLADI EKLFGRR