Gene RPC_1389 details


Gene Information

Locus tag: RPC_1389
Symbol: mutL
ID: 3973310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1514339
End bp: 1516138
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 69%
IMG OID: 637924504
Product: DNA mismatch repair protein
Protein accession: YP_531270
Protein GI: 90422900
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
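
The start, end, gene length and protein length values in this table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither assumption is stated in the record, but both agree with the listed numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1514339
END_BP = 1516138
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def residues_from_gene_length(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))              # 1800
print(residues_from_gene_length(GENE_LENGTH_BP))  # 599
```

Both results match the Gene Length and Protein Length fields.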


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.41073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.777612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTTC GCCAATTGCC CGAGACCGTG GTCAACCGCA TCGCCGCCGG CGAGGTGGTG 
GAGCGGCCGG CCAGCGTGGT CAAGGAACTG GTGGAGAACG CGATCGACGC CGGCGCGAGC
CGCATCGATA TTTTCACCGA CGGCGGCGGA AGGCGTCGGA TCGGCATCAC CGATGACGGT
GCCGGCATGA CCCACGCCGA CCTGACATTG GCGGTCGATC GCCACGCCAC CTCGAAGCTC
GACGACGAGG ATCTGCTGGC GATCCGCACG CTGGGATTTC GCGGCGAGGC GCTACCCTCG
ATCGGCTCGG TGGCGAAGCT GTCGATCACC ACGCGGCACG CCGCCGAACC GCACGCCTGG
GCGCTCGCCG TCGAGGGCGG CGCCAAGTCG CCGATCGTGC CGGCGGCGCT GAGCCACGGC
ACCCGCGTCG AGGTCTCCGA CCTGTTCTAT GCGACGCCGG CGCGGCTGAA ATTTCTCAAG
ACCGACCGCA CCGAGGCCGA AGCGATCCGC GAGGTGGTGC GACGGCTGGC GATGGCGCGG
CCGGACATCG CCTTCAGCAT GGCCGGCGAG GAGCGCGCGC CGGTGACCTG GGCGGCGGCG
TTGCCCGGCG CGGCGGGACG GTTGACCCGG CTCGGCGACA TTTTGGGCGG CGATTTTCGC
AGCAACGCCA TCGAGGTGCG TTCGGAACGC GACGGCGTGG TGGTCGAGGG CTTCGCCGCC
GCTCCGTCGC TGACCCGCGC CAACGCGCTC GGGCAATATC TGTTCGTCAA CGGCCGCCCG
GTGCGCGACA AACTGATCAT CGGCGCGGTG CGAGCGGCTT ACTCCGACTA TCTGCCGCGC
GACCGCCATC CGGTGGTGGC GCTGTTCGTC TCCCTCGACA GCCGCGAGGT CGACGCCAAT
GTGCATCCGG CCAAGACCGA GGTGCGGTTT CGCGATGCCG GGTTGGTTCG CGCGCTGATC
GTGCACGCGC TGAAAGAGGG ATTGGCGCGC GAGGGCAAAC GTACCGCCGC CAACGACGCC
GGCGCCACGA TCTCCTCGTT CCGTCCATCG TTCGCGCCGC GCGCCAATTG GGACTGGCGC
AGTTCGCCGT CCTATCCGGT GGCCGGGAGT GCGGCGTTCG ACGCTGCGGC CGGTTTCGCC
GAGCGCGAGC AGTCGGGTTT CGACGTCGGC GCGCCGTCGG CCGATGTGCG CAGCTATCAG
CCGTCCGCCG ATTTCACCGA TCGGCCGCTC GGCGCCGCGC GGACGCAGAT TCACCAGACC
TATATCGTGG CGCAGACCCG CGATGGCCTC GTCGTGGTCG ATCAGCACGC CGCCCATGAG
CGGCTGGTCT ATGAGAAGCT GAAGGCCTCG CTCGCCACCA ACGGCGTGCA GCGGCAGATC
CTGCTGATCC CGGAAATCGT CGAGCTCGAC GAGGCCACGG TGGAGCGCCT GGTCGCGCGC
GGCGAGGAAC TGGCGACGTT TGGCCTGGTG GTGGAATCCT TCGGCCCGGG TGCGGTGGCG
GTGCGCGAGA CGCCGTCGCT GCTCGGCAAG ACCGATGCCG GCGCGCTGCT GCGCGATCTC
GCCGAGCACA TGGCGGAATG GGACGAGGCG CTGCCGCTGG AACGGCGCTT GCTGCACGTC
GCAGCCACCA TGGCCTGCCA CGGCTCGGTG CGCGCCGGCC GGGTGCTGAA GCCGGAGGAA
ATGAACGCGC TGCTCCGCGA AATGGAAGAC ACCCCGAATT CCGGCCAGTG CAACCACGGC
CGCCCGACCT ATGTCGAACT GAAATTGTCG GACATCGAGA AGCTGTTCGG GCGCAGGTAG
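
A GC percentage such as the one reported in the Gene Information section can be computed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper, not part of the source database, and the rounding used for the reported figure is not known.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 6 of its 10 bases are G or C.
print(gc_percent("ATGCCCGTTC"))  # 60.0
```

Passing the full gene sequence as one string (line breaks and spaces included) gives the whole-gene value.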
 
Protein sequence
MPVRQLPETV VNRIAAGEVV ERPASVVKEL VENAIDAGAS RIDIFTDGGG RRRIGITDDG 
AGMTHADLTL AVDRHATSKL DDEDLLAIRT LGFRGEALPS IGSVAKLSIT TRHAAEPHAW
ALAVEGGAKS PIVPAALSHG TRVEVSDLFY ATPARLKFLK TDRTEAEAIR EVVRRLAMAR
PDIAFSMAGE ERAPVTWAAA LPGAAGRLTR LGDILGGDFR SNAIEVRSER DGVVVEGFAA
APSLTRANAL GQYLFVNGRP VRDKLIIGAV RAAYSDYLPR DRHPVVALFV SLDSREVDAN
VHPAKTEVRF RDAGLVRALI VHALKEGLAR EGKRTAANDA GATISSFRPS FAPRANWDWR
SSPSYPVAGS AAFDAAAGFA EREQSGFDVG APSADVRSYQ PSADFTDRPL GAARTQIHQT
YIVAQTRDGL VVVDQHAAHE RLVYEKLKAS LATNGVQRQI LLIPEIVELD EATVERLVAR
GEELATFGLV VESFGPGAVA VRETPSLLGK TDAGALLRDL AEHMAEWDEA LPLERRLLHV
AATMACHGSV RAGRVLKPEE MNALLREMED TPNSGQCNHG RPTYVELKLS DIEKLFGRR
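
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table used by NCBI translation table 11, whose sense-codon assignments are the same as the standard code (table 11 differs in which codons may act as start codons, which this sketch does not model). `translate` is an illustrative helper; only the first and last rows of the gene sequence are used here.

```python
# Codon table in NCBI's TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 60-base rows of the gene sequence (each row is in frame).
FIRST_ROW = "ATGCCCGTTC GCCAATTGCC CGAGACCGTG GTCAACCGCA TCGCCGCCGG CGAGGTGGTG"
LAST_ROW = "CGCCCGACCT ATGTCGAACT GAAATTGTCG GACATCGAGA AGCTGTTCGG GCGCAGGTAG"

print(translate(FIRST_ROW))  # MPVRQLPETVVNRIAAGEVV
print(translate(LAST_ROW))   # RPTYVELKLSDIEKLFGRR
```

The two outputs match the start and end of the protein sequence, and the final codon TAG is the stop codon that accounts for the 599-residue length.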