Gene RPB_4176 details


Gene Information

Locus tag: RPB_4176
Symbol: mutL
ID: 3911984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4747341
End bp: 4749134
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 70%
IMG OID: 637886080
Product: DNA mismatch repair protein
Protein accession: YP_487779
Protein GI: 86751283
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
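
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, although both are consistent with the numbers shown. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4747341, 4749134)
print(length, protein_length(length))  # 1794 597
```

Both values match the Gene Length and Protein Length fields of the record.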


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.285632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTCC GCCAGCTTCC CGAAACCATC GTCAACCGCA TCGCCGCCGG CGAGGTGGTG 
GAGCGGCCGG CGAGCGTGGT GAAGGAGCTG GTCGAAAACG CGATCGATGC CGGCGCCAGC
CGCATCGACA TCTTCTCGGA TGGCGGCGGC CGGCGAAAGA TCGTGATCGC CGACGACGGC
AGCGGGATGA CGCGGGCCGA TCTCGCGCTC GCGGTCGATC GCCACGCCAC CTCCAAGCTC
GACGACGAGG ACCTCTTACA GATCCGCACG CTCGGCTTTC GCGGCGAGGC GCTGCCCTCG
ATCGGCGCGG TGGCGCGGCT GACGATCACC ACGCGCCATG CCGGCGAGCC GCATGCCTGG
ACGCTCGGCG TCGAGGGCGG TGACAAATCG CCGATTGCTC CCGCGGCGCT GTCGCAAGGC
ACCCGGGTCG AGGTCGCTGA TCTGTTCTTC GCCACCCCGG CGCGGCTGAA GTTTCTCAAG
ACCGATCGCA CCGAGGCCGA GGCGATCCGC GACGTGGTGC GGCGGCTGGC GATGGCGCGG
CCGGACATCG CCTTCACGCT GGCCGGCGAA GAGCGCGCAC CGGTGACCTG GGCGGCGGCG
CTGCCCGGCG CGCCGGGGCA ATTGATCCGG CTCGGCGACA TTCTCGGCGC GGACTTTCGC
GCCAATGCCA TCGAGGTCCG CTCGGAACGC GAAGGCGTCG CGGTCGAGGG CTTCGCCGCG
TCGCCGGCGC TGACCCGCGC CAATGCGCTC GGGCAATATC TGTTCGTCAA CGGCCGGCCG
GTGCGCGACA AGCTGATCCT CGGCGCTGTG CGCGCGGCCT ATTCGGATTA CTTGCCGCGC
GATCGCCATC CGGTGGTGGC GCTGTTCGTC ACGCTGGAGT CGCGCGAGGT CGACGCCAAT
GTGCATCCGG CCAAGACCGA GGTGCGCTTC CGCAATGCCG GCCTGGTGCG TGCGCTGATC
GTCCACGCGC TGAAAGAAGG GCTGGCGCGC GAGGGCCGCC GCACCGCCGC CAACAGCGCC
GGCAGCGTGA TCTCGACCTT CCGTCCCGCC TCGATGCCGG CTGCGAATTG GGACTGGCGT
GCCTCGCCGT CTTATCCGGT CGGCGGCTCC GCGATCGATG CGCCGTCTTT TGCCGAGCGC
CCGCAGGCCG CGTTCGATGT CGGCGGGCCG AGCGCCGACA TCCGCACGCA CGAGGTCGCG
CCGGATCTGC TCGACCGCCC GCTCGGCGCG GCGCGGACGC AGATCCACGA GACCTACATC
GTGTCGCAGA CCCGCGACGG GCTGATCGTT GTGGACCAGC ACGCCGCCCA CGAGCGCATC
GTCTACGAGC GCTTGAAGGC GTCGCTCGCC GCCAACGGCG TGCAGCGCCA GATCCTGCTG
ATCCCGGATA TCGTTGAGAT GGACGAGGCG ACCGTCGAAC GCCTGGTGGC GCGCGCCGAC
GAGCTGGCGC AATTCGGCCT CGTCGTCGAG AGCTTTGGCC CCGGCGCGGT CGCGGTGCGC
GAGACGCCGT CGCTGCTCGG CAAGACCGAT GCGGCGTCGC TGCTGCGCGA TCTCGCCGAG
CATATGGCCG AATGGGACGA GGCGCTGCCG CTGGAGCGCC GCCTGATGCA CGTCGCCGCG
ACGATGGCCT GCCACGGCTC GGTGCGGGCC GGGCGCGTGC TCAAGCCCGA AGAGATGAAC
GCGCTGCTGC GCGAAATGGA AGCGACGCCG AATTCCGGCC AATGCAATCA CGGCCGCCCG
ACCTATGTCG AACTGACGTT GACCGATATC GAGAAGCTGT TCGGGCGAAG GTAG
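
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line, so it needs to be unblocked before any calculation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_content` are hypothetical helper names, and how the record itself computes or rounds its GC content field is not stated.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = ("ATGCCCGTCC GCCAGCTTCC CGAAACCATC "
              "GTCAACCGCA TCGCCGCCGG CGAGGTGGTG")
print(len(clean_sequence(first_line)))  # 60
```

Passing the full blocked gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.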
 
Protein sequence
MPVRQLPETI VNRIAAGEVV ERPASVVKEL VENAIDAGAS RIDIFSDGGG RRKIVIADDG 
SGMTRADLAL AVDRHATSKL DDEDLLQIRT LGFRGEALPS IGAVARLTIT TRHAGEPHAW
TLGVEGGDKS PIAPAALSQG TRVEVADLFF ATPARLKFLK TDRTEAEAIR DVVRRLAMAR
PDIAFTLAGE ERAPVTWAAA LPGAPGQLIR LGDILGADFR ANAIEVRSER EGVAVEGFAA
SPALTRANAL GQYLFVNGRP VRDKLILGAV RAAYSDYLPR DRHPVVALFV TLESREVDAN
VHPAKTEVRF RNAGLVRALI VHALKEGLAR EGRRTAANSA GSVISTFRPA SMPAANWDWR
ASPSYPVGGS AIDAPSFAER PQAAFDVGGP SADIRTHEVA PDLLDRPLGA ARTQIHETYI
VSQTRDGLIV VDQHAAHERI VYERLKASLA ANGVQRQILL IPDIVEMDEA TVERLVARAD
ELAQFGLVVE SFGPGAVAVR ETPSLLGKTD AASLLRDLAE HMAEWDEALP LERRLMHVAA
TMACHGSVRA GRVLKPEEMN ALLREMEATP NSGQCNHGRP TYVELTLTDI EKLFGRR
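
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11, which is the table this record lists. The sketch below builds the codon table from the compact amino acid string NCBI publishes for table 11, with bases ordered T, C, A, G. The `translate` function is a hypothetical helper and a simplification: it translates every codon literally, so an alternative start codon such as GTG would come out as V instead of M. That does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for table 11 in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO_ACIDS,
))


def translate(blocked: str) -> str:
    """Translate a (possibly blocked) DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCCCGTCC GCCAGCTTCC CGAAACCATC"))  # MPVRQLPETI
print(translate("TTCGGGCGAAG GTAG"))                  # FGRR
```

The two outputs match the first ten and last four residues of the protein sequence shown above.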