Gene Rpal_0513 details

Gene Information

Locus tag: Rpal_0513
Symbol:
ID: 6408162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 558419
End bp: 561142
Gene Length: 2724 bp
Protein Length: 907 aa
Translation table: 11
GC content: 69%
IMG OID: 642710425
Product: DNA mismatch repair protein MutS
Protein accession: YP_001989548
Protein GI: 192288943
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
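
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of the source database.

```python
# Illustrative helpers (hypothetical names, not from the source database).

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(558419, 561142)
print(length)                     # 2724
print(protein_length_aa(length))  # 907
```

Both results match the Gene Length and Protein Length values listed in the record.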


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATCC GCCCCGACAT CGCTCTACCG CCCGACGCCG CTCCGCCTCC GGAGGCGCCC 
GCCAAAATGT CGCCGATGAT GGAGCAGTAC CACGAGATCA AAGCCGCCAA TCCTGGCCTG
TTGCTGTTCT ACCGGATGGG CGATTTCTAC GAGCTGTTCT TCGAGGATGC CGAAATCGCG
TCACGCGCGC TCGGTATTAC CCTGACCAAG CGCGGCAAGC ATCTCGGCGC CGACATTCCG
ATGTGCGGTG TGCCGGTCGA GCGCTCCGAC GACTACCTGC ACCGGCTGAT CGCGCTGGGT
CACCGCGTCG CTGTGTGCGA GCAGACCGAA GACCCGGCCG CGGCGCGCGC CCGCAAGAGC
GTGGTGCGGC GCGACGTGGT GCGGCTGATC ACGCCCGGTA CGCTGACCGA AGATACCCTG
CTCGACGCCC GCGCCAACAA CTACCTGCTG GCGATCGCGC GCGCCCGTGG CTCGGCCGGC
GCCGATCGCA TCGGGCTCGC CTGGATCGAC ATCTCGACTG GCGAATTCTG CGTCACCGAG
TGCACGACCG CAGAACTCGC CGCGACGCTG GCGCGGATCA ATCCGAACGA AGCCATCGTG
CCGGACGCGC TGTACAGCGA CACAGAACTC GCCCCGACCT TGCGCGAGCT CGCCGCCGTC
ACGCCGCTGA CGCGTGACGT GTTCGATTCC GCCACCGCCG AGCGGCGGCT GTGCGATTAC
TTCGCTGTCG CCACCATGGA CGGCCTCGCC GCGCTGTCAC GGCTGGAAGC GACCGCCGCC
GCGGCCTGCG TCACCTATGT CGACCGTACC CAGCTCGGCA AACGGCCGCC GCTGTCGCCG
CCGTCACGCG AAGCCGCCGG CACCACGATG GCGATCGACC CGGCGACCCG CGCCAATCTC
GAACTCACCC GCACGCTCGC CGGCGAACGC CGCGGCTCGC TGCTCGACGC GATCGACTGC
ACGGTTACAG CCGCAGGATC TCGCCTCTTG GCGCAGCGGC TCGCCGCGCC GCTGACCGAT
GCGGCGGCGA TCGCGCGGCG GCTCGACGCG GTCGAAGCCT TCACCGGGGA TGCGGGACTT
CGCGAACAGA TCCGCAGCTC GCTGCGTGCG GCGCCCGACA TGGCGCGTGC ACTGGCGCGG
CTGTCGCTCG GCCGCGGCGG CCCGCGGGAT CTCGCGAACT TGCGCGATGG CATCCGCGCT
GCCGACGAGG TGATTGCGCA GCTCGGCCAG CTCGCAAGCC CGCCGCAGGA GATCGCGAGC
GCGATGGCGG CGCTGCAGCG GCCGTCACGC GCATTGTGCG CCGAGCTCGG CCGCGCGCTC
GCCGACGATC TGCCGCTTCT CAAGCGCGAC GGCGGCTTCG TGCGCGAAGG CTACGAGCCG
GCGCTCGACG AGACCCGCAA GCTGCGCGAC GCCTCGCGGC TGGTGGTGGC GTCGATGCAG
GCGCGCTACG CCGACGACAC CGGGATCAAG GCGCTGAAGA TCCGGCACAA CAACGTGCTC
GGTTACTTCG TCGAGGTCTC GGCGCAGCAC GGCGACAAGT TGATGGCGCC GCCACTGAAC
GCCACTTTCA TCCATCGCCA GACGCTGGCC GGGCAGGTGC GCTTCACCAC CGCCGAACTC
GGCGAGATCG AGGCCAAGAT CGCCAATGCG GGCGACCGCG CACTCGGGCT GGAGCTGGAG
ATCTTCGACC GCCTCGCCGC GATGATCGAT GCGGCCGGTG AAGACCTGCG CGCCGCCGCC
CATGCGTTCG CGCTGCTCGA TGTCGCCACC GCGCTCGCCA AGCTCGCCAG CGACGACAAC
TACGTGCGGC CCGAGGTCGA CGAGTCGCTG AGCTTTGCGA TCGAAGGCGG CAGGCATCCG
GTGGTCGAGC AGGCGCTGAA GAAGGCTGGC GAGCCGTTCA TCGCCAATGC CTGCGACCTG
TCGCCCGGCC CGGCGCAGAC CAACGGCCAG ATCTGGCTGC TGACCGGCCC GAACATGGCC
GGTAAGTCGA CCTTCCTGCG CCAGAACGCG CTGATCGCCC TGCTCGCCCA GGTCGGCAGC
TTCGTGCCGG CGATCCGGGC ACGGATCGGC ATCGTCGACC GGCTGTTCTC GCGCGTCGGC
GCCGCCGACG ACCTCGCCCG CGGCCGTTCG ACCTTCATGG TCGAGATGGT CGAGACCGCC
GCGATCCTGA ACCAGGCCTC CGAACGGGCG CTGGTGATCC TCGACGAGAT CGGCCGCGGC
ACCGCGACGT TCGACGGCCT CTCGATCGCC TGGGCGGCGA TCGAGCACCT GCACGAACAG
AACAGGTGTC GTTCGCTGTT CGCCACGCAC TACCATGAAC TGACCGCACT GTCGGCCAAG
CTGCCGCGGC TGTTCAACGC CACCGTGCGG GTCAAGGAAT GGCGCGGCGA GGTGGTGTTT
CTGCACGAGG TGCTGCCGGG CTCCGCCGAC CGCTCCTACG GCATTCAGGT CGCCAAGCTG
GCGGGGCTCC CCCCCAGCGT GGTGAGCCGC GCGAAGGCCG TGCTGGCCAA GCTCGAAGCC
AACGACCGCG GTCAGCCGAA GACGCTGATC GACGACCTGC CGCTGTTCGC CATCACGGCT
CGCGCACCCG CCGAAGCCGC CCCACCGAGC GAGGCCGAGC AGCTGATCGA CGCGGTCAAG
GCGCTGCATC CCGACGAGAT GACCCCGCGC GAGGCGCTGG ATGCGTTGTA CGCCCTGAAG
GCGAAGCTAC CGAAGGCCGA CTGA
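
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be normalized before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text. The function names are hypothetical, and the percentage stored in the record may have been rounded differently.

```python
# Illustrative helpers (hypothetical names, not from the source database).

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


print(clean_sequence("ATGACCATCC GCCCCGACAT"))  # ATGACCATCCGCCCCGACAT
print(gc_fraction("GGCC AATT"))                 # 0.5
```

Passing the full gene sequence text to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.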
 
Protein sequence
MTIRPDIALP PDAAPPPEAP AKMSPMMEQY HEIKAANPGL LLFYRMGDFY ELFFEDAEIA 
SRALGITLTK RGKHLGADIP MCGVPVERSD DYLHRLIALG HRVAVCEQTE DPAAARARKS
VVRRDVVRLI TPGTLTEDTL LDARANNYLL AIARARGSAG ADRIGLAWID ISTGEFCVTE
CTTAELAATL ARINPNEAIV PDALYSDTEL APTLRELAAV TPLTRDVFDS ATAERRLCDY
FAVATMDGLA ALSRLEATAA AACVTYVDRT QLGKRPPLSP PSREAAGTTM AIDPATRANL
ELTRTLAGER RGSLLDAIDC TVTAAGSRLL AQRLAAPLTD AAAIARRLDA VEAFTGDAGL
REQIRSSLRA APDMARALAR LSLGRGGPRD LANLRDGIRA ADEVIAQLGQ LASPPQEIAS
AMAALQRPSR ALCAELGRAL ADDLPLLKRD GGFVREGYEP ALDETRKLRD ASRLVVASMQ
ARYADDTGIK ALKIRHNNVL GYFVEVSAQH GDKLMAPPLN ATFIHRQTLA GQVRFTTAEL
GEIEAKIANA GDRALGLELE IFDRLAAMID AAGEDLRAAA HAFALLDVAT ALAKLASDDN
YVRPEVDESL SFAIEGGRHP VVEQALKKAG EPFIANACDL SPGPAQTNGQ IWLLTGPNMA
GKSTFLRQNA LIALLAQVGS FVPAIRARIG IVDRLFSRVG AADDLARGRS TFMVEMVETA
AILNQASERA LVILDEIGRG TATFDGLSIA WAAIEHLHEQ NRCRSLFATH YHELTALSAK
LPRLFNATVR VKEWRGEVVF LHEVLPGSAD RSYGIQVAKL AGLPPSVVSR AKAVLAKLEA
NDRGQPKTLI DDLPLFAITA RAPAEAAPPS EAEQLIDAVK ALHPDEMTPR EALDALYALK
AKLPKAD
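
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch of that relationship, the code below builds a codon table and translates block-formatted DNA up to the first stop codon. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). It does not handle alternative start codons, which is not an issue here because this gene starts with ATG. All names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGACCATCC GCCCCGACAT"))       # MTIRPD
print(translate("GCGAAGCTAC CGAAGGCCGA CTGA"))  # AKLPKAD
```

The two outputs match the first six and last seven residues of the protein sequence shown above.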