Gene Paes_1599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1599 
Symbol 
ID6459669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1742634 
End bp1745261 
Gene Length2628 bp 
Protein Length875 aa 
Translation table11 
GC content52% 
IMG OID642725587 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002016264 
Protein GI194334404 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGAA AACCCGGCAT TAAACGCTCT GGGGAAGCAA CTCCCATGAT GCGTCAGTAT 
CTGGAAGTCA AGGAACGCTA TCCTGATTTT CTTCTGCTGT TTCGTGTGGG TGATTTTTAC
GAGTCTTTTT ACGACGATGC CCGCCAGGTG TCTGAAGCAC TCAATATCGT TCTGACCCGG
CGTTCAAACG GCGCGGCTTC CGATATTGCC ATGGCCGGGT TTCCTCATCA TTCCTGTGAG
GGTTACATCG CTAAACTGGT GAGGAAAGGG TTCAAGGTTG CCGTTTGCGA TCAGGTCGAA
GATCCTTCCG AGGCCAAAGG CATTGTGAGG CGGGAGATTA CCGATATCGT TACGCCCGGC
GTCACCTACA GCGACAAGAT TCTTGACGAC AGGCACAATA ATTATCTCTC TGCGATCAGT
TTTCTGAAAA AAGGACGCCA ACAGCGTGCT GGAATTGCTT TTATCGATGT GACGACAGCC
GAGTTCAGGG TTGCCGATGT CGAAGTTGAC CAGCTCAAGG ATATGCTCCA ATCCGTTCAG
CCGGCAGAGG TTCTTCTCTC TTCAAGGAAT CGCGAGTACG CTGATGAGAT CAAAAAAATA
CTTCCTCCGG GAGTACTGGT TTCGCTTCAG GATGACTGGA TGTTCAGCCA GGAGAATGCT
GAGCAGATAC TGCTGCGCCA TTTCAAAACC CATTCACTCA AAGGATTCGG CATCGAACAT
GCCTATGCGG CCAGGATAGC GGCAGGGGTC ATTCTGCATT ATCTTGAGGA GACCCAGCAG
AACAAACTGC AGTATATTAC CCGCATTGCG ACCGTTGACA GCCATGAGTA TATGACTCTG
GATCTTCAGA CCAGGCGCAA CCTCGAGATT ATTTTCTCGA TGCATGACGG ATCGATGAAC
GGAAGTCTTC TGCATGTGAT CGACCGGACA CGATGCCCTA TGGGGGCCAG GCAGCTCCGA
CGGTGGCTGC TGCATCCGCT GAAGCAGATG GCGCCCATAT TGCAACGCCA TGATGCCGTT
GATGAACTCT CCCGACATCC CGATGTGCGT CGTGAGCTCG GAGAGGTGAT CGGGAGTATT
CATGATCTTG AGCGTGCGTT ATCGCGGATC GCAACCCTGC GCTCTATGCC GAGAGAGGTC
AGAATGCTCG GCTCTGCCCT GGAGCAGCTT CCCCGTCTGC AGGGCCTTTT CGATCAATCG
GAAAGTTCAC GGCTCTGTTT TCTTTCCAGG AGGCTTTCGA TGCTTCCCGA GCTTGCCCGG
AGAATCGATG AGGCCATTGA TCCGGATGCA GGAGCGACGA TGCGCGATGG AGGCTACATT
CGTGAGGGCT ATAATGCTGA GCTTGATTCA TTGCGTTCGC TCTCTTCAAC GGCAAAGGAG
CGTCTTCTGG AAATTCAGCA GCAGGAGCGA GCCGCAACCA CTATTTCCAC GCTCAAGGTC
CAGTATAATA AGGTCTTTGG ATACTATATC GAGGTGAGCC GGGCTAACAG TGACAAGGTC
CCCGCCTATT ATGAAAAAAA ACAGACCCTG GTCAATGCCG AGCGATATAC GATTCCTGCA
CTGAAAGAAT ATGAGGAAAC CATTCTTACT GCGGAGGAAA AAAGTCTTTC GCTCGAACAG
CGACTTTTCA GGGAGCTCTG TCAGGCAATA GCCTCAGAAG CCGGCCAGAT ACAAACGAAC
GCTGAAAGTA TAGCGGAGCT TGATTGCCTC TGTTCTTTCG CGGTGAATGC CGATGAGTAT
CGCTATTGTA AACCCGTTAT GGTTGAAGAA CCGGTGCTTC GCATACAGAA CGGACGTCAT
CCGGTCCTGG AACGCATTGT CGATGTTGAC GAGCCTTATG TCTCAAATGA TTGTCTGTTT
GACGAGCGAC AGCGCATGCT TATCGTTACG GGTCCGAATA TGGCAGGCAA AAGCTCGTAT
CTTCGCCAGA TCGGCCTCAT TTCCCTGCTT GCCCAGGTGG GGAGTTTCGT TCCTGCCGAC
GAGGCTGAAA TAGGCCTTGT CGATCGGATC TTTACCCGTG TCGGAGCATC GGACAATCTT
GCTTCCGGTG AGAGTACCTT TATGGTCGAG ATGAATGAGG CGGCCAGTAT CCTCAACAAT
GCGACCAGGA GCAGTCTCAT TCTGCTTGAC GAGGTCGGCA GGGGAACGAG TACCTATGAC
GGTATGTCGA TTGCCTGGGC GATGAGTGAG TACATTCATT CCGCTATAGG TGCAAGGACA
CTTTTTGCAA CGCATTACCA CGAGCTTGCC GAACTCGAGG AGCGATTGGA TGGCGTTTTC
AATTATAATG CCACGGTGAC CGAGACAGCT GATCGGGTTG TTTTTCTGCG CAAAATTGTC
AGGGGCGCGT CCGATAACAG CTATGGAATC GAAGTGGCCA GAATGGCAGG AATGCCTTCC
GGGGTGATTC AGCGAGCCAA AGAGATTCTT TCGGGTATGG AGGGCCGGGA AATAGAGATG
CCCGATCGTT TGAAAGGTTG CGGGCGAACC AATCAGATCT ATCTGTTCGA GGAGGAGGAG
CGTGCGCTTA AACGTGCGGT TCAGAGTATA GATATCAATA GTCTTACCCC TATCGAGGCC
ATGATGGAAC TCAAAAGACT TCAGGATCTT GCCGCCGGGG GGGGTTAA
 
Protein sequence
MGRKPGIKRS GEATPMMRQY LEVKERYPDF LLLFRVGDFY ESFYDDARQV SEALNIVLTR 
RSNGAASDIA MAGFPHHSCE GYIAKLVRKG FKVAVCDQVE DPSEAKGIVR REITDIVTPG
VTYSDKILDD RHNNYLSAIS FLKKGRQQRA GIAFIDVTTA EFRVADVEVD QLKDMLQSVQ
PAEVLLSSRN REYADEIKKI LPPGVLVSLQ DDWMFSQENA EQILLRHFKT HSLKGFGIEH
AYAARIAAGV ILHYLEETQQ NKLQYITRIA TVDSHEYMTL DLQTRRNLEI IFSMHDGSMN
GSLLHVIDRT RCPMGARQLR RWLLHPLKQM APILQRHDAV DELSRHPDVR RELGEVIGSI
HDLERALSRI ATLRSMPREV RMLGSALEQL PRLQGLFDQS ESSRLCFLSR RLSMLPELAR
RIDEAIDPDA GATMRDGGYI REGYNAELDS LRSLSSTAKE RLLEIQQQER AATTISTLKV
QYNKVFGYYI EVSRANSDKV PAYYEKKQTL VNAERYTIPA LKEYEETILT AEEKSLSLEQ
RLFRELCQAI ASEAGQIQTN AESIAELDCL CSFAVNADEY RYCKPVMVEE PVLRIQNGRH
PVLERIVDVD EPYVSNDCLF DERQRMLIVT GPNMAGKSSY LRQIGLISLL AQVGSFVPAD
EAEIGLVDRI FTRVGASDNL ASGESTFMVE MNEAASILNN ATRSSLILLD EVGRGTSTYD
GMSIAWAMSE YIHSAIGART LFATHYHELA ELEERLDGVF NYNATVTETA DRVVFLRKIV
RGASDNSYGI EVARMAGMPS GVIQRAKEIL SGMEGREIEM PDRLKGCGRT NQIYLFEEEE
RALKRAVQSI DINSLTPIEA MMELKRLQDL AAGGG