Gene Paes_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1689 
Symbol 
ID6459893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1842019 
End bp1844400 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content53% 
IMG OID642725677 
ProductSmr protein/MutS2 
Protein accessionYP_002016354 
Protein GI194334494 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGTT TTACGAAAAA GAGACTTGAG TTTGATAGAA TAGTAGACTG CGTTTCGCAG 
TTGTGTTTGT CGGATATGGG GCGCGATGAA CTTTCCGCTG CTTCGCCGCT CCTTTGCAGA
GAGGCTCTTG TCAGAGAGCT CGAACGAGTG ATGGAGCTGA AAAACTTTCT TCTCGAAGGT
CAGCCGCTGC CCTTTGCTGC GCTTCCTGAT ACCAGATCGC TTGTGCGGAA GCTGGAAACG
CTTGATACCT ATCTCGAACC GGAGGAGTTA CTTGATATCC ATGATCTTCT GCAGGCGTCG
GTTTCTCTCA GAACGTTCAT GTACCGCAAT CGAACGCTCT ATCCGGCGGT CAATGAGTTT
ACGGTGCAGC TCTGGATGGA AAAGTCTCTT CAGTTCGAGA TCAAAGGGAT TGTTGATGAA
GAGGCCAGGG TGCGTGATAC TGCCAGCGAC GGCTTGCTTA TGATCCGGCG TGAGCTGCGG
GAGGGTCGTC AAACTTTGCG ACGGAAGATG GAGCGTCTTC TGCGTCGTTG TCAGGAGAAC
AACTGGCTGA TGGAGGATAC CGTGGCTATG AAGAACGGTC GTCTGGTCCT CGGGCTCAGG
GTGGAGTATA AGTATAAGCT TCCGGGCTAT ATCCAGGACT ATTCGCAAAG CGGCCAGACG
GTGTTTGTTG AACCGGCTGA AACGCTTGAG CTGAACAATC GTCTGCAGGA TCTTGAGCTT
GATGAGCGGC GTGAGGTTGA GCGGATTCTC AGGGAAATGT CATCAAGGAT CCGGGAGGAG
CTGGATAATG TGCAGCATAA CCAGTCGGTG ATGGCTGTAT TTGATTCGAT CTACGCAAGG
GCGCGTTTTG CTGTTGATAC GGCCGGAGTC ATGCCGGCGC TTGATGACGG GCGGCGATTG
AAGATTGTTC GTGGTTTTCA TCCATGGCTT CTGGTGACGC ACCGGGCTCG CGGTGAAGAG
GTGTTTCCTC TCGATATGGA GCTGGATGGT GATGAGCAGG TTCTCGTGAT TTCCGGGCCG
AATGCAGGAG GAAAGTCGGT GGGGATGAAA ACGGTTGGTC TGCTCTGTTG TATGCTGCAT
CACGGCTATC TTGTTCCCTG TAGCGAGAGT TCTGTGTTTC CTCTTTTCGA TGATATTTTT
ATAGGGATCG GTGACGAACA GTCGATTGAA AACGATCTTT CAACCTTCAG TTCGCATCTT
GAGCAAATTC GTCTGATTCT CGCTTCGGCA TCGGCAAGAA GTCTTGTGCT CATCGATGAA
CTCTGCTCGG GTACCGATGT CGAGGAGGGG AGCGCAATTG CCCGGGCGGT GATCGAGGAG
CTGCTTCTTA AGGAGTGCAA GGCGGTGATT ACGACCCATC TTGGCGAATT GAAGGTCTAT
GCTCATGAGC GGAAGGGCGC CGTTAACGGC GCTATGGAGT TTGATCGTGC GACGTTGAGT
CCGTCGTTTC GTTTTCTCAA GGGTCTTCCC GGTAACAGCT TTGCTTTTGC TATGATGCAG
CGCCTTGGGT TTGACCGGGG GGTTATCGAC CGGGCTAACG GTTTTCTTTG CCGCAAAAGC
GCCGGGTTTG AACTGATGCT TGATGATCTG AAAACGGTTC TGGAGGAGAA TCGTTTGCTT
CGTGAGCGTC TTGGTGAGGA GCGCAGGCAG CTTGAGTGTC GGGAGCAGCA GATCGCTCTT
TCTGAAGCTG CGTTGCTGCG CCGTCAGAAA GAGATGAAAG CGACTGTTTC GCGGGAGGTG
CAGAAGGAGG TTGAGCATGC TCGTAAAATG ATTCGCGATA TTGTCCGGGA AGCTAAAGCC
TCACCTGATG CGCACGCTGT CGAATCTGCA AGGCAAAAAC TTCATGCGCG CAAGAAACAG
GCCGAGGCTG AGGAGGTGAA GGCTGTTGCA TCATTGGAGG AGCATGTTGA TGAAGATCGT
ACGATCAGGC CGGGTGATAT GGTTCGTGTG ATGACGACCA ACACGACAGG TGAGGTGGTA
TCGGTTGACC GCGACGATGT GGTGGTGCAG TGCGGAACGT TCAGGCTGTC GACGTCGCTG
AAACATGTAG AGAAGACCTC GAAAACCCAG GCGAAAAAAC TTGAGCGAGA GGTTCGGGGT
TCAAAAGCGA AAGGGTGGAG TTCCCGTTCA TCGGTTCTTG AATCGACGCG TCTTGATCTT
CGAGGCCTCA ATGGCGATGA GGCGATTGTG GAAATTGACC GGTTTATCGA TAAACTTCGT
CTGAACAGGG TGTCGAGCGC TGTGATCATT CATGGTAAAG GAACCGGCGC CCTGCGGATG
CGTATCGCGG ATTTTCTGAA AACCCACAAT CATGTGGAGC ATTTTCGTCT TGGGGAGTTG
TCTGAAGGGG GAAGCGGCGT GACGGTCATC GATGTCTTGT AA
 
Protein sequence
MDGFTKKRLE FDRIVDCVSQ LCLSDMGRDE LSAASPLLCR EALVRELERV MELKNFLLEG 
QPLPFAALPD TRSLVRKLET LDTYLEPEEL LDIHDLLQAS VSLRTFMYRN RTLYPAVNEF
TVQLWMEKSL QFEIKGIVDE EARVRDTASD GLLMIRRELR EGRQTLRRKM ERLLRRCQEN
NWLMEDTVAM KNGRLVLGLR VEYKYKLPGY IQDYSQSGQT VFVEPAETLE LNNRLQDLEL
DERREVERIL REMSSRIREE LDNVQHNQSV MAVFDSIYAR ARFAVDTAGV MPALDDGRRL
KIVRGFHPWL LVTHRARGEE VFPLDMELDG DEQVLVISGP NAGGKSVGMK TVGLLCCMLH
HGYLVPCSES SVFPLFDDIF IGIGDEQSIE NDLSTFSSHL EQIRLILASA SARSLVLIDE
LCSGTDVEEG SAIARAVIEE LLLKECKAVI TTHLGELKVY AHERKGAVNG AMEFDRATLS
PSFRFLKGLP GNSFAFAMMQ RLGFDRGVID RANGFLCRKS AGFELMLDDL KTVLEENRLL
RERLGEERRQ LECREQQIAL SEAALLRRQK EMKATVSREV QKEVEHARKM IRDIVREAKA
SPDAHAVESA RQKLHARKKQ AEAEEVKAVA SLEEHVDEDR TIRPGDMVRV MTTNTTGEVV
SVDRDDVVVQ CGTFRLSTSL KHVEKTSKTQ AKKLEREVRG SKAKGWSSRS SVLESTRLDL
RGLNGDEAIV EIDRFIDKLR LNRVSSAVII HGKGTGALRM RIADFLKTHN HVEHFRLGEL
SEGGSGVTVI DVL