Gene Cagg_1710 details

Gene Information

Locus tag: Cagg_1710
Symbol:
ID: 7269416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2089820
End bp: 2092282
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 58%
IMG OID: 643566552
Product: MutS2 family protein
Protein accession: YP_002463047
Protein GI: 219848614
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
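
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2089820, 2092282)
print(length)                           # 2463
print(expected_protein_length(length))  # 820
```

Both results agree with the Gene Length and Protein Length values listed in the record.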


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.261707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATCC CAGAAACATC GCTTCATACA CTTGAATTTG ATGCTGTGCG CGACCGTCTT 
GCACACTATA CCGCGTTCTC GGCGTCGCGT GAATTGGCCT TAAGTCTGAC GCCATCCACC
GATCTGAACG AGGTACGCCG ACGACAAGCC CTCACGGCTG AGGCTCGGCT GTTGCTCGAA
GAATGGCCTG ATCTCACGAT TGGTGGTGCT CGTGATGTGC GCCGATCTGC TCATCATGCG
GCACGCGGTG GGATGCTCGA TGGGACGACA CTCCGTGATA TTGCAGCAAC CCTGCGAAGC
GCTGCCACGT TGCGCCAACG TCTCAGCCGA CTCGATGATC GCTTCCCAAA CCTGCGTGAT
CTCGGATATA CGTTGCCTGC GCTACCACAT CTGATCGATG CTATTGAGCA GGCTATCGGC
GATGATGGGC AAGTGCTCGA TAGTGCTAGC CCGACCTTGG CCCGTCTCCG CCACGAGGTA
CGAGTGGCGT TCAATCGCTT GCAAGAGCGA TTGCAAAGCA TGATCCATTC CCCTACGTTG
GCTGCTGCTC TGCAAGAACC GATTATTACT GTACGCAACG GACGATATGT GATTCCGGTC
AAGGCGATCC ATCGCCGTGA AGTGCGTGGT CTGGTTCACG ATCAGTCGGG TTCGGGAGCA
ACACTCTATA TCGAGCCGCT GGCGATTGTT GAGCTGAATA ACCGTTGGCG TGAATTGCAA
TCGGCCGAGG CGGAAGAAGT GCAACGCATT CTCGGCGCCC TCTCAGAGCA GGTTGGTGAA
GCGGTAAGTG CGATTGTTAG TACCGTCAAT ATGCTGGCCG CGCTTGATCT TGTGTTTGCC
CTCGCCCGGT ACGCCATTGC TACCCGCAGT ACTGCGCCGG AGATCGTTGA TTGGCGACCC
GATGACCCAC CGTCGACTGA GCCGCCGTTG CGCCTGATCC GCGCTCGCCA TCCGCTCTTG
CCGCCTGATA AGGTGGTACC GATTGATCTC TGGTTGGGTG GCACGTTTTC AATCCTGTTG
ATTACCGGGC CAAATACCGG TGGAAAGACG GTTGCGCTCA AGACAACCGG GCTGCTTGCA
TTAATGGCCC AGGCCGGTAT GCAGATTCCA GCCGATCAAC CGTCTCGATT GCCGGTCTTT
CAATACATCT TCGCTGATAT TGGTGACGAG CAGAGCATCG AGCAGAGTCT GAGCACGTTC
TCTTCGCATA TGGCGAACAT TATTCGGGTG TTACAGACGC TGACCGAGGC GCAGTCATTC
CCCGCTGCTC CCACCGATCA GGCATTGTTC GACTATCGGC GACCTGCGGC GTTGGTGCTC
TTCGATGAGT TGGGGGCCGG TACCGATCCG GTTGAGGGAT CGGCGTTGGC ACGGGCAATT
ATCGGGCGTT TGCTGGAATT GGGTGTGTTG GCGGTTGCAA CGACCCATTA CCCTGAGTTG
AAGGCTTTTG CTTACGCAAC ACCGGGGGTT GAGAATGCTT CGGTCGAGTT CGATGTTGAG
ACCCTGGCAC CGACCTATCG GCTGAGTATC GGTGTGCCTG GTCACTCAAA CGCCCTGGCG
ATTGCTGCTC GGCTTGGTCT CGATCCGGCG TTGATCGAAC AGGCACGTTC GTTTATCGAC
CGCAATGAGG CACAGGTCGA AGACCTACTG GCCGGTATCC AGCGGGAACG GGCAGCGGCA
ACGGAGGCTT TGCAGCGGGC CGAAGAGTTG CGTGCTGATG CCGAAAAGTA TCGTGCCCGG
CTGGCCGCCG AGCAGCAAGC CTTTGCCGCC GAACGTGAAG TAGCACTGGC TGCCGCTCGC
CAAGAGATCG AGGCTGAGCT ACGCGAGGTC CGCCAGCAGT TGCGTCGGTT ACGTGAAGAG
TATCGTTCGG TGAGCATTTC GCGTCAGTGG TTGGAAGAGG CTGAAAAGCG CTTGGCCGCT
ACCGCCGAGC AGGCACAGCA GGCTACCGAA CGGTTGCAGC GACAAATGGT GCCGTCCGCT
CCGCCACCAC CTGCCGAGCG TCCGTTGCAG GTTGGTGATA CGGTTCATGT GGCATCGGTC
GGTCTCAACG GCGAAATTAT GGCCATCGAT ACGGACGATG AGACGGCAAC GGTGCAAGTC
GGTGGTTTTC GTTTGACCGT CAAATGCAGC GAGTTGAAGC GTGCTAAGGC AGCGGATAAT
GGTGAACGCC GCTTTGCACC TCCAGAGCGA CCGGTGAATC TGCCGTCAAT GCCCGATGTC
TCGATGACCT TCGATATGCG AGGCTGGCGT GTCTCCGAAG TGAGTGATCG GCTCGACCGT
TACCTCAACG ATGCCTATCT GGCGGGTTTG CACCAGGTTC GCCTCATTCA CGGTAAAGGA
ACCGGTGCCT TGCGCCAGGT GGTGCGTGAT GTGTTAGCTT CCCATCCGCT AGTGGCATCG
TTTACCGGCG GTGGTCGTGA TGGCGGTGAT GGGGTGACCA TCGCTACACT CGTGGATCGG
TGA
 
Protein sequence
MSIPETSLHT LEFDAVRDRL AHYTAFSASR ELALSLTPST DLNEVRRRQA LTAEARLLLE 
EWPDLTIGGA RDVRRSAHHA ARGGMLDGTT LRDIAATLRS AATLRQRLSR LDDRFPNLRD
LGYTLPALPH LIDAIEQAIG DDGQVLDSAS PTLARLRHEV RVAFNRLQER LQSMIHSPTL
AAALQEPIIT VRNGRYVIPV KAIHRREVRG LVHDQSGSGA TLYIEPLAIV ELNNRWRELQ
SAEAEEVQRI LGALSEQVGE AVSAIVSTVN MLAALDLVFA LARYAIATRS TAPEIVDWRP
DDPPSTEPPL RLIRARHPLL PPDKVVPIDL WLGGTFSILL ITGPNTGGKT VALKTTGLLA
LMAQAGMQIP ADQPSRLPVF QYIFADIGDE QSIEQSLSTF SSHMANIIRV LQTLTEAQSF
PAAPTDQALF DYRRPAALVL FDELGAGTDP VEGSALARAI IGRLLELGVL AVATTHYPEL
KAFAYATPGV ENASVEFDVE TLAPTYRLSI GVPGHSNALA IAARLGLDPA LIEQARSFID
RNEAQVEDLL AGIQRERAAA TEALQRAEEL RADAEKYRAR LAAEQQAFAA EREVALAAAR
QEIEAELREV RQQLRRLREE YRSVSISRQW LEEAEKRLAA TAEQAQQATE RLQRQMVPSA
PPPPAERPLQ VGDTVHVASV GLNGEIMAID TDDETATVQV GGFRLTVKCS ELKRAKAADN
GERRFAPPER PVNLPSMPDV SMTFDMRGWR VSEVSDRLDR YLNDAYLAGL HQVRLIHGKG
TGALRQVVRD VLASHPLVAS FTGGGRDGGD GVTIATLVDR
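
Both sequences are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any computation. The sketch below is one possible way to do that: it strips the whitespace, computes the GC fraction, and translates the nucleotide sequence up to the first stop codon. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code; table 11 additionally allows alternative start codons, which this sketch does not handle (the gene above starts with ATG). All function and variable names are hypothetical.

```python
# Codon table built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGTCGATCC CAGAAACATC GCTTCATACA")
print(translate(fragment))  # MSIPETSLHT
```

Applied to the full gene sequence, the same functions could be used to compare the computed GC fraction with the GC content field and the translation with the listed protein sequence.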