Gene Cagg_3768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3768 
Symbol 
ID7267841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4592846 
End bp4595752 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content57% 
IMG OID643568575 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002465040 
Protein GI219850607 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCGA TTGAATTGCA TGCGTGGTAT CGCCAGTATC GCAAACTAAA AGAAGAGGCT 
GCCGATGCGA TTTTGCTCTT TCGCTTCGGT GATTTCTACG AGACGTTTGA TGATGATGCA
AAGTTGATTG CAGAGTTACT TGACGTAACT TTAACACGTA AAGAATACGC CGTTGATAAG
CGGGCACCGA AGGATCAGCA GAAGTTGTAT GCGCCGATGG CGGGAATGCC TTACCACGCC
GTCGATCGCT ATGTGAGCGA ATTAGTTGCC CGTGGCTATC GGGTGGCAAT TGCCGAGCAG
TTGAGCGAGA CGGAGGCAAT GCGCAATGAT ACGCGGCCTC GTTCGGTCTA CGCTGCCGGT
TTGACCCCGC TCGAAAGCAG CGGCAAGATG GTACAGCGGG CCATTGTACG GATTATCACG
CCGGGTACCG TGATCGATCC GGCGATGTTG CCCGATCGCA CGAACAACTA TTTGGCAGCA
GTGCTGGTTG AACAAGGGAA GGTAGGGTTG GCGTATGCCG ACCTTTCAAC CGGTGAATTT
GCCGCTGCCG AGTTTGTTGA TGCACGTGCA TTGACCCAGT TGCAAGCCGA GTTGGCGCGT
CTTCGTCCCG CTGAAGTGCT CGTCCCTGAC GATGAGGCGC TCCGTTTGCC AAACCTGGCC
CCGGTACAGG CACGTTTGAG CCAAGACCTG GCCCCGCTCA CCAAGGAGGA GCGTGAGGTA
TTGTTGCCCC ACGAGCGGGT GGCCCGTCGT CTTGATGCCC CTGGTGCTGC CAGTTGGACG
CAAGGTCACG TGACCGAGTG GCCGACGTGG CGCTGGGAGT TGGCGACGGC TGCCTCCGCG
TTGTGTGAGC AGTTGGCAGT TGCGACGTTG GCGGTGTGTG GTCTTGAAGA CCGCCCGTTG
GCGACACGCG CTGCCGGTGC ATTGATTCAA TACGCCCAGA CCACGCAACG CCAGCGGGTC
AACCAGTTGC GGTATCTGCG GGTGTACCAG ACCGGTGCAT ATATGCTGCT CGATCCGCAG
ACGCGGCGCA ATCTTGAATT GCTGGAAAGT AGTGGGCGGC AAGGGGCAAA AGCTTCGTTG
ATCGGCGTGC TCGACCGCAC GTGTACGGCA ATGGGGGCGC GTTTGCTGCG GCGTTGGATT
GCCCAACCGC TGATCGTTTT AGAACCATTG CAAGTGCGTC AGCATGCTGT AGCACGCCTG
GTCGCCGAGA CGATGACTCG GCTTGAGCTT CGTGAGGCGC TGGCCGAGTT GCCCGATATG
GAGCGGGCGC TCAATCGGAT CGCACAAGGT ATTGCAGTGG CAACGCCGCG TGATATGGTT
CAGTTACGGG CCGCGTTGCG CAAACTACCC GGCATCGCGC AAGCTATCGC ACCGTTGTTA
CCCGACTTGC TCGCCCCTGA AATGGACGGC GAGCCGCTGC TCACGTTTGA CCCGTGCAGT
GATGTGCTCG ATCTGCTAGA ACGGGCACTC GACGATGATC CACCGGCGTT GCTCGGTTCG
TCGAACTATC TACGGGCTGC CGAAGAGGGT GGCGAGCGAC CGCGCCGTGT GATCCGCCCC
GGTTTCGATC AGCGTCTCGA TGCGTTGATT AAGGCTAGTC GCCATGCCCA AGAATTCATC
GACCGTCTCG AAACGAAAGA ACGTGAGCGT ACCGGGATTC GTTCGCTCAA AGTGGGTTAC
AATCAAGTGT TTGGTTACTA TATCGAAATA TCGCGTGCCG TTGATCCGAA ACTGATCCCA
TCACATTACG AACGCAAGCA AACGCTGGTG AATGCTGAGC GTTATGTGAC TGAAGAGCTG
AAGTACTACG AAGGGTTGCT CAGCGATGCA CGGTTAAAGC TGGTTGATCT TGAACGAGAC
ATCTTTCAAC GGTTGTGCGA TGACATTCAG CAACACCTCG ACCGGCTGCG GATAACGGTG
GCCGCAGTGG CCCGCCTCGA TGCGTTAGCC GCCCTGGCCG AGGTGGCAGT GCGTGGCCGT
TATGTCCAAC CGACCTTGCG AACCGATCGG GTATTGCGGA TCAAGCAGGG CCGTCATCCG
GTTGTTGAGC GGACGCTGGG TGAGCCGTTC ATCGGCAACG ATGTCGATCT TGATGGTGAT
AATGTCCAGA TTTTGATCAT TACCGGCCCG AATATGGCCG GTAAAAGCAC TTTCTTGCGC
CAGGTGGCCT TGATTACCCT GATGGCGCAG ATCGGCTCGT TTGTCCCCGC CGATGAAGCC
GAAATTGGCT TGGTGGATCG CATTTTTACC CGGATCGGTG CTCAAGACGA CATCGCTACC
GGTCAGAGCA CCTTTATGGT TGAGATGACC GAAACTGCTG CATTGCTTAT GCAGAGTACA
CCCCGTTCGC TGATCATTCT CGATGAGGTG GGGCGTGGGA CGAGTACGTA TGACGGTATG
GCAATTGCCC GGGCCGTAGT TGAGTACATC CATAACGAAC CTCGGTTGGG GTGTCGGACG
TTGTTTGCGA CCCACTATCA CGAGTTGACG GCACTTGATA CCGAACTACC TCGTGTACGC
AACTTTCATA TGGCGGCTGT CGAGCGTGAC GGCCGAGTCG TCTTTTTGCA TGAGCTGCGT
CCCGGTGGTG CCGATCGCTC GTATGGTATA CACGTCGCCG AGCTGGCCGG TATTCCGGCG
AGTGTGATCA GGCGGGCCAA TGATTTGCTG GCCGAACTCG AGGGTCACAC GGCACGACCG
ACGGATCGGC ACGCCAAACC ACGTTCGGAT GGGGAGCGCG CATTGCCATC GGCACCCTCA
ACCGTAGGTA GTATGCAGTT ATCGCTATTT GATCTCGTAC CGCACCCGGT AGTGGAGTAT
CTCCGCCGGC TGCGGATTGA GGAACTTACC CCGTTGGAAG CGTTGAACCG GCTGGCCGAG
TTACAACGGC TAGCTCGCGA AGGATAG
 
Protein sequence
MAAIELHAWY RQYRKLKEEA ADAILLFRFG DFYETFDDDA KLIAELLDVT LTRKEYAVDK 
RAPKDQQKLY APMAGMPYHA VDRYVSELVA RGYRVAIAEQ LSETEAMRND TRPRSVYAAG
LTPLESSGKM VQRAIVRIIT PGTVIDPAML PDRTNNYLAA VLVEQGKVGL AYADLSTGEF
AAAEFVDARA LTQLQAELAR LRPAEVLVPD DEALRLPNLA PVQARLSQDL APLTKEEREV
LLPHERVARR LDAPGAASWT QGHVTEWPTW RWELATAASA LCEQLAVATL AVCGLEDRPL
ATRAAGALIQ YAQTTQRQRV NQLRYLRVYQ TGAYMLLDPQ TRRNLELLES SGRQGAKASL
IGVLDRTCTA MGARLLRRWI AQPLIVLEPL QVRQHAVARL VAETMTRLEL REALAELPDM
ERALNRIAQG IAVATPRDMV QLRAALRKLP GIAQAIAPLL PDLLAPEMDG EPLLTFDPCS
DVLDLLERAL DDDPPALLGS SNYLRAAEEG GERPRRVIRP GFDQRLDALI KASRHAQEFI
DRLETKERER TGIRSLKVGY NQVFGYYIEI SRAVDPKLIP SHYERKQTLV NAERYVTEEL
KYYEGLLSDA RLKLVDLERD IFQRLCDDIQ QHLDRLRITV AAVARLDALA ALAEVAVRGR
YVQPTLRTDR VLRIKQGRHP VVERTLGEPF IGNDVDLDGD NVQILIITGP NMAGKSTFLR
QVALITLMAQ IGSFVPADEA EIGLVDRIFT RIGAQDDIAT GQSTFMVEMT ETAALLMQST
PRSLIILDEV GRGTSTYDGM AIARAVVEYI HNEPRLGCRT LFATHYHELT ALDTELPRVR
NFHMAAVERD GRVVFLHELR PGGADRSYGI HVAELAGIPA SVIRRANDLL AELEGHTARP
TDRHAKPRSD GERALPSAPS TVGSMQLSLF DLVPHPVVEY LRRLRIEELT PLEALNRLAE
LQRLAREG