Gene Cag_0699 details


Gene Information

Locus tag: Cag_0699
Symbol:
ID: 3747473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 970440
End bp: 972821
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 52%
IMG OID: 637773233
Product: DNA mismatch repair protein MutS-like
Protein accession: YP_379013
Protein GI: 78188675
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
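
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(970440, 972821)
print(length)                  # 2382
print(protein_length(length))  # 793
```

Both values agree with the Gene Length and Protein Length fields of this record.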


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCCT CTACGCTAAA AAAACTTGAA TTTACGAAAA TAGCAGCTTA TGCAGCGCAA 
TTGTGCCTTT CGCCTATGGG GCGCGACCGT TTGCTCAATG CTCGTCCGTT GCGTGAGCGT
GAGGCGCTGA TGGCGGAATT GGAGCGCGTG CTTGAGTTGC GCATGTTGTT GCAAGAGGGG
CTAACCTTGC CCTTTTCCCA CCTGCCCGAC ACTCGCGTGC TCTTAAAAAA GCTGGAGATT
GAGCACCTTG CGCTTGAACC GCTTGAGTTG CTTGATCTCT ATCATTTGCT CTACTCATCG
GTGCAGTTGC GCCGTTTTAT GTATGGTAAT CGTGAGCGTT ATGGTCGCTT GAACGATCTT
ACCATTATGC TCTGGATGGA GCGAAGCTTG CAAGCAATGA TTCAACGCTG TGTGGATGAG
CGTGGCTTGG TGCGCGATAG TGCCAGCGAT GGGCTGTTGC TGATTCGCCA TGATCTTGCT
GAAAGTCGGG AGTTGTTGCG CCGCCGCATG GAGCGTTTGC TGCGCCGTGC AAGTGCAAAT
GGTTGGTTGA TGGAGGAGAC GGTTGCGGTA AAAAATGGGC GTTTAACTTT AGCCTTAAAG
GTTGAGTACA AGTATAAAAT CCCCGGTTAC ATTCAAGATT ATTCAGGCAC GGGGCAGACG
GTTTTTATTG AGCCTGCCGA AACGTTGGAA ACCAGCAACC GCATTCAAGA TTTAGAGATT
AGCGAGCGGC GCGAGGTGGA GCGCATTTTG CAAGAGGTGA GTGCGGCGTT GCGCGGCGAG
CTTGAAAATA TTCACCACAA TCAACAATTG ATGGCTGAGT TTGATGCGCT TTACGCCCGT
GCTCGCTTTG CGGTTGAAAC CAACGCCGTG CTGCCTACTG TTACGGAGGG CAACGAGTTG
CGCTTAATAA AAGCTTACCA TCCATGGCTT TTGCTCTCGC ACCGTGAGCG CACGGTGCAG
CCGCTTGACC TCCATTTAAG CGCTGAAGAG CAGGTTTTAG TGATTTCGGG ACCCAATGCG
GGTGGCAAGT CGGTTACTAT GAAAAGCGTG GGATTGCTCT GCTGTATGCT TGTGCATGGC
TACCTTCTCC CCTGTAGCGA AAGCTCCTGC ATTCCTCTCT TTAACAATAT TTTTATTGAA
ATTGGCGACG ACCAATCCAT TGAGCACGAC CTTTCCACCT TTAGTTCCCA CCTCAGCGCC
ATTCGCTCTA TTCTTGAGCG GGCAGGCACG CGCGATTTAG TTTTAATTGA TGAATTGTGC
GGCGGCACCG ACGTTGAAGA GGGCGGAGCA ATTGCACGTG CGGTGATTGA AGAGCTTTTG
GCATCAGTGG CAAAAAGCAT TGTAACTACG CACCTTGGCG ACCTTAAAGC CTATGCACAC
CAGCGCGACG GGGTGGTGAA TGGCGCTATG GCGTTTGACC GTGCTGAGCT GCAACCAACC
TTCCGTTTTA TTAAAGGATT GCCCGGCAAC AGCTTTGCCT TTGCCATGAT GCAACGCATG
GGCTTTTCGC CCGCTTTGGT GGAGCGAGCA CGCCACTTTA TGGCGCACGA ACGCATTGGC
TTAGAGCAAA TGGTGGACGA TTTGAGCCAT ATTATGGAGG AGCAACAACG CCAACGCCAG
CAGCTTGACG ACGAGCAACG CACCTTTGCA GAGCGCGAAC GCACGGTGCT GGAGGTTGAA
GCGACCCTAA AGCAGCAACA ACGCGAGTTA AAACAACAAA TTTCACGCGC CGTGCAAAAA
GAGGTGGAAC ATGCCCGCAA AGAGATTCGC GCCATTGTGC AAGAGGTAAA AGCGGCGCCG
ACCAATCCGC AAGTGGTTCA AGCTGCTCGC GAAAAGCTTG GCATCAAGCG TCAAGAGGTT
GAAGAGCGCC ATACCACCGC TGCACCCACA ACCGCAAGCG AGCCAACCAT TGATCGCACC
ATCACCATTG GCGACATGGT GCGCTTGCTT GACACCAACG CCACGGGCGA AGTTGAACGC
TTTAACGGTG ATAACGTGGT GGTACGCTGC GGAACCATTC GTTTGCAGAC GCATCTAAAG
AATTTGGAAA AAAGCTCCAA AACCAAAGCA CGCACCGCAC AGCGCGACAC CTCGAATAGC
AAGGTACGCT CATGGTCAAC CGTTACAAAC GAGGTCAGCT CAACGCAGCT TGATGTACGA
GGCATGAGCG GCAACGAAGC CGTCCCCCAT ATTGAGCGCT TTCTTGATAC ATTGCGCCTG
CACCGCATTC ACTTTGCCAC CATTTTGCAC GGCAAAGGCA CAGGCTCACT CCGCAAACGC
ACTGCCGAAT GCTTAAAATT GCACACTGCC GTTAAAAGCT TTCGTTTAGG GGGATTAGGG
GAAGGTGGGG ATGGGGTTAC GATTGTGGAG TTGGGGGAGT GA
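
The gene sequence above is printed in groups of ten bases, so it has to be normalised before its length or GC content can be compared with the record's fields. The following is a minimal sketch, assuming the block contains only whitespace and the letters A, C, G, T; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGAATCCCT CTACGCTAAA AAAACTTGAA TTTACGAAAA TAGCAGCTTA TGCAGCGCAA"
seq = clean_sequence(first_line)
print(len(seq))              # 60
print(gc_percent(seq[:10]))  # 40.0
```

Applying the same two functions to the full block should yield values comparable to the Gene Length and GC content fields of the record.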
 
Protein sequence
MNPSTLKKLE FTKIAAYAAQ LCLSPMGRDR LLNARPLRER EALMAELERV LELRMLLQEG 
LTLPFSHLPD TRVLLKKLEI EHLALEPLEL LDLYHLLYSS VQLRRFMYGN RERYGRLNDL
TIMLWMERSL QAMIQRCVDE RGLVRDSASD GLLLIRHDLA ESRELLRRRM ERLLRRASAN
GWLMEETVAV KNGRLTLALK VEYKYKIPGY IQDYSGTGQT VFIEPAETLE TSNRIQDLEI
SERREVERIL QEVSAALRGE LENIHHNQQL MAEFDALYAR ARFAVETNAV LPTVTEGNEL
RLIKAYHPWL LLSHRERTVQ PLDLHLSAEE QVLVISGPNA GGKSVTMKSV GLLCCMLVHG
YLLPCSESSC IPLFNNIFIE IGDDQSIEHD LSTFSSHLSA IRSILERAGT RDLVLIDELC
GGTDVEEGGA IARAVIEELL ASVAKSIVTT HLGDLKAYAH QRDGVVNGAM AFDRAELQPT
FRFIKGLPGN SFAFAMMQRM GFSPALVERA RHFMAHERIG LEQMVDDLSH IMEEQQRQRQ
QLDDEQRTFA ERERTVLEVE ATLKQQQREL KQQISRAVQK EVEHARKEIR AIVQEVKAAP
TNPQVVQAAR EKLGIKRQEV EERHTTAAPT TASEPTIDRT ITIGDMVRLL DTNATGEVER
FNGDNVVVRC GTIRLQTHLK NLEKSSKTKA RTAQRDTSNS KVRSWSTVTN EVSSTQLDVR
GMSGNEAVPH IERFLDTLRL HRIHFATILH GKGTGSLRKR TAECLKLHTA VKSFRLGGLG
EGGDGVTIVE LGE
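
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard table; it does not handle the alternative start codons that table 11 permits, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 42 bases of the gene sequence listed above.
print(translate("ATGAATCCCT CTACGCTAAA AAAACTTGAA"))                # MNPSTLKKLE
print(translate("GAAGGTGGGG ATGGGGTTAC GATTGTGGAG TTGGGGGAGT GA"))  # EGGDGVTIVELGE
```

The two outputs match the first ten and last thirteen residues of the protein sequence, and the final TGA codon is the stop.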