Gene Clim_1732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1732 
Symbol 
ID6354560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1902579 
End bp1904969 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content53% 
IMG OID642669336 
ProductSmr protein/MutS2 
Protein accessionYP_001943752 
Protein GI189347223 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCGG TTACGTTAAG AAAGCTCGAG TTTACGAGGG TTGCCGAATA TACGTCGAGG 
TATTGTCTCT CCGCCATGGG AAGCGATCTT CTGCTCGAGG CTTATCGAGA GACGGAAAGG
GGGCTTCTCC TCGCGGATCT GGAACGGGTG CTCGAACTGA AAAATTTTCT GCTTGAAGGG
CAATCCCTGC CGTTTTCACT GCTGCCGGAT ACCCGTCATC TGCTGAGGCA TCTCGAGGTT
ATAGACAGCT ATCTCGATGC GGAGGGACTG CAGGATATTT TCCATCTGCT GCACGCATCG
GCTCAGCTCA GAAAGTTCAT GTTCGGGAGC CGCGAGATAT ATCCGAAGCT GAATGAGTTC
TGCATCAGGC TTTGGCTTGA GAAGACGCTT CAGTATACCA TCCGCAGTGT CGTCGATGAA
CAGGGAGGTA TACGGGATAC CGCCAGTGAC GGTCTTTTTC TGATCCGCAG CGAACTTCGG
GAGAGCCGCA CTGCCCTGCA GAGAAAAATG GAGCGGCTGC TTCGCCGCTG CCTCGAAAAC
GGCTGGCTGA TGGATGATAC CATAGCGATG AAAAATGGCC GGCTTACGCT TGGCTTCAGG
GTCGAGTACA AGTACAAGGT ACCGGGTTAT ATCCAGGATT ATTCGGGCAG CGGTCAGACG
GTGTTTATCG AGCCGGCTGA GGCTCTCGCC CTGAGCAACA GGATTCAGGA ACTCGATATC
AATGAACGGC GGGAGATCGA GCGGATTCTT CGCGAGGTTT CGGGTATGAT TCGGCCTGAG
CTGGAAAACC TCAATCATAC TCAGGAGCTT CTTGCCGGAT TCGATACTCT TTACGCAAGG
GCAAGGCTTG CGGTGGAGAC CGGTTCGGTA TTGCCGAAAA TTTCTGAAGG CAGGGAGCTT
CGTATTGTCA GGGGATATCA CCCATGGCTT CTCATAACCC ATCGGATGAA ACAGGCAGAG
GTTTTTCCTC TCGATCTCGA TCTTCGGGAT GATGAACAGG TACTGATCAT ATCCGGGCCT
AATGCAGGAG GAAAGTCGGT GGCCATGAAA ACCGCCGGCC TTTTGTGCTG TATGCTTTCG
CATGGCTACC TGCTTCCCTG CAGCGAAAGT TCGGTTTTTC CGCTGTTCAG CTCTATCGAT
ATTGAAATCG GCGATGAACA GTCGATTGAA CATGATCTTT CGACCTTCAG CTCTCATCTC
GCCGCTATAC GGGGAATTCT TGAATCGGCC GGCGGCAGCT CTTTGGTGCT TATCGACGAG
CTTTGTTCCG GTACCGATGT CGAAGAGGGA GGCGCAATAG CGAAGGCCGT TATCGAAGAG
CTGCTTCGGA GAGGCGCGAT GACGTTTGTG ACGACGCATC TCGGCGAACT GAAGGCGTAT
GCTCATGAGC GTGACGGAGT TGTAAATGGT GCGATGGAGT TTGATCGGGA GGGGCTGCTT
CCGACTTTCA GGTTCATCAA GGGACTGCCG GGCAACAGTT TTGCTTTTGC CATGATGAAG
CGAATGGGAT TTGCTTCTGA TCTGATCGAT ACTGCACAGG GTTTTATGAA AGAAGAGCGG
GTAGGGGTTG AGCGGCTGCT CGATGATCTC AAGTGTCTGA TGCAGGAAAA TCGGGAGCTT
GATAGCTCTC TGAGGCATGA TAGAGCTGTT TTTGAAACAG AAAAGCAATT GTTTGCCGCA
GCGCGGGCTG AACTTGCCGC ACAGCGGACC GAGCTGAAAA GCAGAGCGTT GCGTGAAACA
CAGAAAGAGA TCGAGCATGC ACGCCGGGAG ATACGCGGCA TTGTTCAGGA ACTGAAAAAT
GCTCCATCTG ATGACAGAAC GGTACTTGAT GCCCGGAAAA AACTCGATCT CCGTAAAAAA
GCTCTTTCAG ACGTGGAGGC GGAACAGGAA AATCAGGTGG CGTTTTCGCC GGCGGCTGCG
GGGGATGAGA TTGGCACCGG CGATCTCGTC AGGCTGCAGG GCAGCAGTAC CACCGGTGAG
GTTGAATCGA TCCAGAGCGG CAGTGCTGTC GTTCGATGCG GTAATTTCAG GTTGACGACG
GCGTTGAAAA GTCTGGAGAA GATCACGAAA ACCATAGAGC GGAAGATGCA GAGAGAGCCT
CAGGCGTCTG CAGGAAAAGC GTCGTGGACA GCGGTGACTA CCGATGCCCT GTCTACCACC
ATCGACCTTC GAGGTATGAC CGGCGAGGAG GCAATTCCCG CTCTTGAGCG TTTTCTTGAC
AGCATGAGTA TGAATCGCAT CAGGATGGCA ACCATCATTC ATGGAAAGGG GACCGGCTCC
CTGCGCAGGA GAACAGCTGA ATTGCTTCAG CAGCATAAAG CTGTCAAGAG TTTCCGGCTT
GGAGAGTGGG GCGAGGGAGG GGCCGGGGTG ACGGTTGTAG CGTTGCAATG A
 
Protein sequence
MDAVTLRKLE FTRVAEYTSR YCLSAMGSDL LLEAYRETER GLLLADLERV LELKNFLLEG 
QSLPFSLLPD TRHLLRHLEV IDSYLDAEGL QDIFHLLHAS AQLRKFMFGS REIYPKLNEF
CIRLWLEKTL QYTIRSVVDE QGGIRDTASD GLFLIRSELR ESRTALQRKM ERLLRRCLEN
GWLMDDTIAM KNGRLTLGFR VEYKYKVPGY IQDYSGSGQT VFIEPAEALA LSNRIQELDI
NERREIERIL REVSGMIRPE LENLNHTQEL LAGFDTLYAR ARLAVETGSV LPKISEGREL
RIVRGYHPWL LITHRMKQAE VFPLDLDLRD DEQVLIISGP NAGGKSVAMK TAGLLCCMLS
HGYLLPCSES SVFPLFSSID IEIGDEQSIE HDLSTFSSHL AAIRGILESA GGSSLVLIDE
LCSGTDVEEG GAIAKAVIEE LLRRGAMTFV TTHLGELKAY AHERDGVVNG AMEFDREGLL
PTFRFIKGLP GNSFAFAMMK RMGFASDLID TAQGFMKEER VGVERLLDDL KCLMQENREL
DSSLRHDRAV FETEKQLFAA ARAELAAQRT ELKSRALRET QKEIEHARRE IRGIVQELKN
APSDDRTVLD ARKKLDLRKK ALSDVEAEQE NQVAFSPAAA GDEIGTGDLV RLQGSSTTGE
VESIQSGSAV VRCGNFRLTT ALKSLEKITK TIERKMQREP QASAGKASWT AVTTDALSTT
IDLRGMTGEE AIPALERFLD SMSMNRIRMA TIIHGKGTGS LRRRTAELLQ QHKAVKSFRL
GEWGEGGAGV TVVALQ