Gene Cphamn1_1829 details


Gene Information

Locus tag: Cphamn1_1829
Symbol:
ID: 6375520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1983652
End bp: 1986066
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 53%
IMG OID: 642684326
Product: Smr protein/MutS2
Protein accession: YP_001960228
Protein GI: 189500758
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
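
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1983652, 1986066)
print(length, protein_length_aa(length))  # prints: 2415 804
```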


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.401294
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTTCCTG AAAAAGCGGT TATTGCAGGT ATGAGGGATA CGTTGACAAA AAGAAAACTG 
GATTTTGACA GGATTGTCGA GCATGTTTCG ACATTCTGTA TCTCCGATAT GGGAAAGGAT
GCTCTTGCCG GGGCGGCTCC GGTTGCGGAC TCTTCGATGC TCGATGCTGA ACTGAACCGG
GTTCTCGAAT TGAAGGGTTT TCTTGAGGAG GGCAATCCTT TGCCTTTTTC GCATCTCCCG
GATACACGAC CGCTCCTGAA GAAGCTTGAA CTTGTCGATT ATTATCTGGA GCCGCGCGCT
CTCCTGGATA TACACGATCT GCTCTATGCT TCTGTAGCGT TGAGGAAGTT CATGTTCAGT
CACCGCGCGA TCTATCCTTC GCTGAACGAT CTGACTATTC GGCTCTGGCT GGAGAAGTCT
TTGCAGTATG AGATTCGCAG GGTTGTTGAT GAAGAGGGGC GGATACGTGA CACGTCCAGT
GACGGCCTCT TTTCGCTGAG GCGCGAGCTC AACGGGAGCA GGGAGGAACT GCGCAGAAAA
ATGAACCGTT TGCTGAAACG GTGTCAGGCC GGAGGTTGGC TGATGGAGGA CACTGTCGCG
ATGAAAAACG GGAGACTTGT TCTCGCCCTG AAGGTGGAAT ACAGGCACAA GCTTCAGGGA
TTCATCCAGG ACTACTCCCA GACAGGCCAG ACAGTGTTTA TAGAGCCTGC CGAGACGCTT
GAGATAAGCA ACCGGATACA GGATCTCGAG ATTCAGGAGC GCAGGGAGGT AGAAAGGATC
CTGAAAGAGA TGTCCGGGCA TATTCGGGAT GAGCTTGAAA ATGTGCGGCA TAACCAGGAT
GTGCTGGCCT CGTTCGATTC TCTCTACGGC CGCGCCAGGT TTGCGGCCGA TACCGGTTCG
GTGCTCCCTG TGGTTGATCC GGGCGGGACG CTTCAGATTA TCCGCGGTTT TCATCCATGG
CTGCTTGTCT CTCATCGCGA GAAAAAAGAG CAGATTTATC CTCTTGACCT GTCGCTTGAT
GACGATGAAC AGGTTCTTGT GATCTCAGGC CCGAACGCAG GCGGGAAATC GGTCGCCATG
AAAACGGTCG GTCTCCTGTG CTGTATGCTG CGGCACGGGT ATCTTGTTCC CTGCAGCGAG
AGTTCGGTTT TTCCTTTTTT CGAAGAGATC TCTATCGAGA TCGGCGATGA GCAGTCAATA
GAAAACGATC TTTCAACGTT CAGTTCTCAT CTTGAACATA TCCGGATTAT TCTTGAGTCA
GCAGGGCCGA AGAGTCTGGT GCTGATCGAC GAGCTGTGTT CGGGGACGGA TGTCGAGGAG
GGGAGCTCGA TTGCCCGCGC GGTTATCGAG GAGCTTCTTG CAAGGGGTTC GAAGGTTCTT
GTGACGACTC ATCTCGGAGA GATGAAAGCC TATGCACATG AGCGGGCAAA CGTGGTGAAC
GGAGCTATGG AATTCGACAA AACTTCCCTG AGCCCATCGT TCAGGTTTCT CAAAGGACTG
CCGGGCAACA GTTTCGCGTT TGCTATGATG CAGCGCATGG GATTCCGCTC TGAGCTGGTC
GAACGTGCCG AGAGCTATCT GGATAAAGGC AGAGCCGGGT TGGAACGGCT TCTGGATGAT
CTCACGATTG CTCTTGAGGA AAATCGGGGA CTGAGCCAAT TGCTTGAGCA GGAGCGCGCG
TGGCTTGCGA CTGAGCGCGA ACGTCTCCTT GACGCAGGTA AAGAGATGAT GGAGAGGGAG
CGTGAGCTGA AGCTGACCAG GCAGCGGGAG ATGCAGCGGG AGGTCGAGAA GGCGAAAAAA
ATGATCCGTG ATATTGTTCG GGAGGTAAAA AAACAGCCGG TGGAAAAGGT CGTCACTGCA
GCAAGAAACA AACTGGAGAG CCGGAAAAAG CAAGCTGTGC GTGAAGAAAA ACGGATTGTT
TCGGAGATGA CAGGTAAGGC GATGCCTGAA GAGCCCATCC GGCAGGGCGA TATGGTTCGG
GTTCTGTCGA CAAATACTTC GGGCGAGGTG GTTTCATTGC AGGGTGAAGA TGCTGTGGTG
CGCTGCGGGA CGTTTCGTCT GTCTACCTCG GTTAAGAATC TTGAAAGGGT TTCAAAAACC
GCGGCGAAAA AACTCGATCG GAAGGACAGC CCGCCTTCTC AGGCGGGAGT GAGCGTCTCT
CAAACTTCCG CACTTGAATC GACGAGACTG GATTTGCGGG GAATGTCGGG AGATGAAGCG
GTAAACGAAG TGGAGCGGTT TATCGATAAA TTGCGGCTCA ACAGGATCGT GAGCGCGACA
ATAATCCATG GGAAAGGTAC CGGGGCACTG CGCCAGAAAG TGGCGCAGTG TCTACAAAAG
CACCCTGCAG TAAAACGCTA TCGGCTGGGG GAGTGGTCGG AAGGTGGCGC GGGAGTGACG
ATACTTGAAT TGTAG
 
Protein sequence
MFPEKAVIAG MRDTLTKRKL DFDRIVEHVS TFCISDMGKD ALAGAAPVAD SSMLDAELNR 
VLELKGFLEE GNPLPFSHLP DTRPLLKKLE LVDYYLEPRA LLDIHDLLYA SVALRKFMFS
HRAIYPSLND LTIRLWLEKS LQYEIRRVVD EEGRIRDTSS DGLFSLRREL NGSREELRRK
MNRLLKRCQA GGWLMEDTVA MKNGRLVLAL KVEYRHKLQG FIQDYSQTGQ TVFIEPAETL
EISNRIQDLE IQERREVERI LKEMSGHIRD ELENVRHNQD VLASFDSLYG RARFAADTGS
VLPVVDPGGT LQIIRGFHPW LLVSHREKKE QIYPLDLSLD DDEQVLVISG PNAGGKSVAM
KTVGLLCCML RHGYLVPCSE SSVFPFFEEI SIEIGDEQSI ENDLSTFSSH LEHIRIILES
AGPKSLVLID ELCSGTDVEE GSSIARAVIE ELLARGSKVL VTTHLGEMKA YAHERANVVN
GAMEFDKTSL SPSFRFLKGL PGNSFAFAMM QRMGFRSELV ERAESYLDKG RAGLERLLDD
LTIALEENRG LSQLLEQERA WLATERERLL DAGKEMMERE RELKLTRQRE MQREVEKAKK
MIRDIVREVK KQPVEKVVTA ARNKLESRKK QAVREEKRIV SEMTGKAMPE EPIRQGDMVR
VLSTNTSGEV VSLQGEDAVV RCGTFRLSTS VKNLERVSKT AAKKLDRKDS PPSQAGVSVS
QTSALESTRL DLRGMSGDEA VNEVERFIDK LRLNRIVSAT IIHGKGTGAL RQKVAQCLQK
HPAVKRYRLG EWSEGGAGVT ILEL
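
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the sequence is printed in the coding orientation, strips the 10-character block spacing, and uses a hand-built codon table; `clean` and `translate` are hypothetical helper names, not functions from an existing library.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments with the
# standard code and differs only in which codons may initiate translation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the block spacing and line breaks used on the page."""
    return "".join(seq.split()).upper()


def translate(dna, is_cds_start=True):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS_TABLE_11:
            protein.append("M")  # an initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence in this record.
print(translate("TTGTTTCCTG AAAAAGCGGT TATTGCAGGT"))        # prints: MFPEKAVIAG
print(translate("ATACTTGAAT TGTAG", is_cds_start=False))    # prints: ILEL
```

Both outputs match the first block and the final four residues of the protein sequence shown above.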