Gene Cphamn1_0302 details


Gene Information

Locus tag: Cphamn1_0302
Symbol:
ID: 6373957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 296357
End bp: 298231
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 51%
IMG OID: 642682816
Product: DNA mismatch repair protein MutL
Protein accession: YP_001958752
Protein GI: 189499282
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
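
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(296357, 298231)
print(length)                     # 1875
print(protein_length_aa(length))  # 624
```

Both results match the Gene Length and Protein Length fields of the record.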


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000565974
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00985332
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGAGAA TTGCCAGGTT GCCCGATATT GTCGCGAACA AGATTTCAGC AGGGGAGGTT 
GTCCAGCGCC CTGCTTCAGT GGTGAAAGAA CTGCTCGAAA ATGCCATTGA TGCAGGGGCG
ACACGTATCA CTGTGGCGAT AAAAGATGCG GGAAAAGAGC TTGTTCAGGT TATCGACAAT
GGTTCGGGCA TGGATGAAGA GGATGCGCTC CGTTGTGTCG AACGGTTTGC TACCAGCAAA
ATTTCTGATG CGGAGGAGCT CGATGCCCTC ACGACTCTGG GGTTCAGGGG AGAGGCGCTG
GCAAGTATCT CCACGGTCTC GCATTTCGAA CTCAGGACGC GGCGTGAGAA CGACAACGTG
GGGATTCAGC TGCGTTACGA GGGAGGTGTG CTGTCGGAGA GGGGCAAGGC GGCGTCTGAG
CCGGGTACGG CGGTAAGCGT ACGAAACCTT TTTTACAATG TTCCGGCCCG CAGGAAGTTT
CTTAAATCAA ATGCTACGGA ATTCAAGCAT ATTTTCGAGA GCGTGAAGGC CCAGGTGCTT
GCCTATCCCG AGATACAGTG GCAGATGATC AATGATGATG AAACGCTTTT CGATTTCAGA
AGCTCGGACA TGCACGAGCG CCTGAATTTT TTTTTCGGTG ACGACTTTGC CGGGAGTCTG
ATAGAAGTTC ACGATGATAA CGATTTTCTT TCTCTGCACG GTTATGTCGG CAAGCCTTCC
ATGCAGAAAC GCCAGAAGAA TGAACAGTTC ATCTATCTGA ACAGGCGGGT GATCCAGAAC
AGGATGCTCT CACAGGCTTT GCAGCAGGCC TACGGGGAAC TTCTTGTAGA GCGCCATTCT
CCCTTCGTGC TTCTCTTTCT TGGAATCGAC CCTCAGCAGA CTGATGTTAA CGTTCATCCC
GCAAAGCTTG AAGTGAAGTT CGAGGATGAA CGCAGTGTGC GAACCATGTT TTATACTATT
ATAAAACGAT CTGTCAGGAT GCAGGACTTC TCACCTGATG TCGGCGGTGA AGGGTTCCAT
GAGACGAGTG ATTCTTTTTC TTCCCGGAGT TCTCAGCATA GCGATGCCAG GCTTGGCTTT
CAGGCGGTTC CTTCCAGAGC GTCATCAACC GATGATCTCT ACAGGGAGTT TCAGGAGAGT
ACGCCGAAGC GTCCGATGCC GGACAGAACG CGTGTCAGTG AACAGGAAGA GATGTTCAGT
CACAGCGCCG ATATTTTCTG TGAACCGGAC AGGGAGTTTC GCAGCAGTGA TTTCGGACAG
GTTTCAGAGG AGTTTGTTGA CGGAGTGCGC CTGGAACCGG AAGAGAAAGA TCCCAAAATC
TGGCAACTGC ATAACAAGTA TATCATCTGT CAGATCAAGA CAGGATTGAT GCTTATCGAT
CAGCATGTCG CTCATGAACG GGTTCTCTAT GAGCGTGCGG TAGATATTAT GGACAACAAC
GTCCCGAATG CCCAGCAACT TCTTTTTCCT CAGAAAGTCG AGCTCAAGCC TTGGGAATTC
GAGATCTATC TGGAGATTTG CGATGACCTC GACAGGCTTG GTTTCAATCT CGGCACACTG
GGAACGAGGA CCGTTATGAT AGAGGGTGTT CCACAGGATG TTCGCAGCGG TTCGGAGGCC
TATATCCTTC AGGACATGAT TCAGGAGTAT CAGCAGAATG CGTCAAAACT GAAGCTCGAG
AAACGTGAAA ATCTTGCTAA ATCCTACTCC TGCCGGAACG CGATAATGAG CGGTCAGGCA
TTGAGCCTTG AAGATATGCG CTCCCTTATT GACAGGCTGT TTGCGACGAA AATGCCGTAT
GTCTGTCCAC ATGGGCGTCC GGTAATTATA CGGATCTCTC TTGACCAGCT GGACAGAATG
TTCGGGCGGA AGTAG
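
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be normalized before any computation such as GC content. The sketch below shows one way to do that; the helper names are hypothetical, and only the first line of the listing is used as sample input. To compare against the GC content field of the record, the full 1875 bp sequence would be passed instead.

```python
def normalize_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped listing and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing (six blocks of ten bases).
first_line = "ATGCCGAGAA TTGCCAGGTT GCCCGATATT GTCGCGAACA AGATTTCAGC AGGGGAGGTT"
seq = normalize_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base excerpt only
```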
 
Protein sequence
MPRIARLPDI VANKISAGEV VQRPASVVKE LLENAIDAGA TRITVAIKDA GKELVQVIDN 
GSGMDEEDAL RCVERFATSK ISDAEELDAL TTLGFRGEAL ASISTVSHFE LRTRRENDNV
GIQLRYEGGV LSERGKAASE PGTAVSVRNL FYNVPARRKF LKSNATEFKH IFESVKAQVL
AYPEIQWQMI NDDETLFDFR SSDMHERLNF FFGDDFAGSL IEVHDDNDFL SLHGYVGKPS
MQKRQKNEQF IYLNRRVIQN RMLSQALQQA YGELLVERHS PFVLLFLGID PQQTDVNVHP
AKLEVKFEDE RSVRTMFYTI IKRSVRMQDF SPDVGGEGFH ETSDSFSSRS SQHSDARLGF
QAVPSRASST DDLYREFQES TPKRPMPDRT RVSEQEEMFS HSADIFCEPD REFRSSDFGQ
VSEEFVDGVR LEPEEKDPKI WQLHNKYIIC QIKTGLMLID QHVAHERVLY ERAVDIMDNN
VPNAQQLLFP QKVELKPWEF EIYLEICDDL DRLGFNLGTL GTRTVMIEGV PQDVRSGSEA
YILQDMIQEY QQNASKLKLE KRENLAKSYS CRNAIMSGQA LSLEDMRSLI DRLFATKMPY
VCPHGRPVII RISLDQLDRM FGRK
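
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is an assumption-laden illustration rather than the database's own method: it builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Since this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignment
# (the same assignment applies to translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 15 bases of the gene sequence listing.
print(translate("ATGCCGAGAA TTGCCAGGTT GCCCGATATT"))  # MPRIARLPDI
print(translate("TTCGGGCGGA AGTAG"))                  # FGRK*
```

The two excerpts reproduce the first ten and last four residues of the protein sequence, with the trailing `*` marking the TAG stop codon.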