Gene Cag_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0145 
Symbol 
ID3747191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp162849 
End bp164783 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content45% 
IMG OID637772672 
ProductDNA mismatch repair protein 
Protein accessionYP_378466 
Protein GI78188128 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATAA TTACCCGATT ACCTGATAGC GTTGCTAATA AAATATCCGC AGGCGAAGTG 
GTGCAGCGCC CGGCATCGGT AGTTAAAGAG CTGCTTGAAA ATGCTATTGA TGCTGGCGCT
ACAAAAATTA GCGTTACTAT TAAAGATGCA GGTAAGGAGC TGATTCGCAT TGCCGATAAT
GGCGTTGGTA TGAATCGCGA TGATGCTTTG CTTTGCGTGG AGCGTTTTGC CACCAGTAAA
ATTAAAAGTG CAGATGATCT TGATGCGTTG CATACGTTGG GTTTTCGTGG GGAGGCGTTA
GCAAGTATTT GTTCGGTCTC TCATTTTGAG CTTAAAACTC GTCAAGCCGA TGCAACGCTT
GGCTTGCTGT TTCGCTACGA TGGTGGCTCG TTGGTTGAAG AGTTGGAGGT GCAAGCGGAG
CAAGGTACCA GCTTTAGTGT GCGCAATCTT TTTTATAACG TGCCTGCTCG TCGTAAGTTT
TTAAAGTCGA ATGCTACCGA GTATCATCAT CTTTTTGAGA TTGTAAAATC TTTTACGCTG
GCTTATCCCG AAATTGAGTG GCGTATGGTG AATGATGATG AGGAGCTGTT CAACTTTAAA
AACAATGATG TTCTTGAGCG GCTCAATTTT TATTATGGCG ATGATTTTGC AAGCAGCTTA
ATTGAGGTTG CTGAGCAAAA CGATTATTTG CCTATTCACG GCTATCTTGG CAAGCCTGCG
TTACAAAAAA AGCGCAAGTT GGAGCAATAC TTTTTTATTA ATCGTCGCCT TGTGCAAAAT
CGGATGTTGT TGCAAGCGGT GCAGCAGGCG TATGGTGATT TGCTTGTTGA GCGTCAAACA
CCGTTTGTGT TGCTTTTTCT GACGATTGAT CCTTCGCGTA TTGATGTGAA TGTGCATCCC
GCTAAGCTTG AAATTCGTTT TGATGATGAG CGGCAGGTGC GCTCCATGTT TTATCCCGTT
ATAAAGCGAG CGGTGCAGTT GCACGACTTT TCGACCAATA TTTCCGTTAT CGAACCTTTT
GCATCGGCTT CTGAACCATT TGTGGGCTCA TCTTCCCAAC CAATATTTTC ATCTACCTCA
AGCCAAGCGC CCCGTATGGG TGGGGGAAGT CGTCGTTTTG ATTTGAGTGA TGCGCCTGAG
CGTGCAATCA CTAAAAATGA GCTGTATCGC AATTATCGTG AAGGAGCTTT TTCGTCGCCC
TCGGTAGCTT CATATGATGC GCCATCTCCA TTGCAACAGG GTGGATTGTT TGCGTTGGCA
TCGGCTGAAG AGAGTTTGTT TGGTGCGCAA GCGGTGCATG AGGCAAGCGA AAACATTGAG
GCGTTCCAGC TTTCGCCGCT TGACAACATT GTTGAGCATA AAGAGGTTGA GCCAAAAATC
TGGCAGTTGC ATAACAAATA TCTTATATGT CAGATTAAAA CGGGGTTAAT GATTATTGAC
CAGCATGTGG CGCATGAGCG TGTGCTTTAT GAGCGAGCGT TGGAGGTGAT GCAGCAAAAT
GTGCCAAATG CGCAGCAATT GCTTTTTCCG CAAAAAGTGG AGTTTCGTGC TTGGGAGTAT
GAAGTGTTTG AGGAGATTCG TGATGACCTT TATCGCCTTG GCTTTAATGT GCGTTTGTTT
GGCAACCGCA CGGTGATGAT TGAGGGGGTG CCGCAAGATG TGAAGTCGGG GAGTGAGGTT
ACTATTTTGC AGGATATGAT TACGCAATAT CAAGAAAATG CTACCAAGCT GAAGTTGGAG
CGGCGCGATA ATTTAGCAAA GTCCTACTCC TGCCGTAATG CCATTATGAC GGGGCAGAAG
CTTTCGATGG AGGAGATGCG TTCGTTGATT GATAATCTTT TTGCAACACG AGAGCCTTAC
ACCTGCCCAC ACGGACGTCC AATTATCATC AAGTTATCGC TTGATCAGCT TGATAAAATG
TTTGGGAGGA AGTAA
 
Protein sequence
MPIITRLPDS VANKISAGEV VQRPASVVKE LLENAIDAGA TKISVTIKDA GKELIRIADN 
GVGMNRDDAL LCVERFATSK IKSADDLDAL HTLGFRGEAL ASICSVSHFE LKTRQADATL
GLLFRYDGGS LVEELEVQAE QGTSFSVRNL FYNVPARRKF LKSNATEYHH LFEIVKSFTL
AYPEIEWRMV NDDEELFNFK NNDVLERLNF YYGDDFASSL IEVAEQNDYL PIHGYLGKPA
LQKKRKLEQY FFINRRLVQN RMLLQAVQQA YGDLLVERQT PFVLLFLTID PSRIDVNVHP
AKLEIRFDDE RQVRSMFYPV IKRAVQLHDF STNISVIEPF ASASEPFVGS SSQPIFSSTS
SQAPRMGGGS RRFDLSDAPE RAITKNELYR NYREGAFSSP SVASYDAPSP LQQGGLFALA
SAEESLFGAQ AVHEASENIE AFQLSPLDNI VEHKEVEPKI WQLHNKYLIC QIKTGLMIID
QHVAHERVLY ERALEVMQQN VPNAQQLLFP QKVEFRAWEY EVFEEIRDDL YRLGFNVRLF
GNRTVMIEGV PQDVKSGSEV TILQDMITQY QENATKLKLE RRDNLAKSYS CRNAIMTGQK
LSMEEMRSLI DNLFATREPY TCPHGRPIII KLSLDQLDKM FGRK