Gene Cpha266_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0172 
Symbol 
ID4568469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp194046 
End bp195920 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content50% 
IMG OID639764772 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_910663 
Protein GI119356019 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA TCGCGCGATT ACCGGACAAT GTCGCCAACA AAATTTCGGC TGGCGAGGTA 
GTGCAACGTC CGGCATCAGT CATCAAGGAA CTTCTCGAAA ACGCCATTGA TGCCTGCGCG
TCAAAAATAA CCGTGACCAT CAAGGATGCC GGCAAAGAAC TGGTGCAGAT AGTTGACAAC
GGAATCGGTA TGAGTCGCCA GGATGCGCTG CTCTCGGTTG AACGCTTTGC AACAAGCAAG
ATTTCAGGCG TTGAAGACCT CGACTCGCTC ATGAGCCTTG GATTCAGAGG CGAAGCGCTG
CCAAGCATCG CTTCGGTTTC ACAGTTCGAA CTGAAAACCA AGCCGGAAGG CGCCCTGCTC
GGCTTCAGGT TTCGCTGCGA CGGAGGAGAA CCGGTTGAAG AATCCGAAGT CAATGCCGAA
AAGGGAACAA CCATCACAGT AAGAAATCTC TTTTACAACG TTCCGGCACG CAGAAAATTC
CTCAAATCAA ACGCAACAGA GTTCCGTCAT ATTTTCGAGT CAGTCAAGTC GCTGGCACTT
GCCTATCCGG AAATCGAATG GAAAATGGTC AGCGATGACG AAGAACTCTT CCACTTCAGA
ACTCCCGACA TTTACGAACG CCTCGATGCT TTTTATGGTG AAAATTTCTC CCTGAGCCTC
ATACCTGTTT CTGAAGAGAA CGATTACCTG TCGATAAGCG GCTTTCTGGG AAAACCGGGC
ATGCAGAAAC GGCAGAAACT CGATCAGTAT ATCTATGTTA ACCGGAGAAT TATTCAAAAC
AGGATGCTCT CACAAGCCTT GCAGCAAGCC TATGGCGAAC TGCTCGTCGA GCGTCAGGCT
CCGTTTGCCC TGCTCTTTCT CGGCATTGAC CCCTCACGTA TTGATGTTAA CGTACACCCG
GCAAAACTTG AAGTCAAATT CGAAGATGAG CGAAGTGTAC GGACCATGTT TTATCCTGTT
ATCAAGCGGA CCATCCAGCT TCACGACTTT TCACCTGATG CCGCTGAAAA AGAACCCTGT
TCGATCAAGG AAGGCACTCT TGATTGTTCA TCAAGAAAAC TCGGGTTTCA GGACATCGCG
GAACCTGCAT CGACAACCAG CACACTCTAT GCAAACTATC GGCAGGGGGC TTTCGGCGAT
ACACCCTTCG AACGACCTGC CTACGCGGAA AAAGAGCCCC GCCCGTCATC CATCAATACA
GGCTTTGAGC GTTTTGAACC AGATCTGCGC GAAGGAGGCG ACCTGTTTTC GACAACACTC
CAGGCAAGAC CTTACGAGGA CGACAACACT CCTGATCCGG GAGAAAACGA CCCCAAAATC
TGGCAACTGC ACAACAAATA CATTATCTGC CAGATCAAGA CAGGAATGAT GATTATCGAC
CAGCACGTAG CCCATGAGCG AGTGCTCTAC GAACGAGCCG TTGATGTCAT GAACCAGAAC
GTACCAAACT CTCAGCAACT GCTCTTCCCC CAGAAAATCG AACTCCGTGC CTGGGAATAT
GAAGTGTTCG AAGAAATTCG GGACGACCTC TATCGGCTTG GATTCAACCT CCGCTCATTC
GGCGCAAAAA CAGTGATGAT CGAAGGAATT CCTCAGGATG TCAGACCCGG AACCGAAGTC
ACCATCCTGC AGGACATGAT TACCGAGTTT CAGGAAAACA GCTCAAAGCT GAAACTCGAA
AGAAGAGAAA ACCTTGCAAG ATCCTACTCC TGCCGCAATG CCATTATGGC CGGTCAGAAA
CTATCGCTTG AAGAGATGCG CTCGTTGATT GACAACCTCT TCGCCACACG GGTACCCTAT
ACCTGTCCGC ACGGCAGACC TGTTATCATA AAGCTCTCGC TCGACCAGCT CGACAGGATG
TTCGGGCGAA AATAA
 
Protein sequence
MAKIARLPDN VANKISAGEV VQRPASVIKE LLENAIDACA SKITVTIKDA GKELVQIVDN 
GIGMSRQDAL LSVERFATSK ISGVEDLDSL MSLGFRGEAL PSIASVSQFE LKTKPEGALL
GFRFRCDGGE PVEESEVNAE KGTTITVRNL FYNVPARRKF LKSNATEFRH IFESVKSLAL
AYPEIEWKMV SDDEELFHFR TPDIYERLDA FYGENFSLSL IPVSEENDYL SISGFLGKPG
MQKRQKLDQY IYVNRRIIQN RMLSQALQQA YGELLVERQA PFALLFLGID PSRIDVNVHP
AKLEVKFEDE RSVRTMFYPV IKRTIQLHDF SPDAAEKEPC SIKEGTLDCS SRKLGFQDIA
EPASTTSTLY ANYRQGAFGD TPFERPAYAE KEPRPSSINT GFERFEPDLR EGGDLFSTTL
QARPYEDDNT PDPGENDPKI WQLHNKYIIC QIKTGMMIID QHVAHERVLY ERAVDVMNQN
VPNSQQLLFP QKIELRAWEY EVFEEIRDDL YRLGFNLRSF GAKTVMIEGI PQDVRPGTEV
TILQDMITEF QENSSKLKLE RRENLARSYS CRNAIMAGQK LSLEEMRSLI DNLFATRVPY
TCPHGRPVII KLSLDQLDRM FGRK