Gene Mchl_3104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_3104 
Symbol 
ID7118382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp3283849 
End bp3285897 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content68% 
IMG OID643525855 
Productexcinuclease ABC, C subunit 
Protein accessionYP_002421870 
Protein GI218531054 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.238612 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCG CGACCCGATC CGCCTTCGAC GACGTCCCGC CCGACGGGTT CGACGAGGAC 
GAGACCGAGG CTGCCTCCCT CGACGAGGCG CCCGAGATCG ACTTCGATTT CGAGCCCGGT
GCCGTCCAGG CGGGCACCGA GATCATCCGC CGATTCTGGT CGACCCTACC GAACTCGCCG
GGCGTGTACC GGATGTTCGA CGCCAAGGGC GACGTGCTCT ACGTCGGCAA GGCCAAGAAC
CTAAAGGCCC GCGTCGGCTC CTACGCCCGC GGCCAGGCGC ATTCCAACCG TATCGCCCGC
ATGATCGCCC AGACGGCGGC GATGGAGTTC GTCACCACGG CGACCGAGAC GGAAGCGCTG
CTGCTGGAGG CCAACCTCAT CAAGCAGTTG AAGCCGCGCT TCAACGTGCT GATGCGCGAC
GACAAGTCGT TCCCCTACAT CCTGCTGACC AGCGACGGGC CGGCGCCGCA GATCGTCAAG
CATCGCGGCG CGCGCCGCCG CAAGGGCAAC TACTACGGCC CCTTCGCCAA TGTCTGGGCG
GTCAACCGCA CGGTGAACGC GCTCCAGCGC GCCTTCCTGA TGCGGACCTG CTCGGACAGC
TATTACGAGA ACCGCACCCG GCCCTGCCTG CTCTACCAGA TCAAGCGCTG CTCCGGCCCC
TGCACCGGCG AAATTGCCCT CGAGGACTAT AGCGCGCTCG CCGACAGCGC CCGCGCCTTC
CTCTCGGGCA AGTCGAACGC GGTGAAGGAC CGGATGCGCG AGGAAATGCA GCGCGCCTCC
GAGGCGCTTG AATTCGAGCG TGCCGCGCGC TTCCGCGACC GTATCGCCGC GCTCTCGGCG
ATCCAGGGCG TGCAGGGGGT GAACACCCAG GGGGTCGAGG AGGCCGACGT GTTCGCCATC
GACGAGCAGG CGGGCCAGTT CTGCATCGAG GTGTTCTTCT TCCGCAACTT CCAGAACTGG
GGCAACCGCG CCTACTTCCC GAAGGCCGAC CGCTCGATGA GCGCGGACGA GGTGCTGGCC
TCGTTCATCT CGCAGTTCTA CGACGACAAG CCTGCCCCGA AACTCGTCCT CGTCAGCCAC
ACGATCGAGG ATGCCGAACT CGTGGCCGCC GCCCTGTCGA GCCGGGTCGA GCACCGCGTC
GAGGTCCACC AGCCGCAGCG GGGCGAGCGC AAGAACCTCG TCGATTACGC GCAGCGCAAC
GCCAAGGAGG CGCTTGGACG CCGGCTCGCC GACACCGCGT CGCAGGGCAA GCTGCTGACG
GCGCTGGGCC AAGCCTTCGG CCTCGACAAG CCGCCGCGCC GTGTCGAGGT CTACGACAAC
TCGCACATTT CCGGCACGGC CGCGGTCGGC GGCATGATCG TCGCCGGGCC GACCGGCTTC
ATGAAGACGC ATTACCGCAC CTTCAACATC AAGTCGGAGG AGCTGAGCCC CGGCGACGAT
TTCGGGATGA TGCGCGAGGT GCTGACCCGC CGCTTCAAGC GGCTGGCCAA GGAGGCGCCG
CGCACCCCGC GGGAGGGTGA TGCCGGCGAA CCGCAGGCCC CGGAAAAGGC CGTTCCAACG
GTGGCCGAGG TGCAAGACGA TCCCGACGCT TTCCCCGCCT GGCCCGACCT CGTTCTGATC
GACGGCGGCG CCGGGCAGTT GGAGGCGGCC CGCGCCTCGC TCGCCGAGAT CGGCGTGACC
GGCGTGCCGT TGGTCGGCAT CGCCAAGGGC CGTGACCGCG ATGCCGGCCG CGAGACGTTC
TTCGTGCCCG GCCGCAGCCC GTTCAAGCTG CCGCCGCGCG ATCCCGTGCT CTATTTCGTG
CAGCGCCTGC GTGACGAGGC CCATCGTTTC GCCATCGGCA CCCATCGGGC ACGGCGCAAG
CGCGAGATGA CGAAAAACCC GCTCGACGAG ATTGCCGGGA TCGGCCCCAC TCGCAAGCGC
GCGCTGCTGC ACCATTTCGG CACGGTGAAA GCCATCGAAC GCGCCGCCTT GGAGGATCTC
GCCAAGGCGC CGGGCGTGAA CGCCGCCACC GCGCGGGCGG TCTACGACTT CTTCCATGCC
AATGCTTGA
 
Protein sequence
MSRATRSAFD DVPPDGFDED ETEAASLDEA PEIDFDFEPG AVQAGTEIIR RFWSTLPNSP 
GVYRMFDAKG DVLYVGKAKN LKARVGSYAR GQAHSNRIAR MIAQTAAMEF VTTATETEAL
LLEANLIKQL KPRFNVLMRD DKSFPYILLT SDGPAPQIVK HRGARRRKGN YYGPFANVWA
VNRTVNALQR AFLMRTCSDS YYENRTRPCL LYQIKRCSGP CTGEIALEDY SALADSARAF
LSGKSNAVKD RMREEMQRAS EALEFERAAR FRDRIAALSA IQGVQGVNTQ GVEEADVFAI
DEQAGQFCIE VFFFRNFQNW GNRAYFPKAD RSMSADEVLA SFISQFYDDK PAPKLVLVSH
TIEDAELVAA ALSSRVEHRV EVHQPQRGER KNLVDYAQRN AKEALGRRLA DTASQGKLLT
ALGQAFGLDK PPRRVEVYDN SHISGTAAVG GMIVAGPTGF MKTHYRTFNI KSEELSPGDD
FGMMREVLTR RFKRLAKEAP RTPREGDAGE PQAPEKAVPT VAEVQDDPDA FPAWPDLVLI
DGGAGQLEAA RASLAEIGVT GVPLVGIAKG RDRDAGRETF FVPGRSPFKL PPRDPVLYFV
QRLRDEAHRF AIGTHRARRK REMTKNPLDE IAGIGPTRKR ALLHHFGTVK AIERAALEDL
AKAPGVNAAT ARAVYDFFHA NA