Gene Mchl_4110 details


Gene Information

Locus tag: Mchl_4110
Symbol:
ID: 7114417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 4333052
End bp: 4334149
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 68%
IMG OID: 643526827
Product: transcriptional regulator, AraC family
Protein accession: YP_002422835
Protein GI: 218532019
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
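
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4333052
end_bp = 4334149
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted
# in the protein length.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold: the span is 1098 bp, which is 366 codons, or 365 residues plus a stop codon.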


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTCGT CTATTGGATC ATACGCCAAC GGCGGCCCGG ACGGGGTGGC GGCGGACGAG 
AGTTCCGAGG AGGCTGAGCA AGCTCCGTCC GTCGGCTTCA TCCATTCTGA TGTCGTGCGG
GCGAGCCACA CCGTCCTGGT GGAACTCGGC GCCGACCTTG ATGGCCTCAT CGCCGCGGCG
GGCCTTGATC GCCGCCTGTT CGACGCCAGC AGCAAGCCCG TTCCGTTCAC GGCGATCGGA
CGCCTGATCG CCCTGGCCGC CGACCACTCG CGCTGCCCTC ATCTCGGGCT ACTGGTTGGG
AAGCAAACCA CCCTGGCCTC GCTCGGCCTT CTCGGCGTGC TGCTGCGCAA TTGCGCCTCG
GTGGGTAAAG CTCTGCAGGC CCTGGAAGCG CATGCGCGCG TGCGAGACCG CGGCGCGGTG
GTCGGCCTCG GCGTCTACAA CGACATCGTC GTCCTCAGCT ATGCCCCGCA CGAGCCCGAG
GCCGAGGGCG CTGCTCTGCA CTTGGAGAGG GCGCTCGCCA CGCTGACCAA CATCCTGCGG
GGGTTATATG GGCCTGACTG GGCGCCGCTA GGGGTGCTGC TGCCGCGCTC TGCACCGCGC
GATACCGTCC CCTACACCGA GTTCTTCGGG GCTCCTGTCC GCTTCGATCA AGAAGCGGCC
GCCTTGGTGT TCCCGGCCGC GCTCCTGCAG CAGCCGATCG TGGGGGCGGA TCCGGCGCTT
CGCCAGAGGG CAGAGGACCA CATCCGCCGG CTCGAGGCCG ATCAGCCTTC CACGCTGACG
GACAAGCTTC GCGAGTATCT CCAGACCGCC GTGACCCAGC AGCGCTGCAG GGTAGAGCGC
GTAGCGCGCT TGCGACTAGT GAACCGCCGC ACCTTGAGCC GGCACCTGCA GGCGGAGGGC
ACGAGCTTCC GGCGCCTTGC CAACGAGGCG CAGTTCCGGG TGGCGAAGCA GCTTCTCATC
GATACCAGCC TGGCGTTGGG GCAGATTTCG GCTGCCCTCG ACTTCTCCGA GCCCGCCGCC
TTCACGCATG CCTTTCGCCG CTGGTCGGGC GTGACGCCTA GCGCATGGCG GCAGGCGAAC
CGACCCGAAC AGCAATGA
 
Protein sequence
MLSSIGSYAN GGPDGVAADE SSEEAEQAPS VGFIHSDVVR ASHTVLVELG ADLDGLIAAA 
GLDRRLFDAS SKPVPFTAIG RLIALAADHS RCPHLGLLVG KQTTLASLGL LGVLLRNCAS
VGKALQALEA HARVRDRGAV VGLGVYNDIV VLSYAPHEPE AEGAALHLER ALATLTNILR
GLYGPDWAPL GVLLPRSAPR DTVPYTEFFG APVRFDQEAA ALVFPAALLQ QPIVGADPAL
RQRAEDHIRR LEADQPSTLT DKLREYLQTA VTQQRCRVER VARLRLVNRR TLSRHLQAEG
TSFRRLANEA QFRVAKQLLI DTSLALGQIS AALDFSEPAA FTHAFRRWSG VTPSAWRQAN
RPEQQ