Gene Mchl_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_0078 
Symbol 
ID7114002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp74139 
End bp75338 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content65% 
IMG OID643522892 
Producthypothetical protein 
Protein accessionYP_002418962 
Protein GI218528146 
COG category[R] General function prediction only 
COG ID[COG4469] Competence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.698048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCATCC ATACGGTAGG TCACATGGTT GAAGGTGCCG TCCGGGCCTA CACGCGGGAG 
GGGCTGCTCT TCGGTGAGCG ACCAAATGGC AGCATCGTCC ATATCTCCGA GGTGCCCTCC
GGTCTTGCCT GCGACTGCCG ATGTCCGAAC TGCGGGACAC CGCTGGTTGC TCGTAGAGGC
GAGCAGCTGG GCCATCACTT CGGACACCAT AATACCGTGG GCGAGCGCGC CTGCGCGGGG
GGACCGGAGA CCGCGCTGCA CCGCTTTGCC AAGGAGTTGC TCGCAGCCAA ACTCGCGCTC
GTTCTCCCGC CCCTGTACCG GAATGGTGAG GGAAAGGCTC GCTACGCTGG CGGCTTCCAT
CGGTTCGACG CTGCGCTCCT CGAACACCGG CTCGGTGCCA TTGTTCCGGA CGTGATTGTC
CGCCGGGCGG ACCGGGATCT GCTTGTCGAG TTCCACGTCA CCCACGCCTG TGACGCTACC
AAGATCGCCA AGATCGCGAG CCTTGGCACA GCTGCGATCG AGGTTGATCT TTCCGGTTTG
GCACTGAACG CACCACGGGC TGAGCTTGAG GCGGCCATCC TCGAACGTGC TCCCCGCCGT
TGGCTGCACA ACCCGAAGCT CGGCTGGGTT GGGGGTATTC ACGGCCCGGT GACGACCAGG
ATCACGCCAG CGTCGTCTCG ATCCCTCACT GCTCTGGAGA AAGCCTACGC GTCGGCCTAC
CGCGAGGCAC TCTCGACTCC GAGCCGTAGC CTCGCTCGGC ATCGCGTTGA GGCTGACGGC
CTCGTACGCA CGATCGGCGT CGAGGTTGCC GGGATCGGGT GCTTCACGGC CTCACCTCGT
GATTGGCAGG CAGTCATCCT CTTGAACGCG CTTGAAGGCG CCCTGGTTGG CCGCAGCAGC
ATCGTGAGTG CCAAGGCGGC TCTGCAGCAG ATCCGCGAGC GCGGCTGGCT CCGGCCCCGC
TTTAGCCGCC TCCCACCAGC AGAGGCGAAG GCGCTATCCG CGGCACTGCC CTCGTATGCC
TCCCCTGCCG ATGCAATCAC AGCCTGGGCA ATGACACTGT CTCGGGAAGG CATCCTTGTC
CCGAGCAGTG CGCGCGGTCA GTGGGTGATC CGGCGCGAGA CGTTGCAGCG CGTTCGCGAA
GCACGACAAC AAAAGGAAGC TCGGCCGAGT AGCAGGTCCG GCCCAGCCGA TCCCACTTAA
 
Protein sequence
MSIHTVGHMV EGAVRAYTRE GLLFGERPNG SIVHISEVPS GLACDCRCPN CGTPLVARRG 
EQLGHHFGHH NTVGERACAG GPETALHRFA KELLAAKLAL VLPPLYRNGE GKARYAGGFH
RFDAALLEHR LGAIVPDVIV RRADRDLLVE FHVTHACDAT KIAKIASLGT AAIEVDLSGL
ALNAPRAELE AAILERAPRR WLHNPKLGWV GGIHGPVTTR ITPASSRSLT ALEKAYASAY
REALSTPSRS LARHRVEADG LVRTIGVEVA GIGCFTASPR DWQAVILLNA LEGALVGRSS
IVSAKAALQQ IRERGWLRPR FSRLPPAEAK ALSAALPSYA SPADAITAWA MTLSREGILV
PSSARGQWVI RRETLQRVRE ARQQKEARPS SRSGPADPT