Gene Mchl_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1971 
Symbol 
ID7116787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2043229 
End bp2044653 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content50% 
IMG OID643524724 
Producthypothetical protein 
Protein accessionYP_002420750 
Protein GI218529934 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.985028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTGTT CATGGTACGC CGGGAAAGGC CTGGAATCGA TTGCCTCCTT TTTTGCAATC 
ATGTTTCTCT CGTTTTCTGC CAATGCGCTT GCACCGGGAA AACAAACCCT CGAAGCGCTC
GCCCGCAACG CGCGCTCCGA TCAGCCCTAT GCTGTGGTTA TATCGCCCAA CTCAAGGGAG
AATGTTCGTA ACAGGCAGCT AGAAGCTGAT AGCTATACTC AGCGCCTTCG AGAACTTGGT
TTCGCAGTCA CCACAATTGG ACCATCCCGC AGATTCGATG CAGATGCCGC AATCCGCGAT
CTTGCCAATA TTCCGAGGGG GTCCAACGTA GCTGTTGTTG TTCCTGCGCC CGCCTATGCC
GATGCGGATG ACATATTTAT CCTTGCTCAA GATAGTGCGG AAACCGCAAC AAATGATACA
ACATCTATGG CTAGCGAGGC GCTTTCTCTG AGTTTTCTAT CCCGTGCAAT TAGTAAGAGC
AGACCGACGC AATTTATTGT GTTGATACCT CATTGTCGCC GCGTGGACAA TCCACAAGCA
TTGTGTCCAG CAGAGTCCTT GGCTCGGTCA GGTGGTGCAA GTGTTATCGC TGCAAATAAT
TCGAACCTTG AGACGGACTG GGAGGGGATT TCTTACGACA AGATACTGCC GCTAATGACA
CAAGAGGGGC TTACCTACGC CGCTTTATAC AACCGAATTA GCGCGGCAAC CACAGGCGCG
GGCATCACGA TGAGCCGGTC GCTCAATTTA TCAACTGAAT TCATGTTTGC GCCACTGAAT
TTTTTTCAAA ATATCAGTAC GCCATGCAAT AGTAATCGCA CCGGCGCGAT ATCCCTTACA
TTAGCAAGGG CAAGAGTGTC GGCATGTGAA ACCGCTGTAG CGACATGGCC GTATGCAAGG
GAATTTGTGC AAGCGCATGA ATTTGCATTG GAGCAACTTG CTTTTGCTGA GACGGAGACA
TTCTGCGGTC CGTTACTTCA ATCATCATTG GCAGTCTATC GAGAACGATA TCCGGCGGGG
TCATTTATTT CTCAGATTGA ACGGCGGCTT ACCGATTGCG AAAAACGCCG AGTGGAAAAG
GGGCGGCAAA AGGATCGCGA GGCTGAGAAC GTGCGTAGGC AAGATCGAAA ACAGAATCAA
GCAGATCAAC CTCAACTCGA CAGTCGAACA TCGTCCAACG TTGGCAGTTG GTTTGTTATC
ATGGGTTCGT ATCCTTCGGC CGAGCGTTTC AAAGCAGTGG CAAAGCAGAA TTGGCTGGAC
GCGCAAGGCA TAAACGCTCA GCTCATCGCC ACAAATAACT ATCCAGGCCT AACATCCGGA
TTAACTATTG TTAGCCAAGG GCCCTACTCG AAGGATGTAG CACAGCGGCG ATTAAACCAA
GTAAAATCGG TCGCTCGCGA TGCTTATATC AAGTCTGCAT ATTAA
 
Protein sequence
MLCSWYAGKG LESIASFFAI MFLSFSANAL APGKQTLEAL ARNARSDQPY AVVISPNSRE 
NVRNRQLEAD SYTQRLRELG FAVTTIGPSR RFDADAAIRD LANIPRGSNV AVVVPAPAYA
DADDIFILAQ DSAETATNDT TSMASEALSL SFLSRAISKS RPTQFIVLIP HCRRVDNPQA
LCPAESLARS GGASVIAANN SNLETDWEGI SYDKILPLMT QEGLTYAALY NRISAATTGA
GITMSRSLNL STEFMFAPLN FFQNISTPCN SNRTGAISLT LARARVSACE TAVATWPYAR
EFVQAHEFAL EQLAFAETET FCGPLLQSSL AVYRERYPAG SFISQIERRL TDCEKRRVEK
GRQKDREAEN VRRQDRKQNQ ADQPQLDSRT SSNVGSWFVI MGSYPSAERF KAVAKQNWLD
AQGINAQLIA TNNYPGLTSG LTIVSQGPYS KDVAQRRLNQ VKSVARDAYI KSAY