Gene Mchl_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4031 
Symbol 
ID7118036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4242936 
End bp4244189 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content68% 
IMG OID643526750 
Productsarcosine oxidase, beta subunit family 
Protein accessionYP_002422759 
Protein GI218531943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.198896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCT TCTCCCTCAC GTCGCTGCTC TCCGAATCCC TGCGGGGCCA TACCGGCTGG 
GGCCGCCAGT GGCGTGCCCC CGAGCCCAAG CCCCGCTACG ACGTCGTCAT CGTCGGCGGC
GGCGGCCATG GCCTCGCGAC CGCCTATTAT CTCGCCACCG TGCACGGCAT CACCAACGTG
GCGGTGCTGG AGAAGGGATG GATCGGCGGC GGCAATACCG GCCGCAACAC CACGATCATC
CGCTCCAACT ACCTCTACGA CGAGAGCGCG GCGATGTACG AGCACGCGCT CAAGCTCTGG
GAGGGGCTCA GCCAGGAGCT GAACTACAAT ATCATGTTCT CCCAGCGCGG CGTGTTGATG
CTCGCGCACA ACATCCACGA CGTTCAGAGC TTCAAGCGCC ACGTCTACGC CAACCGCCTC
AACGGCATCG ACAACGAGTG GCTCTCGAAG GAAGAGTGCA AGGAATTCTG CCCGCCGCTC
GATATCTCCG GGAGCCTACG CTACCCCGTG CTCGGCGGCG CGCTCCAGCG CCGGGCCGGC
ACCGCCCGCC ACGACGCCGT AGCCTGGGGC TATGCCCGCG GCGCCGACAA CCGCGGCGTC
GATATTATCC AGAACTGCGA GGTCACCGGC ATCCGCCGCG ATGCCTCCGG TGCGGCGGTG
GGCGTCGAGA CGACCCGCGG CTTCATCGGC GCCGGCCGGA TCGGCGTGGT CGCCGCCGGC
CACACCTCGA CGCTGATGTC GATGGCCGGC GTGTCGATGC CGCTGGAGAG CTACCCGCTC
CAGGCTTTGG TCTCCGAGCC GGTCAAGCCG TGCTTCCCCT GCGTGGTGAT GTCGAACGCG
GTCCACGCCT ACCTGTCGCA ATCCGACAAG GGCGAACTGG TGATCGGTGC GGGCACCGAC
CAGTACACCT CCTACAGCCA GCAGGGTGGC CTCCACATCA CCACCCACAC GCTCGACGCG
ATCTGCGAAC TGTTTCCGCA ATTCACCCGG ATGCGGATGC TGCGCTCCTG GGGCGGCATC
GTCGACGTGA CACCGGATCG TTCGCCGATC ATCGGCAAGA CCCCGGTGCC GAACCTGTTC
GTCAATTGCG GCTGGGGCAC TGGCGGCTTC AAGGCGACGC CGGGCTCGGG CCACGTCTTC
GCCCACACGC TCGCGACGGC TGAGCCGCAC GCGATCAACG CGCCCTTCAC CCTCGACCGG
TTCCGCACCG GGCGCCTCAT CGACGAAGCC GCCGCCGCGG CCGTCGCGCA CTGA
 
Protein sequence
MRRFSLTSLL SESLRGHTGW GRQWRAPEPK PRYDVVIVGG GGHGLATAYY LATVHGITNV 
AVLEKGWIGG GNTGRNTTII RSNYLYDESA AMYEHALKLW EGLSQELNYN IMFSQRGVLM
LAHNIHDVQS FKRHVYANRL NGIDNEWLSK EECKEFCPPL DISGSLRYPV LGGALQRRAG
TARHDAVAWG YARGADNRGV DIIQNCEVTG IRRDASGAAV GVETTRGFIG AGRIGVVAAG
HTSTLMSMAG VSMPLESYPL QALVSEPVKP CFPCVVMSNA VHAYLSQSDK GELVIGAGTD
QYTSYSQQGG LHITTHTLDA ICELFPQFTR MRMLRSWGGI VDVTPDRSPI IGKTPVPNLF
VNCGWGTGGF KATPGSGHVF AHTLATAEPH AINAPFTLDR FRTGRLIDEA AAAAVAH