Gene Mext_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3736 
Symbol 
ID5833317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4138569 
End bp4139822 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content68% 
IMG OID641369526 
Productsarcosine oxidase beta subunit family protein 
Protein accessionYP_001641181 
Protein GI163853138 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.42969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.377029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCT TCTCCCTCAC GTCGCTGCTC TCCGAATCCC TGCGGGGCCA TACCGGCTGG 
GGCCGCCAGT GGCGTGCCCC CGAGCCCAAG CCCCGCTACG ACGTCGTCAT CGTCGGCGGC
GGCGGCCATG GCCTCGCAAC CGCCTATTAT CTCGCCACCG TGCACGGCAT CACCAACGTG
GCGGTGTTGG AGAAGGGCTG GATCGGCGGC GGCAATACCG GCCGCAACAC CACGATCATC
CGCTCCAACT ACCTCTACGA CGAGAGCGCG GCGATGTACG AGCACGCGCT CAAGCTCTGG
GAGGGGCTCA GCCAGGAGCT GAACTACAAC ATCATGTTCT CCCAGCGCGG CGTGTTGATG
CTCGCGCACA ACATCCACGA CGTTCAGAGC TTCAAGCGCC ACGTCTACGC CAACCGCCTC
AACGGCATCG ACAACGAGTG GCTCTCGAAG GAAGAGTGCA AGGAATTCTG CCCGCCGCTC
GATATCTCGG GGAGCCTGCG CTACCCCGTG CTCGGCGGCG CGCTCCAGCG CCGGGCCGGC
ACCGCCCGCC ACGACGCGGT GGCCTGGGGC TATGCCCGCG GCGCCGACAA CCGCGGCGTC
GATATCATCC AGAACTGTGA GGTCACCGGC ATCCGCCGCG ACGCCTCCGG CGCGGCGGTG
GGCGTCGAGA CCACCCGCGG CTTCATCGGC GCCGGCCGGA TCGGCGTGGT CGCCGCCGGC
CACACCTCGA CGCTGATGGC GATGGCCGGC GTATCGATGC CGCTGGAGAG CTACCCGCTT
CAGGCTTTGG TCTCCGAGCC GGTCAAGCCG TGCTTCCCCT GCGTGGTGAT GTCGAACGCG
GTCCATGCCT ACCTGTCGCA ATCCGACAAG GGCGAACTGG TGATCGGTGC GGGCACCGAC
CAGTACACCT CCTACAGCCA GCAGGGTGGC CTTCACATCA CCACCCATAC GCTCGACGCG
ATCTGCGAAC TGTTTCCGCA ATTCACCCGG ATGCGGATGC TGCGCTCCTG GGGTGGCATC
GTCGACGTGA CGCCGGATCG CTCGCCGATC ATCGGCAAGA CCCCGGTGCC GAACCTGTTC
GTCAATTGCG GCTGGGGCAC CGGCGGCTTC AAGGCGACGC CGGGCTCGGG CCACGTCTTC
GCCCACACGC TCGCGACCGG CACGCCGCAC GCGATCAACG CGCCCTTCAC CCTCGACCGG
TTCCGCACCG GGCGCCTCAT CGACGAGGCC GCCGCCGCGG CCGTCGCGCA CTGA
 
Protein sequence
MRRFSLTSLL SESLRGHTGW GRQWRAPEPK PRYDVVIVGG GGHGLATAYY LATVHGITNV 
AVLEKGWIGG GNTGRNTTII RSNYLYDESA AMYEHALKLW EGLSQELNYN IMFSQRGVLM
LAHNIHDVQS FKRHVYANRL NGIDNEWLSK EECKEFCPPL DISGSLRYPV LGGALQRRAG
TARHDAVAWG YARGADNRGV DIIQNCEVTG IRRDASGAAV GVETTRGFIG AGRIGVVAAG
HTSTLMAMAG VSMPLESYPL QALVSEPVKP CFPCVVMSNA VHAYLSQSDK GELVIGAGTD
QYTSYSQQGG LHITTHTLDA ICELFPQFTR MRMLRSWGGI VDVTPDRSPI IGKTPVPNLF
VNCGWGTGGF KATPGSGHVF AHTLATGTPH AINAPFTLDR FRTGRLIDEA AAAAVAH