Gene Mext_1654 details


Gene Information

Locus tag: Mext_1654
Symbol:
ID: 5832812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1849256
End bp: 1850506
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 69%
IMG OID: 641367452
Product: sarcosine oxidase beta subunit family protein
Protein accession: YP_001639124
Protein GI: 163851081
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form
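
The start, end, and length values listed above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1849256
end_bp = 1850506

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1251 416
```

Under these assumptions the computed values match the listed Gene Length (1251 bp) and Protein Length (416 aa).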


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.502295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTACT CCCTGTTCAG CGTCCTGGGG CAGGCCCTGC GCTGCCAGCG GGATTGGACC 
CCGCAATGGC GCGACGCCGC CCCGCAGGAG GCCTACGACG TGGTGATCGT CGGCGGTGGC
GGGCACGGGC TCGCCACCGC CTACTACCTC GCCAAGGAGC ACGGCCTCAC CAACGTCGCC
GTGCTGGAGA AGAGCCACAT CGGCTCCGGC AATGTGGGGC GCAACACCAC CATCGTGCGC
TCGAACTACG GCCTGCCGGG CAACATCCCG TTCTACGAGC GCTCGATGAA GCTCTGGGAA
GGGCTGGAGC AGGACATCAA CTACAACGCC ATGGTCAGCC AGCGCGGCGT GCTGAACCTC
TACCATTCGG ACGCGCAGCG CGACGCCTAT GCCCGCCGCG GCAACGCCAT GCGGCTCGCC
GGTATCGATG CGGAACTGCT CGACCGCGAG GGCGTGCGCC GGCTGGTCCC GTTCATCGAT
TTCGACAACG CGCGCTTCCC CGTAAAGGGC GGCCTGCTCC AGCGCCGCGG CGGCACCGTG
CGCCACGACG CGGTCGCCTG GGGCTATGCC CGAGCCGCCA GCGACCGCGG CGTCGACATC
GTCCAGAACT GCGCCGTCAC CGGCATCCGC CGCGAGAACG GCCGCGTCAC CGGCGTCGAG
ACCAGCCGCG GCTTCATACG GGCGGGGAAG GTCGCGCTAT CGGTCGCCGG CTCGTCGTCG
CTGCTCGCCG GCATGGTCGA TATGCGCCTG CCGATCGAGA GCCACGTGCT CCAGGCCTTC
GTCAGCGAGG GCGTGAAGCC CCTGATCGAC GGCGTGATGA CCTTCGGCGC CGGCCATTTC
TACGTCAGCC AGTCGGACAA GGGCGGCCTC GTCTTCGGCG GCGATATCGA CGGCTACAAT
TCCTATGCGA GCCGCGGCAA TCTCCACACC ATCGAGGATG TGATGGAGGG CGGCATGGCC
CTCTGGCCGG GGCTCGGCCG CCTGCGGCTG CTGCGCCACT GGGGCGGCAT CATGGACATG
TCGATGGACG GCTCGCCCAT CATCGACCGC ACGGATATCG GCGGCCTCTA TCTCAACGCC
GGCTGGTGCT ACGGCGGCTT CAAGGCGACG CCCGCCGCTG GCTTCTGCTT CGCCCACCTG
ATCGCCCGCG ACGAACCGCA CGCGGATGCG CGCGCCTACC GCCTTGACCG CTTCGCCACC
GGCCGTCTCA TCGACGAGAA GGGCATGGGC GCCCAGCCCA ACCTGCATTG A
 
Protein sequence
MRYSLFSVLG QALRCQRDWT PQWRDAAPQE AYDVVIVGGG GHGLATAYYL AKEHGLTNVA 
VLEKSHIGSG NVGRNTTIVR SNYGLPGNIP FYERSMKLWE GLEQDINYNA MVSQRGVLNL
YHSDAQRDAY ARRGNAMRLA GIDAELLDRE GVRRLVPFID FDNARFPVKG GLLQRRGGTV
RHDAVAWGYA RAASDRGVDI VQNCAVTGIR RENGRVTGVE TSRGFIRAGK VALSVAGSSS
LLAGMVDMRL PIESHVLQAF VSEGVKPLID GVMTFGAGHF YVSQSDKGGL VFGGDIDGYN
SYASRGNLHT IEDVMEGGMA LWPGLGRLRL LRHWGGIMDM SMDGSPIIDR TDIGGLYLNA
GWCYGGFKAT PAAGFCFAHL IARDEPHADA RAYRLDRFAT GRLIDEKGMG AQPNLH
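
The sequences above are printed in blocks of ten characters, sixty per line. The following is a minimal sketch of how such a block could be turned into a contiguous string and its length and GC fraction computed. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the sample string is only the first line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGCGCTACT CCCTGTTCAG CGTCCTGGGG CAGGCCCTGC GCTGCCAGCG GGATTGGACC"
seq = clean_sequence(sample)

print(len(seq))                    # number of bases in the sample line
print(round(gc_fraction(seq), 2))  # GC fraction of the sample line only
```

Applying the same two functions to the full gene sequence gives values that can be compared with the Gene Length and GC content fields of the record.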