Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1654 |
Symbol | |
ID | 5832812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1849256 |
End bp | 1850506 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367452 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_001639124 |
Protein GI | 163851081 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.502295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACT CCCTGTTCAG CGTCCTGGGG CAGGCCCTGC GCTGCCAGCG GGATTGGACC CCGCAATGGC GCGACGCCGC CCCGCAGGAG GCCTACGACG TGGTGATCGT CGGCGGTGGC GGGCACGGGC TCGCCACCGC CTACTACCTC GCCAAGGAGC ACGGCCTCAC CAACGTCGCC GTGCTGGAGA AGAGCCACAT CGGCTCCGGC AATGTGGGGC GCAACACCAC CATCGTGCGC TCGAACTACG GCCTGCCGGG CAACATCCCG TTCTACGAGC GCTCGATGAA GCTCTGGGAA GGGCTGGAGC AGGACATCAA CTACAACGCC ATGGTCAGCC AGCGCGGCGT GCTGAACCTC TACCATTCGG ACGCGCAGCG CGACGCCTAT GCCCGCCGCG GCAACGCCAT GCGGCTCGCC GGTATCGATG CGGAACTGCT CGACCGCGAG GGCGTGCGCC GGCTGGTCCC GTTCATCGAT TTCGACAACG CGCGCTTCCC CGTAAAGGGC GGCCTGCTCC AGCGCCGCGG CGGCACCGTG CGCCACGACG CGGTCGCCTG GGGCTATGCC CGAGCCGCCA GCGACCGCGG CGTCGACATC GTCCAGAACT GCGCCGTCAC CGGCATCCGC CGCGAGAACG GCCGCGTCAC CGGCGTCGAG ACCAGCCGCG GCTTCATACG GGCGGGGAAG GTCGCGCTAT CGGTCGCCGG CTCGTCGTCG CTGCTCGCCG GCATGGTCGA TATGCGCCTG CCGATCGAGA GCCACGTGCT CCAGGCCTTC GTCAGCGAGG GCGTGAAGCC CCTGATCGAC GGCGTGATGA CCTTCGGCGC CGGCCATTTC TACGTCAGCC AGTCGGACAA GGGCGGCCTC GTCTTCGGCG GCGATATCGA CGGCTACAAT TCCTATGCGA GCCGCGGCAA TCTCCACACC ATCGAGGATG TGATGGAGGG CGGCATGGCC CTCTGGCCGG GGCTCGGCCG CCTGCGGCTG CTGCGCCACT GGGGCGGCAT CATGGACATG TCGATGGACG GCTCGCCCAT CATCGACCGC ACGGATATCG GCGGCCTCTA TCTCAACGCC GGCTGGTGCT ACGGCGGCTT CAAGGCGACG CCCGCCGCTG GCTTCTGCTT CGCCCACCTG ATCGCCCGCG ACGAACCGCA CGCGGATGCG CGCGCCTACC GCCTTGACCG CTTCGCCACC GGCCGTCTCA TCGACGAGAA GGGCATGGGC GCCCAGCCCA ACCTGCATTG A
|
Protein sequence | MRYSLFSVLG QALRCQRDWT PQWRDAAPQE AYDVVIVGGG GHGLATAYYL AKEHGLTNVA VLEKSHIGSG NVGRNTTIVR SNYGLPGNIP FYERSMKLWE GLEQDINYNA MVSQRGVLNL YHSDAQRDAY ARRGNAMRLA GIDAELLDRE GVRRLVPFID FDNARFPVKG GLLQRRGGTV RHDAVAWGYA RAASDRGVDI VQNCAVTGIR RENGRVTGVE TSRGFIRAGK VALSVAGSSS LLAGMVDMRL PIESHVLQAF VSEGVKPLID GVMTFGAGHF YVSQSDKGGL VFGGDIDGYN SYASRGNLHT IEDVMEGGMA LWPGLGRLRL LRHWGGIMDM SMDGSPIIDR TDIGGLYLNA GWCYGGFKAT PAAGFCFAHL IARDEPHADA RAYRLDRFAT GRLIDEKGMG AQPNLH
|
| |