Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0969 |
Symbol | |
ID | 4077265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1035985 |
End bp | 1037280 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006272 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_612964 |
Protein GI | 99080810 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATCCC GGTCATATCT TGAGGCCAGA CACGAAGCAT TGGATTCGCT CATGAAACGC TATTCAGCCT TTGCAGTGGC GCGGGAAGGC CTTCGCTACC ACAGCGGATG GGAACGCGCC TGGCGCTCCC CGGAACCCAA ACGTCACTAT GACGTCATCA TCGTAGGCGC GGGTGGCCAT GGGCTCGCTA CGGCCTATTA CCTTGGGAAA AACTTTGGCA TCACCAATGT TGCGGTGATC GAAAAGGGTT GGCTGGGGGG CGGCAATACG GGCCGCAACA CCACCATCAT CCGTTCGAAC TACCTGCAGG ATCCCTCTGC CGCGATCTAC GAGAAATCGC GCAGCCTCTA TGAGGATCTG TCGCAGGACT TCAACTACAA CATCATGTTC AGCCCGCGCG GCGTGATCAT GCTGGCGCAG ACCGAGCACG AGGTGCGTGG TTACAAGCGC ACCGCCCATG CCAATGCGCT CCAGGGCGTG TCGACCGAAT GGATCGAACC CGCCCGCGTG AAGGAACTGG TGCCGATCAT CAACCTCGAA GGTCCGCGCT ATCCGGTCCT TGGCGGGCTC TGGCAAGCGC GTGGCGGTAC CGCCCGTCAC GATGCGGTGG CCTGGGGCTA TGCGCGGGCC TGCTCGGCGA TGGGCATGGA CATCATCCAG AAATGCGAAG TCACCAATGT TCGGACTGAA AACGGCCGCG TGGTAGGTGT CGACACCACC AAAGGGGCGA TCGACTGCGA CAAGCTGGGC ATGGTGGTTG CGGGCAACTG TTCGGTGCTG TCTGAAATGG CGGGCTTCCG TCTGCCGGTG GAATCGGTGG CGCTGCAGGC GCTGGTCTCC GAGCCGATCA AACCCTGCAT GGACGTGGTC GTGATGGCCA ACACCGTGCA TGGCTACATG TCGCAATCCG ACAAGGGCGA GATGGTCATT GGTGGCGGCA CCGACGGCTA CAACAACTAC ACCCAGCGCG GTTCTTTCCA CCACATCGAG GAAACCGTGC GCGCCCTCAA CGAGACTTTC CCGATGGTGT CGCGCCTCAA GATGCTGCGC CAATGGGGTG GGATCGTGGA TGTAACCGGC GACCGCTCGC CGCTGATTTC CAAAACGCCG GTTCAGAACT GTTTTGTCAA CGCTGGCTGG GGCACCGGCG GCTTCAAGGC GATCCCCGGC TCGGGCTGGG CGATGGCGGA ACTGATGGCG ACAGGGCATT CCAACCTCGC GGAAGAGTTC TCCATGATGC GCTTCAAAGA AGGCAAATTC ATCGACGAGA GCGTCGCAGC AGGGGTGGCA CACTGA
|
Protein sequence | MRSRSYLEAR HEALDSLMKR YSAFAVAREG LRYHSGWERA WRSPEPKRHY DVIIVGAGGH GLATAYYLGK NFGITNVAVI EKGWLGGGNT GRNTTIIRSN YLQDPSAAIY EKSRSLYEDL SQDFNYNIMF SPRGVIMLAQ TEHEVRGYKR TAHANALQGV STEWIEPARV KELVPIINLE GPRYPVLGGL WQARGGTARH DAVAWGYARA CSAMGMDIIQ KCEVTNVRTE NGRVVGVDTT KGAIDCDKLG MVVAGNCSVL SEMAGFRLPV ESVALQALVS EPIKPCMDVV VMANTVHGYM SQSDKGEMVI GGGTDGYNNY TQRGSFHHIE ETVRALNETF PMVSRLKMLR QWGGIVDVTG DRSPLISKTP VQNCFVNAGW GTGGFKAIPG SGWAMAELMA TGHSNLAEEF SMMRFKEGKF IDESVAAGVA H
|
| |