Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3375 |
Symbol | |
ID | 4075274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 389009 |
End bp | 390259 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004883 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_611609 |
Protein GI | 99078351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.480932 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTACT CTGCTTTGAG ACTGATCAAG GAAAGCCTGA CCGGACACAG GGGCTGGGGG CCTCAATGGC GCGACCCTGA CCCACAAGCG TCCTATGATT ACGTCATCAT CGGGGGCGGG GGACACGGAT TGGCCACCGC TTACTATCTG GCCAAAGAGT TTCAGGGCCG CCGGATTGCG GTTCTGGAAA AGGGCTGGAT TGGCGGTGGC AACGTCGGGC GCAACACGAC GATCATTCGC TCCAACTACC TTCTGGACGG CAACGAGCCG TTCTACGAGT TCTCGCTGAA GCTTTGGGAA GGGTTGGAGC AGGACCTGAA CTATAATGCC ATGGTAAGCC AGCGTGGCAT TCTCAACCTT GTGCACACCG ATGCCCAGCG CGATGCCGCG CGGCGGCGCG GGAATGCGAT GATCCTGAAC GGATCGGATG CGGAACTCCT CGACACTGAT GGCGTCCGCG CGCTCTATCC GTTCCTGAAT TTCGAAAATG CCCGCTTCCC GATCAAGGGT GGCCTCCTGC ACCGGCGCGG TGGGACCGTG CGGCATGACG CTGTTGCCTG GGGCTACGCC CGCGGCGCAG ATCAGCTGGG CGTGGACATC ATCCAGAACT GCGAAGTCAC CGGCTTCAGG GTGGAAAACG GCCGCGTAAC AGGCGTTGAA ACATCACGCG GACTGATTCG CGCTGCAAAA GTCGGCGTAT CCGTCGCGGG CAGCTCGAGC CGCGTGATGG CGATGGCCGG AATGCGCTTG CCAATCGAAA GCCACGTGCT GCAGGCCTTT GTGTCCGAGG GACTCAAACC CTTCATTCGG GGGGTCATCA CTTATGGCGC GGGACATTTC TATTGCAGCC AATCCGACAA GGGCGGGCTG GTGTTTGGCG GCGATATAGA CGGCTACAAT TCATATGCAC AACGCGGCAA CCTTCCGGTG GTCGAAGATG TCGTCGAAAG CGGCATGTCA CTGATCCCCG GTCTGGGGCG CGCACGGCTG CTGCGCAGTT GGGGCGGCAT CATGGATATG TCCATGGACG GCTCCCCCTT CATCGACAAG ACCCATATCG AAGGCCTCTA TTTCAACGGT GGCTGGTGCT ATGGCGGCTT CAAGGCAACA CCCGCCGCAG GCTTTTGTTT TGCGCATCTC CTGAAGACCG ACCGCCCACA TGAAACCGCC AAAGCCTATC GGCTCGACCG GTTCATGACG GGGCACATGA TCGACGAAAA GGGCCAAGGC GCCCAGCCCA ACCTTCACTA A
|
Protein sequence | MRYSALRLIK ESLTGHRGWG PQWRDPDPQA SYDYVIIGGG GHGLATAYYL AKEFQGRRIA VLEKGWIGGG NVGRNTTIIR SNYLLDGNEP FYEFSLKLWE GLEQDLNYNA MVSQRGILNL VHTDAQRDAA RRRGNAMILN GSDAELLDTD GVRALYPFLN FENARFPIKG GLLHRRGGTV RHDAVAWGYA RGADQLGVDI IQNCEVTGFR VENGRVTGVE TSRGLIRAAK VGVSVAGSSS RVMAMAGMRL PIESHVLQAF VSEGLKPFIR GVITYGAGHF YCSQSDKGGL VFGGDIDGYN SYAQRGNLPV VEDVVESGMS LIPGLGRARL LRSWGGIMDM SMDGSPFIDK THIEGLYFNG GWCYGGFKAT PAAGFCFAHL LKTDRPHETA KAYRLDRFMT GHMIDEKGQG AQPNLH
|
| |