Gene Information |
Locus tag | Gobs_2881 |
Symbol | |
ID | 8754553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 3007666 |
End bp | 3010533 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_003409879 |
Protein GI | 284991325 |
COG category | |
COG ID | |
TIGRFAM ID | |
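The coordinates and lengths listed above are consistent with each other, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 3007666
end_bp = 3010533
gene_length_bp = 2868
protein_length_aa = 955


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

With the values in this record, the span works out to 2868 bp, and 2868 / 3 = 956 codons, one of which is the stop codon, leaving 955 residues.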
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAGCC CGTTCCGCAC GCCCCAGGGC GGCCGCATCG ACCGCGCCAC CACCGTCGGG TTCACCTTCG ACGGGCAGAC CTTCCCCGGT CACCCCGGCG ACACCCTGGC CTCGGCGCTG CTCGCCAACG GCCGGCACCA GGTCGCCACG AGCATCAAGC TCGGCCGACC GCGCGGCATC GCCGCCGCCT GGGCCGAGGA CCCGTGCGGC CTGGTGCAGA TCGAGGAGCC TTTCCCCGAG CCCATGCTGC TGGCGACCAC CGTCGAGCTG TACGACGGAC TGGCCGCCCA CGGCCTGCCG GGGCAAGGCC GCCTCGCCGA CGTCCCCGAC TCGGCCCGCT ACGACGCGGT CCACCACCAC GTGGACGTGC TCGTCGTGGG GGCCGGCCCC GCGGGGCTGG CCGCCGCGCT CACCGCCGCC CGGGCCGGTG CCCGCGTCGC GCTCGTCGAC GAGCAGTCCG AGGCCGGCGG CTCCCTGCTC TCGGGCACCG AGCGGCTCGA CGACGCCCCG GCGCTGCAGT GGGTCGCCGC GGCGGTCGCC GAGCTGGCCG GCTCCCCGGA GGTGCTGCAC CTGCAGCGCA CCACCGCGTT CGGCAGCTAC GACGACGGTT TCGTCCTGGC CCTGGAGCGG CGCACCGACC ACCTCGGCGC CGCCGCGCCC AAGCACGTCT CCCGCCAGCG GGTCTGGCGC ATCCGGGCCC GCTCGATCGT CGTCGCGACG GGGGCGCACG AGCGTCCGGT CGTCTTCGCC GACAACGACC GCCCGGGGAC CATGCTGGCC GGCGCCGCCC GGACGTTCCT GCACCGCTAC GGCGTGCTGC CCGGCCGCGA GGCCGTCGTG TTCACCACGA ACGACAGCGC CTACGACGCC GCCCTCGACC TGCACCGCGC GGGTGTGCAG GTGCAGGCGG TGGTCGACGC CCGGCCGGAG GGCTCGCCGC GCCGCGAGGA GTGCGAGCAC GCGGGCATCC GCGTCCTGTC CGGTGCCGTC GTGACCGGCA CGCAGGGCGT CGGGCGGGTC ACGCACGCCC TCGTCGCCCC GTTCGCGGAC GGCGAGGTCG GCGCCAGCAG CGCGATCGCC TGCGACCTGC TCCTGGTCAG CGGGGGCTGG AACCCCGCGG TGCACCTGTT CAGCCAGGCC CGCGGCCGGC TCCGCTACGA CGAGGCCCTC GGTGCGTTCC TCCCCGGGGA GGCGCTCGAC GGCCTGACCG TCACCGGCTC GGCCGCCGGC GTGTTCGACC TGGCCGGGTG CCTGGCCGAC GGGCAGCGGG TGGCGCGCGC CGCGCTGACG GCGCTGACGA TCCCGCCGGC GGGGGAGGAC CGGCTGCCCG CCACCCCGGA CCCCGTCGTC CCGGCCGCGC CGCTGGTGCT GTGGCGGGTG CCGGACACCT CCGGCGCCGA CGGCAGCACC TCGTTCGTCG ACCTGCAGCG CGACGCGACG GTCGCTGACA TCGCCCGCGC GGTCGGCGCC GGGCTGCGCT CCATCGAGCA CGTCAAGCGC TACACGACGA TCGGGACGGC GCACGACCAG GGCAAGACCT CCGGCGTCCT CACCTCGGGG ATCACCGCCG AGCTGCTCGG GATCCCGGTG CAGGACACCG GGACCACCAC GTTCCGGCCG CCCTACACCC CCGTCGCCTT CGCCGCCCTG GCCGGCCGCG ACCGCGGCCG GCTCTTCGAC CCCGAGCGGG TCACGGCCCT GCACGAGTGG CACGGAGCGG CGGGCGCGGT CTTCGAGGAC GTCGGTCAGT GGAAGCGTCC CCGCTACTAC CCGCAGCCCG GCGAGGACAT GGAGACCGCG 
GTCCTGCGGG AGTGCGCCGC GGCCCGGACC GGCGTCGGGA TCCTCGACGG CTCCACCCTC GGCAAGATCG ACGTCCAGGG CCCGGACGCC GCCGTCCTGC TCGACCGGCT CTACACGAAC CTGATGAGCA GCCTGAAGGT CGGCTCCGTC CGCTACGGGG TCATGTGCGG CGTCGACGGC ATGGTCATCG ACGACGGCAC CGTGCTGCGC CTGGCCGAGG ACCGCTTCCT CGTCCTCACC ACCACCGGCG GCGCGGCGAA GATCCTCGAC TGGATGGAGG AGTGGGCCCA GACGGAGTGG CCGGACCTGC GGGTCCACTG CACGTCGGTC ACCGAGCAGT GGGTCACCTT CCCCGTCGTG GGGCCGCGGT CCCGCGACGT CGTCGGCGCG GTCTTCCCCC ACGTGGACGT CTCCGCCGAG GCCTTCCCGT TCATGACCTG GCGGGACACC ACCCTCGACG GGGTGCCGGT CCGGCTGGCC CGGATCAGCT TCTCCGGCGA GCTCGCGTAC GAGGTCTACG TCAACCCCTG GTACGCGGTC GCGGTCTGGC AGCGGCTGCT CGACGCCGGC CGCCCGTACG GCATCACGCC GTACGGCACG GAGACCATGC ACGTCCTGCG CGCGGAGAAG GGTTACCCGA TCATCGGGCA GGACACCGAC GGCACCGTCA CCCCGCACGA CCTCGGTATG GCGTGGGCGG TCTCGAAGAA GAAGCCCGAC TTCGTCGGCA AGCGCTCCTT CGCCCGCCCG GCCAACGCCG ACCCGCTGCG CAAGCAGCTG GTCGGGCTGC TGCCGGTGGA CCGGCAGACC GTGCTGCCGG AGGGCTCCCA GATCATCGAC TTCCTCGCCG ACGGCCAGCT GCCGCCCCCG CCGGTCCCGA TGCTCGGCCA CGTCACCTCC AGCTACCGCA GCGCCGAGCT CGCCCGCCCC TTCGCCCTGG CCCTGGTCAA GGGCGGCCGG GAGCGCATCG GTGACACCGT CCACGTCCCC GTCAACGGCA CCCTCGTCCC GGTCGAAGTC ACCGGCTCGG TGCTGGTCGA CCCCGAAGGA GCCCGTCGCG ATGGCTGA
|
Protein sequence | MTSPFRTPQG GRIDRATTVG FTFDGQTFPG HPGDTLASAL LANGRHQVAT SIKLGRPRGI AAAWAEDPCG LVQIEEPFPE PMLLATTVEL YDGLAAHGLP GQGRLADVPD SARYDAVHHH VDVLVVGAGP AGLAAALTAA RAGARVALVD EQSEAGGSLL SGTERLDDAP ALQWVAAAVA ELAGSPEVLH LQRTTAFGSY DDGFVLALER RTDHLGAAAP KHVSRQRVWR IRARSIVVAT GAHERPVVFA DNDRPGTMLA GAARTFLHRY GVLPGREAVV FTTNDSAYDA ALDLHRAGVQ VQAVVDARPE GSPRREECEH AGIRVLSGAV VTGTQGVGRV THALVAPFAD GEVGASSAIA CDLLLVSGGW NPAVHLFSQA RGRLRYDEAL GAFLPGEALD GLTVTGSAAG VFDLAGCLAD GQRVARAALT ALTIPPAGED RLPATPDPVV PAAPLVLWRV PDTSGADGST SFVDLQRDAT VADIARAVGA GLRSIEHVKR YTTIGTAHDQ GKTSGVLTSG ITAELLGIPV QDTGTTTFRP PYTPVAFAAL AGRDRGRLFD PERVTALHEW HGAAGAVFED VGQWKRPRYY PQPGEDMETA VLRECAAART GVGILDGSTL GKIDVQGPDA AVLLDRLYTN LMSSLKVGSV RYGVMCGVDG MVIDDGTVLR LAEDRFLVLT TTGGAAKILD WMEEWAQTEW PDLRVHCTSV TEQWVTFPVV GPRSRDVVGA VFPHVDVSAE AFPFMTWRDT TLDGVPVRLA RISFSGELAY EVYVNPWYAV AVWQRLLDAG RPYGITPYGT ETMHVLRAEK GYPIIGQDTD GTVTPHDLGM AWAVSKKKPD FVGKRSFARP ANADPLRKQL VGLLPVDRQT VLPEGSQIID FLADGQLPPP PVPMLGHVTS SYRSAELARP FALALVKGGR ERIGDTVHVP VNGTLVPVEV TGSVLVDPEG ARRDG
|
| |
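The start of the protein sequence can be reproduced from the first ten codons of the gene sequence. This is a minimal sketch, assuming the gene sequence is listed in coding orientation (the record places the gene on the minus strand) and that the initial GTG is read as methionine, as translation table 11 permits for start codons. The name `translate_cds` is hypothetical, and the codon table is the standard amino acid assignment that table 11 shares.

```python
from itertools import product

# Amino acid assignments shared by translation tables 1 and 11, with codons
# enumerated in T, C, A, G order at each position ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; the first codon is always read as Met.

    Simplification: the first codon is not checked against the list of
    start codons allowed by table 11.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break  # stop codon ends translation
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence in this record.
    print(translate_cds("GTGACCAGCC CGTTCCGCAC GCCCCAGGGC"))
    # Last 18 nucleotides, prefixed with ATG so the first-codon rule
    # does not alter the real residues.
    print(translate_cds("ATG" + "GCCCGTCGCGATGGCTGA"))
```

The first call should match the opening residues of the protein sequence (MTSPFRTPQG), and the second shows the terminal TGA stop codon following the final ARRDG residues.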