Gene Information |
Locus tag | Dshi_2266 |
Symbol | |
ID | 5713919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2389926 |
End bp | 2391176 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641268188 |
Product | putative sarcosine oxidase, beta subunit |
Protein accession | YP_001533603 |
Protein GI | 159044809 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
| |
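The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; the record does not state either convention, but its numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Dshi_2266.
START_BP = 2389926
END_BP = 2391176

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results match the Gene Length (1251 bp) and Protein Length (416 aa) fields. The strand ("-") does not affect the length calculation, only the orientation in which the sequence is read.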
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.547041 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCTATT CCGCGCTGGA ACTTCTCAAA CAAGGGCTCT CGGGCAACAA GGGTTGGCCC CTGCTGTGGC GCAGCCCCGC GCCGAAGCCC GCCTACGATG CGCTCATCAT CGGCGGTGGC GGGCATGGGC TCGCCACGGC CTATTACCTT GCGCGCAATC ATGGGATGAC CAATGTTGCG GTGCTGGAAA AGGGGTATCT CGGCAGCGGC AATATCGGCC GTAACACCAC TATCGTACGT GCCAACTACC TGCTGGCCGG TAATTCCGAG TTCTATTCCC ATTCCTTGAA GCTATGGGAG GGGATGGAGA CGGATCTGAA CTTCAACGCG ATGCACTCCC AGCGCGGGAT CATCAATCTG TTCCACTCCG ACGGGCAGCG CGATGCCTTT GCCCGCCGCG GCAACGCCAT GATCAACCAG GGTGACGATG CGATCCTGCT GGACCGGGAG GGTGTGCGCC GCGAGGTGCC CTATCTGGAT TTCGACCAGA CCCGCTATCC GATCTATGGC GGGCTGTGGC ACAAGCGCGG CGGGACCGCG CGCCACGACG CTGTCGCCTG GGGCTATGCC CGCGGAGCCG ACCAGCGTGG TGTTGACCTG ATCCAGAATT GCGAAGTCAC CGGATTTCTG ATCGAGAACG GCTCTGTACA GGGCGTGCAG ACCACGCGCG GGGAGATCCG CGCCCCCAAG GTCGGCATGG TCGTGGCGGG TCGCTCGGGG CAGGTGGCCG AGATGGCGGG CTTCCGTCTG CCCATCGAGA GCCACATCCT GCAGGCCTTC GTGACCGAAG GGCTGAAACC ATGCATCGAC CAGGTGATTA CCTATGGCAT GGGTCACTTC TATATCAGCC AGTCCGACAA GGGCGGTCTG GTGTTTGGCG GGGACCTGGA TTTCTACGCC TCCTATGCGC AGCGCGGCAA CCTGCCGATG GCCGAGCACG TTGTTGAGGC GGGAATGACT CTGATGCCGA TGATCGGCAA GGCGAAGATG CTGCGCTCCT GGGGCGGGAT CATGGACATG ACGCCCGACG GCTCGCCAAT CATCGACCAC AGCCCCGTAC AAGGGCTCTA TGTCAACGGG GGCTGGTGCT ATGGCGGGTT CAAGGCAGTG CCCGCCTCAG GCTGGTGCAT GGCGCATCTC ATGGCCACGG ATACGGTACA CGAGATCGCT CGACGCTATC GGCTCGACCG GTTCCGCACG GGGCACCTGA TCGACGAGGA AGCCACGGGC AGTCAGCACA ATCTACACTG A
|
Protein sequence | MRYSALELLK QGLSGNKGWP LLWRSPAPKP AYDALIIGGG GHGLATAYYL ARNHGMTNVA VLEKGYLGSG NIGRNTTIVR ANYLLAGNSE FYSHSLKLWE GMETDLNFNA MHSQRGIINL FHSDGQRDAF ARRGNAMINQ GDDAILLDRE GVRREVPYLD FDQTRYPIYG GLWHKRGGTA RHDAVAWGYA RGADQRGVDL IQNCEVTGFL IENGSVQGVQ TTRGEIRAPK VGMVVAGRSG QVAEMAGFRL PIESHILQAF VTEGLKPCID QVITYGMGHF YISQSDKGGL VFGGDLDFYA SYAQRGNLPM AEHVVEAGMT LMPMIGKAKM LRSWGGIMDM TPDGSPIIDH SPVQGLYVNG GWCYGGFKAV PASGWCMAHL MATDTVHEIA RRYRLDRFRT GHLIDEEATG SQHNLH
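The gene sequence starts with GTG while the protein sequence starts with M, which follows from translation table 11 (the bacterial, archaeal and plant plastid code): alternative start codons such as GTG are read as methionine when they initiate translation. The following is a minimal sketch of that translation and of a GC-content calculation, assuming the sequences are given 5' to 3' in coding orientation with spaces every ten characters, as displayed in this record. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and the start-codon set is the one listed by NCBI for table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS; the start codon becomes M, translation ends at the first stop."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
fragment = "GTGCGCTATT CCGCGCTGGA ACTTCTCAAA"
print(translate_cds(fragment))    # MRYSALELLK
print(gc_fraction("GTGCGCTATT"))  # 0.5
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.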