Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4345 |
Symbol | |
ID | 8449971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4832370 |
End bp | 4833512 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043392 |
Product | Sarcosine oxidase |
Protein accession | YP_003203621 |
Protein GI | 258654465 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGCA CTCCCCCTAT GAGCGTGGCC GTCATCGGCG GCGGGGCGAT CGGCTCGGCC GCCGCCTGGC AGCTGGCGGC CCGCGGGCAT CGGGTCGTGC TGGTCGAACA GTTCGGCCCC GGTCATGTGC GGGGCGCCTC GCACGGCAGT TCCCGGATCT TTCGCTACTC CTACCCGTCG GCGCTCTACA TCGAACTAGC CCGGCGGGCC GGCCGGCTCT GGCGACGTCT GGAACGTCTG CACGGGCAAC GGTTCTACGC CCGGACCGGA TCGGTCGACC ACGGCAATCC GGCGGCGGTG CAGCGGCTGG CCCGGTCGTT GCACCAGGCC GGGATCGAGC ATTCCGTGCT CACCCCCGGG GAGGCCGAAC TGCAGTGGCC CGGCCTGCGC TTCGACGGCA TGGTGCTGCA CCATCCCGAC TCGGGGCGAC TGCACGCCGA TCAGGCGGTC GCCGCGCTCC AGCGGTGCGC CCAGGTCGAG GGCGCCGAGA TCCGTTTCCA CACCTCGGCG ACCGGGGTTC GGGTCAGTCC CTCCGGGGTT CGAGTGCTGT CCGCGTCCGG GTCGATCCGC GTCGACCAGG TCGTCGTCGC GGCGGGCGCC TGGACCTGCG ACATCCTGGA ATCGCTGCCC ACGCTGAGCC GGTCCCTGCC GGCGCTGGTG ACCACCCAGG AGCAACCGGC GCACTTCGCG CCGCGGCAGA CGCCGGTCGG CTGGCCCAGT TTCCTGCACC ACCCGGGCGG GCAGTACCTG GGCCCGGCCG TGTACGGCCT GGCCGCCCCG GACGGGGTGA AGGTCGGCGA GCACGGCACC GGGCCACGCG TCACCCCGCA GCACCGCGAC TTCCGGCCCG ATCCGGACGG TGTGGGGCGG CTGCAGCAGT ACGCCCAGCA ATGGCTGCCC GGGGTCGATC CGACCCTGGT CGAGGCCACC ACCTGCCTGT ACACGTCCAC CCCCGACGGG CACTTCGTCA TCGACCGCCG CGGGCCGATC ACCGTGGCGG CCGGGTTCTC CGGGCACGGC TTCAAGTTCG CGCCGGCCAT CGGCGAACTG ATCGCCGGCC TGGTGGCCGA GCAGGGTCGC TCCCCCACCC TCTTCCGGCT TGGACCTCGT GTATCAGAAC CGGTTTCGGC CGGACGCCGC TGA
|
Protein sequence | MSRTPPMSVA VIGGGAIGSA AAWQLAARGH RVVLVEQFGP GHVRGASHGS SRIFRYSYPS ALYIELARRA GRLWRRLERL HGQRFYARTG SVDHGNPAAV QRLARSLHQA GIEHSVLTPG EAELQWPGLR FDGMVLHHPD SGRLHADQAV AALQRCAQVE GAEIRFHTSA TGVRVSPSGV RVLSASGSIR VDQVVVAAGA WTCDILESLP TLSRSLPALV TTQEQPAHFA PRQTPVGWPS FLHHPGGQYL GPAVYGLAAP DGVKVGEHGT GPRVTPQHRD FRPDPDGVGR LQQYAQQWLP GVDPTLVEAT TCLYTSTPDG HFVIDRRGPI TVAAGFSGHG FKFAPAIGEL IAGLVAEQGR SPTLFRLGPR VSEPVSAGRR
|
| |