Gene Namu_4345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4345 
Symbol 
ID8449971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4832370 
End bp4833512 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content72% 
IMG OID645043392 
ProductSarcosine oxidase 
Protein accessionYP_003203621 
Protein GI258654465 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGCA CTCCCCCTAT GAGCGTGGCC GTCATCGGCG GCGGGGCGAT CGGCTCGGCC 
GCCGCCTGGC AGCTGGCGGC CCGCGGGCAT CGGGTCGTGC TGGTCGAACA GTTCGGCCCC
GGTCATGTGC GGGGCGCCTC GCACGGCAGT TCCCGGATCT TTCGCTACTC CTACCCGTCG
GCGCTCTACA TCGAACTAGC CCGGCGGGCC GGCCGGCTCT GGCGACGTCT GGAACGTCTG
CACGGGCAAC GGTTCTACGC CCGGACCGGA TCGGTCGACC ACGGCAATCC GGCGGCGGTG
CAGCGGCTGG CCCGGTCGTT GCACCAGGCC GGGATCGAGC ATTCCGTGCT CACCCCCGGG
GAGGCCGAAC TGCAGTGGCC CGGCCTGCGC TTCGACGGCA TGGTGCTGCA CCATCCCGAC
TCGGGGCGAC TGCACGCCGA TCAGGCGGTC GCCGCGCTCC AGCGGTGCGC CCAGGTCGAG
GGCGCCGAGA TCCGTTTCCA CACCTCGGCG ACCGGGGTTC GGGTCAGTCC CTCCGGGGTT
CGAGTGCTGT CCGCGTCCGG GTCGATCCGC GTCGACCAGG TCGTCGTCGC GGCGGGCGCC
TGGACCTGCG ACATCCTGGA ATCGCTGCCC ACGCTGAGCC GGTCCCTGCC GGCGCTGGTG
ACCACCCAGG AGCAACCGGC GCACTTCGCG CCGCGGCAGA CGCCGGTCGG CTGGCCCAGT
TTCCTGCACC ACCCGGGCGG GCAGTACCTG GGCCCGGCCG TGTACGGCCT GGCCGCCCCG
GACGGGGTGA AGGTCGGCGA GCACGGCACC GGGCCACGCG TCACCCCGCA GCACCGCGAC
TTCCGGCCCG ATCCGGACGG TGTGGGGCGG CTGCAGCAGT ACGCCCAGCA ATGGCTGCCC
GGGGTCGATC CGACCCTGGT CGAGGCCACC ACCTGCCTGT ACACGTCCAC CCCCGACGGG
CACTTCGTCA TCGACCGCCG CGGGCCGATC ACCGTGGCGG CCGGGTTCTC CGGGCACGGC
TTCAAGTTCG CGCCGGCCAT CGGCGAACTG ATCGCCGGCC TGGTGGCCGA GCAGGGTCGC
TCCCCCACCC TCTTCCGGCT TGGACCTCGT GTATCAGAAC CGGTTTCGGC CGGACGCCGC
TGA
 
Protein sequence
MSRTPPMSVA VIGGGAIGSA AAWQLAARGH RVVLVEQFGP GHVRGASHGS SRIFRYSYPS 
ALYIELARRA GRLWRRLERL HGQRFYARTG SVDHGNPAAV QRLARSLHQA GIEHSVLTPG
EAELQWPGLR FDGMVLHHPD SGRLHADQAV AALQRCAQVE GAEIRFHTSA TGVRVSPSGV
RVLSASGSIR VDQVVVAAGA WTCDILESLP TLSRSLPALV TTQEQPAHFA PRQTPVGWPS
FLHHPGGQYL GPAVYGLAAP DGVKVGEHGT GPRVTPQHRD FRPDPDGVGR LQQYAQQWLP
GVDPTLVEAT TCLYTSTPDG HFVIDRRGPI TVAAGFSGHG FKFAPAIGEL IAGLVAEQGR
SPTLFRLGPR VSEPVSAGRR