Gene Ndas_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3520 
Symbol 
ID9247389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4228777 
End bp4229949 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content76% 
IMG OID 
ProductSarcosine oxidase 
Protein accessionYP_003681427 
Protein GI297562453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCAC CGTTGGAGAC CGACACCGTC GTGGTCGGAC TGGGAGCCAT GGGGGCGCAG 
GCCCTGTGGC GCCTGGCCCG GCGCGGTGTG GACGTGATCG GGGTCGAGCA GTTCACGCCC
GGGCACGACC GGGGCTCCAG CCACGGCGAG TCCCGCATCA TCCGCACCGC CTACATGGAG
GGCGCCGCCT ACGTGCCGTT CGTGCGGTCG GCCTGGCGCG CCTGGTCGGA GCTGGAGGAG
GCCTCCGGAA CCCGGCTCGT GGTGCGCACC GGCGCCCTGA TGCTCGGCGC GCCGGACAGC
CCCGCCGTCA CCGGGTCGGT CGCCGCCGCC GAACACCACG GTCTCCCCCA CCAGGTGCTC
TCCCGCGACC AGGTCGCCGA GCGCTTCCCC CAGCACGTGC TGCGCCCGGG TGAGGTGGGC
GTCTTCGAGG AGGACGCCGG TGTGGTCCTG CCCGAGGCCG CGATCACGGC GGCCGTGCGG
CTCGCGCGGG AGGCGGGCGC GCGGGTGCTC ACCGGCGCCC GGGCGTCCCG TGTCGTCCCC
GACCCGGACC GCCCCCGTGT GGTGGTCGGG GACACCGTGA TCCGGGCCCG CCGGGTGGTC
GTGACCGCCG GGTCCTGGCT GCCGCGGCTG GTGCCCGAGG TGGCGGAGCT GGGCGGCGGC
CTGCGGGTGG AGCGGCGGGT GCTGGGCTGG TTCCGCACCA CGCGGGACCC GTCCCCGCAC
GCGCACGGAC CGGTGTTCGC CCGGGACGAG GACGACTGCA CGTGGTACGG GTTCCCCAGC
ATGGACGGCG GCCTGACCGT CAAGATCGGT GTGCACGCCG AGGCTCCGGG GAACAGGGGC
GAGGGCGCCC AGTGGGGCGA ACCGGTCGAC CCCGACGCGG GGCCGCGGGA GCCCGACGCC
GCCGACGCGC GGCGGCTGGG ACGGCTGGCC GCCGGACTGA ACGGTGTGGC CCCGCTGCCC
GAGCGGATGG CGTCGTGCAT GTACACGATG ACGCGGGACG AGCACTTCGT CATCGGGCAG
CGCCGCGAAC TGCCCGGACT GGTGCTGGCG GGGGGCTTCT CGGGGCACGG CTACAAGTTC
GCCTCCGCGG TCGGGGAGGC GCTGGCCGAC CTGGCCCGGC ACGGGCGCAC GGACCTGGCC
GTGGACCTGT TCGACCCGCA CCGCTGGGAC TGA
 
Protein sequence
MTAPLETDTV VVGLGAMGAQ ALWRLARRGV DVIGVEQFTP GHDRGSSHGE SRIIRTAYME 
GAAYVPFVRS AWRAWSELEE ASGTRLVVRT GALMLGAPDS PAVTGSVAAA EHHGLPHQVL
SRDQVAERFP QHVLRPGEVG VFEEDAGVVL PEAAITAAVR LAREAGARVL TGARASRVVP
DPDRPRVVVG DTVIRARRVV VTAGSWLPRL VPEVAELGGG LRVERRVLGW FRTTRDPSPH
AHGPVFARDE DDCTWYGFPS MDGGLTVKIG VHAEAPGNRG EGAQWGEPVD PDAGPREPDA
ADARRLGRLA AGLNGVAPLP ERMASCMYTM TRDEHFVIGQ RRELPGLVLA GGFSGHGYKF
ASAVGEALAD LARHGRTDLA VDLFDPHRWD