Gene Ndas_1454 details


Gene Information

Locus tag: Ndas_1454
Symbol:
ID: 9245304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1778985
End bp: 1780355
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: amine oxidase
Protein accession: YP_003679392
Protein GI: 297560418
COG category:
COG ID:
TIGRFAM ID:
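
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1778985
end_bp = 1780355

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```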


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.162575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.372865
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGG GGCTCGACGT CGCCGTGGTC GGAGCGGGCC CCGCCGGACT GGCCACCGCG 
CACGCGCTGG CCCGCGCGGG CCGCTCGGTC CGCGTCCTGG AGGCCGACGA GGCCGTGGGC
GGACGCATGC GCACCCTGCG CGAGCACGGC TGCCTCATCG ACACCGGCGC GGAGATGCTG
CCCCCGGCCA AGGCCTACCC GGCCACCTGG CGGCTGATCC GCGAGGTCGG TCTGGACGCC
GACCGCGGCG CGGTGCCCCG CATCCGCGGC GCCCTGGCGT CCTGGCGCGA GGGCCGGGTC
CGACCGCACG TGGGCCGCCC CCTGGGCCTG CTCACCGGCG CCGGGCTGTC CCCCCGCGCG
CGGGCCGACC TGCTGCGCCT CCAGATCCAG CTGGCCCGCC TCGGCCCGGA CCCCGAACAC
CCCGAGCGCT CCCCGCTGGG CGCGCGCACC CTCGCCGAAC TGCTCCGCCC CTACCATCCG
GAGCTGCGCT ACCGGCTGCT CGGCGCCCTG GCCGCCGGGT TCTTCGGCTG GGACCCCGAG
CACACGGCCG CCGCCCCCTT CGCCGCGCAC CTGGTCTCCG CGGGCAGCAG CGCCCGCTGG
CGCACCTACC GCGACGGCAT GGACACCCTC GCGCGGATCC TGGCCGACCG GCTCGACGTG
GTCACCGGCC ACCCGGTGAC CGCTGTGTCC GCGGGCCCCG ACGGGGTCAG GCTGGAGTCC
CCGGCTGGCA CGGTCACCGC CCGTGCGGCC GTGCTGGCCG TGCCAGCCCC GGTCGCGGCC
GGGATCCACC CCGGCGCGTC CGGCGTCGAG CGCGCCTACC TGGAGGCCTG CGAGTACGCG
CCCATGCTCC GCGTCTCCCT GGTCCTGGAC CGCCCGATGG AACCGGCCGG GGCGCGCGGC
GGCTTCGCCA CCCTGGTGCC CGCCCTGGAG GACCCGCTGC TCAACGTGGT CACCGCCGAC
CACAACAAGC ACCCGGGCCG GGTGCCGCCC GGACGGGGCC TGGTCTCCCT GGTGGCCTCC
CCGCGCGGCG CCGCCGAACT CCTCGACGCC TCCGACGCGG TGGTGGCGGA GCGGCTCACC
GACCGCGCCG AACCCCTGCT GCCCGGCCTG TCCCGACGCG TCGAGGCCGT CCGCGTGCAC
CGCTTCCGCC ACGGCCTGCC CGCCGCGGGG CCGCGCGCCC TGCGCGAGCG CTCCGCCTTC
GCCGTCCGCC CGCCCGCGGC CGTGGACTAC GCCGGGGACT GGATCTCACT GCGCCCGTGC
AGCGAGGGCG CGGTCTCCTC CGCGACCACG GCCGCCGCGC GGGTCCTGGC CTTCCTCGCC
TCGACGGCCC CGACCCCCGT TCCCACCCCC AGTGAGGACC GACGACGATG A
 
Protein sequence
MSQGLDVAVV GAGPAGLATA HALARAGRSV RVLEADEAVG GRMRTLREHG CLIDTGAEML 
PPAKAYPATW RLIREVGLDA DRGAVPRIRG ALASWREGRV RPHVGRPLGL LTGAGLSPRA
RADLLRLQIQ LARLGPDPEH PERSPLGART LAELLRPYHP ELRYRLLGAL AAGFFGWDPE
HTAAAPFAAH LVSAGSSARW RTYRDGMDTL ARILADRLDV VTGHPVTAVS AGPDGVRLES
PAGTVTARAA VLAVPAPVAA GIHPGASGVE RAYLEACEYA PMLRVSLVLD RPMEPAGARG
GFATLVPALE DPLLNVVTAD HNKHPGRVPP GRGLVSLVAS PRGAAELLDA SDAVVAERLT
DRAEPLLPGL SRRVEAVRVH RFRHGLPAAG PRALRERSAF AVRPPAAVDY AGDWISLRPC
SEGAVSSATT AAARVLAFLA STAPTPVPTP SEDRRR
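
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the permitted start codons; this gene starts with ATG, so no special handling of alternative starts is included. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool.

```python
# Codon-to-amino-acid mapping in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCCAGG GGCTCGACGT CGCCGTGGTC"))
# Last 21 bases of the gene sequence above, including the TGA stop codon.
print(translate("AGTGAGGACC GACGACGATG A"))
```

Passing the full gene sequence to `translate` in the same way allows a comparison with the protein sequence listed above.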