Gene Ndas_4938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4938 
Symbol 
ID9248825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp75493 
End bp76848 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative monooxygenase 
Protein accessionYP_003682827 
Protein GI297563854 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.131533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG GACGACGGCT CCACCTCAAC GCCTTCCTCA TGGGGGTGGG GCACCACGAG 
GCGGCCTGGC GGCACCCGCG CACGCGGCAG GACGGGGTGC TGGACGTGGC GCACTTCCAG
AACCTGGCGC GTGTCGCCGA ACGCGGACGG CTGGACTCGG TGTTCTTCGC CGACGGCCTG
GCCGTGGGCC ACCGGGTCGA GCGCAACACC CTCGCCGTCT TCGAGCCCAT CACCCTGCTC
AGCGCGATGG CCGCGGTGAC CGAGCGGGTG GGGCTGATCG CCACCGCCTC CACCGGCTAC
TACCCGCCCT ACCTGCTGGC CAGGGCCTTC GCCTCGCTGG ACCACATCAG CGGCGGACGC
GCCGGATGGA ACATCGTCAC CTCCGGCCGC GAGGACGAGG CGGCCAACTT CGGACTGGAC
CAGGTGCCCG AGCACGCCGA CCGCTACGCG CGTGCGGCGG AGTTCACCGA CGTCGTCGTC
AAGCTGTGGG ACAGCTGGGA GGACGGCGCC CTGCGACTCG ACGCCGAGGA GGGGGTGTTC
GCCGACCCCG ACCGCGTCCA CGCCATCGAC CACGAGGGCG AGCGGTTCCG GGTGCGCGGG
CCGCTGAACT CCCCCCGCCC GCCGCAGGGC CGTCCGGTGC TGGTGCAGGC GGGGTCCTCC
GAGGACGGCA GGGAGTTCGC CGCGCGCTAC GCCGAGGCGG TGTTCACCGC GCAGCAGACC
CTCCGGGAGG GGGTGGACTT CTACCGCGAC CTCAAGGGGC GGTTGGCGCG TTTCGGACGC
GCCCCGCAGG AGTTGAAGGT CCTGCCCGGG ATCGTGCCGT TCATCGCGCC GACGGAGGCC
GCGGCCAAGG AGCTGGAGGC GGAGTTCACC GGGCTCATCT CCCCGGACTA CGCCCTGCGC
CAGCTCTCCC AGATGCTGGG GGTGGACCTG ACCGGGCACC CGCTGGACGC CCCCCTGCCC
CCGCTGCCCG GCGAGGACGG CATCCGCGGC AACAAGAGCC GGTACACCCT GGTCGCGGAC
CTGGCGGCGC GCGAGTCGCT GACCGTGGGG GAGCTGATCG GGCGCCTGGG CGGCGGGCGC
GGCCACCGGA CCTTCGCGGG CACGCCCGAG CAGGTGGCGG ACGAGATCCA GGGCTGGTTC
GAGGCCGGGG CCGCCGACGG GTTCAACGTC ATGCCGCCCC ACCTCCCCGG CGGCCTGGAG
GACTTCGTCG ACCAGGTGGT GCCGATCCTC CGGGAGCGCG GCCTCTTCCG GGAGGAGTAC
GAGGGGACGA CCCTGCGCGA CCACTACGGT CTGCCCCGCC CTCCCAGCCA GTACGCCGAA
GCCGTGCCGG TACCGGAGGT CTCCGGCGCG GTCTGA
 
Protein sequence
MTDGRRLHLN AFLMGVGHHE AAWRHPRTRQ DGVLDVAHFQ NLARVAERGR LDSVFFADGL 
AVGHRVERNT LAVFEPITLL SAMAAVTERV GLIATASTGY YPPYLLARAF ASLDHISGGR
AGWNIVTSGR EDEAANFGLD QVPEHADRYA RAAEFTDVVV KLWDSWEDGA LRLDAEEGVF
ADPDRVHAID HEGERFRVRG PLNSPRPPQG RPVLVQAGSS EDGREFAARY AEAVFTAQQT
LREGVDFYRD LKGRLARFGR APQELKVLPG IVPFIAPTEA AAKELEAEFT GLISPDYALR
QLSQMLGVDL TGHPLDAPLP PLPGEDGIRG NKSRYTLVAD LAARESLTVG ELIGRLGGGR
GHRTFAGTPE QVADEIQGWF EAGAADGFNV MPPHLPGGLE DFVDQVVPIL RERGLFREEY
EGTTLRDHYG LPRPPSQYAE AVPVPEVSGA V