Gene Ndas_1059 details


Gene Information

Locus tag: Ndas_1059
Symbol:
ID: 9244905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1305442
End bp: 1306602
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: NADH:flavin oxidoreductase/NADH oxidase
Protein accession: YP_003679007
Protein GI: 297560033
COG category:
COG ID:
TIGRFAM ID:
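
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1305442, 1306602)
print(length, protein_length_aa(length))  # 1161 386
```

Both results agree with the Gene Length and Protein Length fields of this record.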


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0160422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.54744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACCG ACCACTTGCC GCAGACGGAA GGACACCCCT TGAGCACGCT GTTCACCCCC 
TTGAAGCTGA GGTCGCTGGA GATCCCCAAC CGCGTGTGGA TGTCCCCGAT GTGCATGTAC
TCGGCCGCCT CCGAAGGGCC CGACGCCGGC GCCCCGACCG ACTTCCACCT CGCGCACCTG
GCCGACCGGG CCGCGGGCGG GGCCGGTCTG GTCATGGTCG AGGCCACCGG CGTGCGGCCC
GACGGGCGGA TCAGCCCCTG GGACCTCGGC CTGTGGAACA GCCGCCAGCA GGAGGCCTTC
AAACGCGTGA CCTCCGCGAT CAGCGCCCAC GGCGCGGTAC CGGCCATCCA GCTCGCGCAC
GCCGGGCGCA AGGCCTCGAC GGACAAGCCC TGGCTGGGCG GACAGTACGT CCCCGAGTCC
GAGGGCGGCT GGCCGACGGT GGGTCCCGGC ACTGAGGCCT TCCCCGGCTA CCCCGCGCCG
GTCGAGCTGA CCGCCGACGG GATCCGGCAG CTGGTCCGAG ACTTCGCGGC TTCGGCCGAG
CGGGCGCTGG CCGCCGGGTT CCAGGTGGCC GAGGTGCACG GCGCGCACGG TTACCTGCTG
CACTCCTTCC TGTCGCCCGC CACCAACCAC CGCACCGACG AGTACGGCGG CAGCCCCGAG
AACCGGATGC GCTTCGCGCT GGAGGTCGTC GAGGCCGTGC GCGAGGTGTG GCCCGAGCAC
CTGCCCGTGT TCTTCCGCAC CTCGGCCACC GACTGGCTGA CGGAGAACGA GGCCGACGAG
CGCGAGGGCT GGACCGGCGA GGACACCGTC CTGCTCGCCA AGGAGCTCCA GGCCCGGGGC
GTGGACCTCC TCGACGTGTC CACCGGAGGC CTGGTCGCCG ACGCCCTCAT CCCCGTGGGC
CCGAACTACC AGGTGCCCTT CGCCGAGCGG GTCCGCGGTG CCACCGGCCT GCCCACCGGC
GCGGTCGGAG CGATCACCGA ACCCGCCCAG GCCGAGGAGA TCGTCGCCTC GGGCCGCGCC
GACGCGGTGT TCCTGGGACG GCAGCTGCTG CGCGAGCCCT ACTGGGCCCA CCGCGCCGCG
GAGGAGCTGG GCGCCGACTC GCACTGGCCC GAGCAGTACG GCTACGCGGT CGGCCGCCCG
CGCGTCGCCA GCGGGGCCTG A
 
Protein sequence
MDTDHLPQTE GHPLSTLFTP LKLRSLEIPN RVWMSPMCMY SAASEGPDAG APTDFHLAHL 
ADRAAGGAGL VMVEATGVRP DGRISPWDLG LWNSRQQEAF KRVTSAISAH GAVPAIQLAH
AGRKASTDKP WLGGQYVPES EGGWPTVGPG TEAFPGYPAP VELTADGIRQ LVRDFAASAE
RALAAGFQVA EVHGAHGYLL HSFLSPATNH RTDEYGGSPE NRMRFALEVV EAVREVWPEH
LPVFFRTSAT DWLTENEADE REGWTGEDTV LLAKELQARG VDLLDVSTGG LVADALIPVG
PNYQVPFAER VRGATGLPTG AVGAITEPAQ AEEIVASGRA DAVFLGRQLL REPYWAHRAA
EELGADSHWP EQYGYAVGRP RVASGA
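
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the pipeline that produced this record: it uses the sense-codon assignments shared by NCBI translation tables 1 and 11, strips the grouping spaces used in the listing, and stops at the first stop codon. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Table 11 also permits alternative start codons such as GTG and TTG, which this sketch does not special-case; that does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI tables 1 and 11 share these).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove whitespace (the listing groups bases in tens) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGGACACCG ACCACTTGCC GCAGACGGAA GGACACCCCT TGAGCACGCT GTTCACCCCC"
print(translate(first_line))  # MDTDHLPQTEGHPLSTLFTP
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.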