Gene Ndas_5066 details


Gene Information

Locus tag: Ndas_5066
Symbol:
ID: 9248955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 208702
End bp: 210354
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: proton-translocating NADH-quinone oxidoreductase, chain N
Protein accession: YP_003682953
Protein GI: 297563980
COG category:
COG ID:
TIGRFAM ID:
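
The coordinates and lengths listed in this record agree with each other, which a little arithmetic confirms. The sketch below assumes the start and end positions are inclusive 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 208702
end_bp = 210354

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length)     # 1653
print(leftover)        # 0
print(protein_length)  # 550
```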


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.146495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCCCC ACATGGACCT TCCCGTCATC ACCGAGGCGG CGCCGCAGAT GGACTGGTGG 
CTGCTCGCGC CGCTGCTCAC GGTGTTCGCC GCCGGTGTTC TGGGCGTGCT GTTCGAGGCG
TTCGTCCCCC AGGGGCGGCG CCGCGCCATC CAGATCGGGC TCGCGGCGCT CGCGCTGCTG
GCCGCGTTCG TCCTGCTGGT GCTCCAGGTG GGCTCGCTGC CCGAGGACGC GCCGGGGGTG
ACGGTCGGCG CCGGGGCGCT GGCGGTGGAC CGGACGGTGA TCTTCTTCCA GGGCACCATC
ACCGTGCTCG CGCTGGTGAG CCTGCTGCTG ATCTCCGAGC GCCGGGACGG GCGCGACGCG
TTCGCCGCGC AGGCGGCGAC GGTGCCCGGG AGCGAGGAGG AGCGCGCGCA CATCCTGGCG
GGCTCCCAGC ACACCGAGGT GTACCCGCTG GTGATGTTCG CGGTGCTGGG GATGCTGATG
TTCCCGGCCT CGAACGACTT CCTGACGATG TTCATCGCGC TGGAGGTCAT GAGCCTGCCG
CTGTACCTGC TGTGCGGGCT GGCCCGGCGC CGCCGCCTGT TCTCGCAGGA GGCCGCGGTC
AAGTACTTCC TGCTGGGCGC GTTCTCCTCG GCGTTCTTCC TGTTCGGCGC CGCCATGGTC
TACGGCTACG CGGGCTCGGT GAACTTCGGC GAGATCCGCG CGGCCGTCGA GGCGGGCGGC
GCGGACGTCT TCCAGGGCGG AGACGGCGAG CCGCTGCTGC TCATGGGCAT CGGCCTGGTG
GCGATCGGCC TGCTGTTCAA GGTCGGCGCG GTGCCCTTCC ACAACTGGAA GCCGGACGTG
TACCAGGGCG CGCCGACCCC GATCACGGCG CTGATGGCCT CCTGCACGCT GGTCGCCGCG
TTCGGCGCCC TGCTGCGGGT GTTCTTCGTG CCCTTCGGCG GCTCGGCCCA GACCTGGGAG
CCGATGCTGT GGGCGGTGGC CGTCCTGACG ATGGTCGTGG CCGCGGTCAT CGCGGTGACC
CAGCGCGACG TCAAGCGGCT GCTGGCCTAC TCGTCGGTGG TGCACGCCGG GTTCATCCTG
ACCGCCGTCG TGGCGGGCAG CACGGACGGG CTCGCGGGCG CCATGTTCTA CCTGGCGGCC
TACGGGTTCA CCACGCTGGG CGCGTTCGCC GTGGTGACGC TGGTGCGGAC CAAGGACGGC
GGCCAGGAGC TGGGCGACCT GGACCGGTGG GCGGGGCTGG GCCGCCGCTC GCCGGTGCTG
GCCGGGGCGC TGGCGCTGTT CCTGCTGGCG TTCGCCGGGA TCCCGCTCAC GAGCGGCTTC
ATCGGCAAGT TCGCGGTGTT CGAGGCGGCG GTGGCCGCCG GCGCCGTCCC GCTGGTCGTG
GTCGGTGTGC TGAGCAGCGC GGTGACCGCG TTCTTCTACG TGCGGATCAT CGTGGTGATG
TTCTTCCGGG ACCCGGAGGG CGAGGGCCCC ACGGTGGTGC GCGCGGGCGC GGCCACGGGC
GGGGTCATCG CGATCGGGGT GGCGGCGACG CTGCTGCTCG GCGTGTACCC GGGGGCGGTG
CTCGACAACC TGCTGCCCCC GGCCGACGCC GACCAGGCCC CGACCTCGGT GATGGTCGAA
CAGGCGTCGA CGCAGGCCGG AGGCGCGGAG TGA
 
Protein sequence
MIPHMDLPVI TEAAPQMDWW LLAPLLTVFA AGVLGVLFEA FVPQGRRRAI QIGLAALALL 
AAFVLLVLQV GSLPEDAPGV TVGAGALAVD RTVIFFQGTI TVLALVSLLL ISERRDGRDA
FAAQAATVPG SEEERAHILA GSQHTEVYPL VMFAVLGMLM FPASNDFLTM FIALEVMSLP
LYLLCGLARR RRLFSQEAAV KYFLLGAFSS AFFLFGAAMV YGYAGSVNFG EIRAAVEAGG
ADVFQGGDGE PLLLMGIGLV AIGLLFKVGA VPFHNWKPDV YQGAPTPITA LMASCTLVAA
FGALLRVFFV PFGGSAQTWE PMLWAVAVLT MVVAAVIAVT QRDVKRLLAY SSVVHAGFIL
TAVVAGSTDG LAGAMFYLAA YGFTTLGAFA VVTLVRTKDG GQELGDLDRW AGLGRRSPVL
AGALALFLLA FAGIPLTSGF IGKFAVFEAA VAAGAVPLVV VGVLSSAVTA FFYVRIIVVM
FFRDPEGEGP TVVRAGAATG GVIAIGVAAT LLLGVYPGAV LDNLLPPADA DQAPTSVMVE
QASTQAGGAE
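
As a rough check on the sequences above, the sketch below removes the 10-character block spacing and translates the first and last lines of the gene sequence using the codon assignments of translation table 11. It assumes the first codon of the CDS is an initiation codon read as methionine, which is why the leading GTG yields M rather than V. The names `join_blocks` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Codon assignments for translation table 11, in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate(cds: str, initiator: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: when `initiator` is True, the first codon is an
    initiation codon and is read as M regardless of its usual meaning.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if i == 0 and initiator:
            aa = "M"
        elif aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGATCCCCC ACATGGACCT TCCCGTCATC ACCGAGGCGG CGCCGCAGAT GGACTGGTGG"
last_line = "CAGGCGTCGA CGCAGGCCGG AGGCGCGGAG TGA"

print(translate(join_blocks(first_line)))                  # MIPHMDLPVITEAAPQMDWW
print(translate(join_blocks(last_line), initiator=False))  # QASTQAGGAE
```

Both results match the corresponding ends of the protein sequence listed in the record.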