Gene Ndas_5060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5060 
Symbol 
ID9248949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp201822 
End bp203195 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content70% 
IMG OID 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_003682947 
Protein GI297563974 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.252205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCCCG CCGCTCTCGG TCCGGGCGTG ACCGCCCACC TGGCCTCCGA GGAGACGCTG 
AGCGCCTTCG GCGGCGACCC CTGGTGGATC ACCACCATCA AGGCGCTGGC GATCTTCGTC
TTCCTCATGA TCTGCGTGCT GATGATGATC ATGGCCGACC GCAAGGTCAT GGGCCGGATG
CAGCAGCGCC ACGGACCCAA CCGGATGGGC CCCTTCGGCC TGTTCCAGTC GCTCTTCGAC
GGCATCAAGC TCTCCCTCAA GGAGGACCTG ATCCCGCGCG GGGTGGACCG GTTCGTCTAC
ATCGCCGCGC CGATGATCGC GGCCGTGCCC GCCTTCATCG CCTTCTCGGT CATCCCGATC
GGACCCGAGG TGAACCTGTT CGGGGTCATC ACCCCGCTCC AGCTCACCGA CCTGCCCGTC
GCGGCCCTGG TGGTCCTGGC CACCGCCGCG CTGGGGGTCT ACGGGTTCGT GCTCGGCGGC
TGGGCCTCCC AGTCGCCCTA CGCCCTGCTC GGCGGCCTGC GCGCCTCCGC GCAGGTGATC
AGCTACGAGA TCGCGATGGG CCTGTCCTTC GTCGCGGTCT TCATCATGTC CGGGACCCTG
ACCACCTCGG GGATCGTCGA GTCCCAGCGC GGACTGTGGT TCGCCGTGCT GCTCCTGCCG
TCCTTCCTCA TCTACCTGGT GACGATGGTC GGCGAGACCA ACCGGCTGCC CTTCGACCTG
GCCGAGGGCG AGGGCGAGAT CGTCGGCGGC TTCATGACCG AGTACGGGTC CATGAAGTTC
ACGATGTTCT TCCTCGCGGA GTACGTGAAC ATGTGCACGG TCGCCGCCGT GTCCGTCACG
CTCTTCCTCG GCGGCTGGCT CGCCCCGCCC GGGATCGCGG CGATCCTGCC GGGCGCCAAC
GAGGGCTGGT GGCCCGCCCT GTGGTGGCTG CTCAAGTTCG TCTGCGTGAT GTTCCTGTTC
ATCTGGGCGC GCGGCAGCCT GCCCCGGGTG CGCTACGACC AGCTCATGAA GCTGGGCTGG
AAGGTGCTCA TCCCGATCCA GCTGGTGTGG ATCACCGCCG TCGCCGTCGT GCGGATGCTC
GTCCTGGACG GCGCCTCCCC GCTGGTCGTC GGCGTGGTGA TCGCCGCCTT CACCGCCGCG
ACCGTGGCCG CCTTCGCCGC CTGGCTGCGC CACGTGCGCC GCGAGCGCGC CGAGGAGGCC
CGCCAGCGCG CGGAGAACGC CCGCCGGGCG CACCGGGAAC CCGCCTTCGG CGGCTTCCCG
GTTCCGCCGA GCGCCGCACC GCACTACGGC AGCAGCGTGC TCGCCGAACC GCCGCGGACG
GCCCCGGCCG CCGCCGACCG CAACAAGAAG GGTGAGGAGG TTACCGGTGC TTGA
 
Protein sequence
MTPAALGPGV TAHLASEETL SAFGGDPWWI TTIKALAIFV FLMICVLMMI MADRKVMGRM 
QQRHGPNRMG PFGLFQSLFD GIKLSLKEDL IPRGVDRFVY IAAPMIAAVP AFIAFSVIPI
GPEVNLFGVI TPLQLTDLPV AALVVLATAA LGVYGFVLGG WASQSPYALL GGLRASAQVI
SYEIAMGLSF VAVFIMSGTL TTSGIVESQR GLWFAVLLLP SFLIYLVTMV GETNRLPFDL
AEGEGEIVGG FMTEYGSMKF TMFFLAEYVN MCTVAAVSVT LFLGGWLAPP GIAAILPGAN
EGWWPALWWL LKFVCVMFLF IWARGSLPRV RYDQLMKLGW KVLIPIQLVW ITAVAVVRML
VLDGASPLVV GVVIAAFTAA TVAAFAAWLR HVRRERAEEA RQRAENARRA HREPAFGGFP
VPPSAAPHYG SSVLAEPPRT APAAADRNKK GEEVTGA