Gene Ndas_5065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5065 
Symbol 
ID9248954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp207038 
End bp208705 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content72% 
IMG OID 
Productproton-translocating NADH-quinone oxidoreductase, chain M 
Protein accessionYP_003682952 
Protein GI297563979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCT GGCTCACCAT CGCGATCGCG CTCCCCGCGG TCGGCGCGGT CCTGGTCTGG 
GCGCTGCCGC GCACCGCCAA GGCCGCCACG GAGCGGACGG CCGCGGCCTC GGCCGCCTCG
TCCGCCACCG CCTCCTCCGC GGGCGCCACC GCCACGGCGA CGGCCGTGCG GCCCGCCGCC
GCGGCCAAGG GCGGGGCCCC GGCCGAGACC GCCAAGCGGA TCACGCTGGG CTTCTCGCTC
GCCACCCTGC TCGTGCTCGG GGCCATGGCC CTGCAGTTCG ACACCTCCCG GGCGGGGGCC
CTCCAGTTCG AGGAGGTCTA CCCCTGGATC CCGCGCTTCG GCGTCAGCTA CGCCGTGGGC
GTGGACGGCA TCGCCCTGGC GCTGGTCCTG ATGTCGGCCG TGCTGGTGCC GCTGGTGGTC
CTGGCCGCCT GGCGCGAGCA CGACGACCGG GAGGACGGGG GGCGCGGGTA CTTCGCGCTC
ATCCTGGTCC TGGAAGCGAT GATGATCGGC GTCTTCGCCG CCACCGACGT CTTCCTGTTC
TACGTGTTCT TCGAGGCCAT GCTCATCCCG GTCTACTTCA TGATCGGGCG CTACGGGCGC
GGCGACGACC GGTCCAGGGC CGCGGTCAAG TTCCTGCTCT ACAGCCTCGC CGGCGGCCTG
GTCATGCTGG TCGCGGTGAT CGGCGTGTAC GTCGTCGGCG GCACCTTCCT GTGGAGCGAC
CTGGTCGGCG AGGGCAGCGC CCTGGCCGCC GTGGACCCGG CCACGGCCCG CTGGCTGTTC
CTCGGCTTCT TCATCGCCTT CGCGATCAAG GCGCCGATGT GGCCCGTGCA CACCTGGCTG
CCCTCGGCGG CGGGCGCCTC GCGCCCGGGC ACCGCCGTGC TGCTGGTGGG CGTGCTGGAC
AAGGTCGGCA CTTACGGGAT GCTGCGCTAC TGCCTGGAGC TGTTCCCGGC GGCCGTGTCC
TGGTTCGTGT GGCCGGTGGT GGCGCTGAGT CTGGTGAGCA TCATCTACGG CGCGATCCTG
GCCATCGGGC AGAACGACAT GATGCGCCTG GTGGCCTACA CGTCGGTCTC CCACTTCGGC
TTCATCACCC TGGGCATCTT CGCCCTGACC GCGCAGGGGC AGGCCGGGGC CGCCCTGTAC
ATGGTCAACC ACGGGTTCGC GACCGGGGCG CTGTTCCTGG TCGTGGGCTT CCTGATCGCC
CGCCGCGGCT CCGCGCTCAT CAGCGACTAC GGCGGCGTGC AGCGGATCGC GCCCAAGCTG
GCCGGGGTGT TCCTGGTGAC GGGCCTGGCC GGTCTGGCGC TGCCGGGGCT GGCGCCGTTC
GTCAGCGAGT TCCTGGTGTT CGTCGGCGTG TACGCGTTCA GCCCGGCCCC CGCGATCGTC
GCGGCGGTCG GCGTGGTCCT GGCCGCGCTC TACATCCTGT GGATGTACCA GCGCACCATG
AACGGACCCA CCCGGGAGGA CCTGACCGGC CTGCGCGACC TGTCCGCGCG GGAGACGTGG
GCGGTGGCCC CGCTGCTCGC GCTCATCCTC CTGCTCGGGC TCTACCCGCA GCCGGTGCTG
GACGTGATCA ACCCCGCGGT GGAGCGCACC GTGGAGGTCG GCGCGGGCCC TGACAGCGGG
ACCGGGGGCG CCGAGGGCGC CCAGGAAGAG GAAGGAGACG CGGAGTGA
 
Protein sequence
MIPWLTIAIA LPAVGAVLVW ALPRTAKAAT ERTAAASAAS SATASSAGAT ATATAVRPAA 
AAKGGAPAET AKRITLGFSL ATLLVLGAMA LQFDTSRAGA LQFEEVYPWI PRFGVSYAVG
VDGIALALVL MSAVLVPLVV LAAWREHDDR EDGGRGYFAL ILVLEAMMIG VFAATDVFLF
YVFFEAMLIP VYFMIGRYGR GDDRSRAAVK FLLYSLAGGL VMLVAVIGVY VVGGTFLWSD
LVGEGSALAA VDPATARWLF LGFFIAFAIK APMWPVHTWL PSAAGASRPG TAVLLVGVLD
KVGTYGMLRY CLELFPAAVS WFVWPVVALS LVSIIYGAIL AIGQNDMMRL VAYTSVSHFG
FITLGIFALT AQGQAGAALY MVNHGFATGA LFLVVGFLIA RRGSALISDY GGVQRIAPKL
AGVFLVTGLA GLALPGLAPF VSEFLVFVGV YAFSPAPAIV AAVGVVLAAL YILWMYQRTM
NGPTREDLTG LRDLSARETW AVAPLLALIL LLGLYPQPVL DVINPAVERT VEVGAGPDSG
TGGAEGAQEE EGDAE