Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4112 |
Symbol | |
ID | 9247986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4910170 |
End bp | 4912140 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Respiratory-chain NADH dehydrogenase domain 51 kDa subunit |
Protein accession | YP_003682014 |
Protein GI | 297563040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.483313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCTGC GATTCCTGGA CGAGACGCCC ACCACCGCCG AACGCGAGGC CGTCGAGAGC GTACTCGGGC CGCCCGAGTC CGGCTGGGAC GGCGGCACCC GCCAGGACGC CGACCTGCGC TACGCGCGGG GCGGCCACTC CGCCCGCGAC CGGCGCGACC TCCTCCTCCC GGCCCTGCAC GCGGTGAACG ACCGCGTGGG CTGGATCAGC CGCCCCGCCC TCGACCACAT CTGCCGCAGG CTGACCGTCC CCCCGGCGGA GGCCTACGCC GTCGCCTCCT TCTACGCGAT GTTCGCCCTG CGCCGCAGGC CCCGCCGGGT CGTGCACCTG TGCACCGACA TCGCCTGCGC GGCCGCGGGT TCGGACCGGA TGCGCGCCCG GCTGACCGAG CGACTCGGCC CGCCCGGCGG CCACGACGGG GAGGCCGCCT GGCACGAGAG CCCGTGCCTG GGCATGTGCG AGCGCGCCCC CGCGGCCCTG GCCCTGGAGG CGGGCGCCCC CGCCCGGTAC GCCGTCGCCG CCCCCGCCAC GCCCGAGGCG CTGGACGCCA TGGCCGCGCG GGACCTGCCC GAGAACGGGG CGCCGAAGAC CAACGGCCAC CGGCCCTCCA CCACCTTCGG CGCGGACCCG CTGCCCGGGA CCGCGCCCGA GCCCGACCCC GCGGCGGCCG TGCCCGACGC GGGCTCCCCC GACCACGTGC TGCTGCGCCG GATCGGCACC GTCGATCCCC TGAGCCTGGA CGACTACCGC GCCGCGGGCG GGTACGCCGC GCTGCGCCGG GCGCTGCGCC TGGGCCCCGC CGGTGTCATC CGGGAGGTCT CCGACTCCGG CCTCGTCGGC CGGGGCGGGG CCGCCTTCCC GACCGGGCGC AAGTGGCAGG CCACGGCCCA GCAGCCGGAC GGCCCGCACT ACCTGGTGTG CAACGCCGAC GAGAGCGAGC CCGGCACCTT CAAGGACCGG GTCCTGATGG AGGGCGACCC GTTCTCCGTC GTCGAGGCGA TGACCATCGC GGGGTACGCC GTGGGCGCGC GGACCGGCTA CCTCTACATC CGGGGTGAGT ACCCCCGGGC GCTGCGCCAC CTGGAGAACG CCGTCGCCCT GGCCCGCGGG CGGGGCCTGC TCGGCCCGGA CATCCTGGGC TCGGGTTTCG CCTTCGACAT CGAGATCCGG CGCGGCGCGG GCGCCTACAT CTGCGGCGAG GAGACCGCGA TCTTCGGCTC CATCGAGGGG CAGCGGGGCG AGCCGCGCAG CAAGCCGCCC TTCCCCGTGG AGAAGGGGCT GTTCGGCAAG CCCACCGCCG TGAACAACGT CGAGACGCTG GTCAACGTGC TGCCCGTCCT GCTGATGGGC GGCCCGGCCT ACGCGGCCGT GGGCACCGGC CAGTCCACCG GCCCCAAGCT GTTCTGCCTG TCGGGCGGCG TGCGCCGCCC GGGACTGTAC GAACTGCCCT TCGGCGCCAC CCTGCGCGAC CTGATCGAGG CGGCCGGGGG GATGCCCGAG GGGCGCACCA TCCAGGCGGT CCTGCTCGGC GGCGCGGCCG GGACCTTCGT GCGCGGGGAC GAGCTGGACA TCCCGCTGAC CTTCGAGGGG GCCCGCGCGG CGGGCACGTC CCTGGGCTCG GGGGTGGTGC TGGTCCTGGA CGACACCGCC GACCTGGTCG CGACGCTGGT CCGGGTGGCG GCCTTCTTCC GCGACGAGTC CTGCGGCCAG TGCGTGCCCT GCCGGGTGGG CACCGTGCGC CAGGAGGAGT CGCTGCTGCG GATCAGGGCC GGGGGCGACG CCGCCGCCGA GGTGCCCCTG CTGCGCGAGG TCGGCCTGGC GATGCGCGAC GCCTCGATCT GCGGCCTGGG ACAGACCGCG TGGAACGCCG TGGAGTCGGC CATCGACCGG CTCGGCCTGT TCGACGCGAC CGACGACACC CACCAGAGCA CCGGCCCGGG CACCCGGGCG GCTACCGAGG AGGCCCGATG A
|
Protein sequence | MDLRFLDETP TTAEREAVES VLGPPESGWD GGTRQDADLR YARGGHSARD RRDLLLPALH AVNDRVGWIS RPALDHICRR LTVPPAEAYA VASFYAMFAL RRRPRRVVHL CTDIACAAAG SDRMRARLTE RLGPPGGHDG EAAWHESPCL GMCERAPAAL ALEAGAPARY AVAAPATPEA LDAMAARDLP ENGAPKTNGH RPSTTFGADP LPGTAPEPDP AAAVPDAGSP DHVLLRRIGT VDPLSLDDYR AAGGYAALRR ALRLGPAGVI REVSDSGLVG RGGAAFPTGR KWQATAQQPD GPHYLVCNAD ESEPGTFKDR VLMEGDPFSV VEAMTIAGYA VGARTGYLYI RGEYPRALRH LENAVALARG RGLLGPDILG SGFAFDIEIR RGAGAYICGE ETAIFGSIEG QRGEPRSKPP FPVEKGLFGK PTAVNNVETL VNVLPVLLMG GPAYAAVGTG QSTGPKLFCL SGGVRRPGLY ELPFGATLRD LIEAAGGMPE GRTIQAVLLG GAAGTFVRGD ELDIPLTFEG ARAAGTSLGS GVVLVLDDTA DLVATLVRVA AFFRDESCGQ CVPCRVGTVR QEESLLRIRA GGDAAAEVPL LREVGLAMRD ASICGLGQTA WNAVESAIDR LGLFDATDDT HQSTGPGTRA ATEEAR
|
| |