Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5056 |
Symbol | |
ID | 9248945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 195982 |
End bp | 197277 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | NADH dehydrogenase I, D subunit |
Protein accession | YP_003682943 |
Protein GI | 297563970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCA CCGAGGACTA CATCGACGCC TCCGGCGGGG ACTGGGACGA CGTCATCGAG AAGGCGCAGG CCACCAGCGC CGAACGCCTC GTCGTCAACA TGGGACCCCA GCACCCCTCC ACGCACGGCG TCCTGCGGCT CATCCTCACC CTCGACGGTG AGACCTGCAC CGAGGCGCGC GTCGGTATCG GCTACCTGCA CACCGGTATC GAGAAGAACA TGGAGTACCG GACGTGGACG CAGGGCACCA CGTTCGTGAC CCGCATGGAC TACCTGACGC CGCTGTTCAA CGAGGCGGCG TACTGCCTGG CGGTCGAGAA GCTGCTCGGC ATCGAGGACC GCGTCCCCGA GCGGGCCAGC GTCATCCGCG TGATGATGAT GGAGCTCAAC CGGATGGCCT CGCACTTCGT GGCGATGGCG ACCTTCGGCA TGGAGCTGGG CGCGACCACG GTCATGACCA ACGGCTTCCG TGAGCGCGAG ATGATCCTGG ACATCTTCGA GCTGGTCACC GGCCTGAGGA TGAACCACGC CTACATCCGC CCCGGCGGCG TGGCCCAGGA CCTGCCGCCC GGCGCGGCCG GCAAGGTCCG CGAGCTGCTC AAGGAGATGC CCAAGCGCAT CGCCGTCATG CGCAAGCTCC TGGACGAGAA CCCCGTCTAC CTCGCCCGCA CCAAGGACGT GGCCCACCTC AACCTGCCCG GCTGCATGGC GCTCGGCGTC ACCGGCCCGC TGCTGCGGGC CTCCGGCCTG GCCTGGGACC TGCGCAAGGC CAAGCCCTAC TGCGGCTACG AGGGCTACGA GTTCGACGTC CCCGTCTCCG ACGGCGGCGA CGTCTACGCC CGCTACCGGG TGCGCATGGC CGAGATGGAG GAGAGCCTGA AGATCATCGA GCAGTGCCTG GACAAGCTCC AGCCCGGCCC GGTGATGATC CAGGACGCCA AGATCGGATG GCCCGCCAAG CTGGCGCTGG GGCCCGACGG CCTGGGCAAC TCGCCCGACC ACATCGCGCA CATCATGAGC GGCTCCATGG AGGCGCTCAT CCACCACTTC AAGCTGGTCA CCGAGGGCTT CCGGGTCCCC GCGGGCCAGG CCTACGCGGC CGTCGAGAGC GCCAAGGGCG AACTCGGCTG CTACGCGGTC AGCGACGGGA GCACCCGCCC CCACCGCGTG CACTTCCGCG ACCCCTCGTT CACCCACCTG CAGGCCGTCG CGGCCATGTG CGAGGGGGGA ACGGTGGCGG ACGTCATCGC CGCCGTGGCC AGTATCGACC CGGTGATGGG AGGCGTGGAC CGGTGA
|
Protein sequence | MTITEDYIDA SGGDWDDVIE KAQATSAERL VVNMGPQHPS THGVLRLILT LDGETCTEAR VGIGYLHTGI EKNMEYRTWT QGTTFVTRMD YLTPLFNEAA YCLAVEKLLG IEDRVPERAS VIRVMMMELN RMASHFVAMA TFGMELGATT VMTNGFRERE MILDIFELVT GLRMNHAYIR PGGVAQDLPP GAAGKVRELL KEMPKRIAVM RKLLDENPVY LARTKDVAHL NLPGCMALGV TGPLLRASGL AWDLRKAKPY CGYEGYEFDV PVSDGGDVYA RYRVRMAEME ESLKIIEQCL DKLQPGPVMI QDAKIGWPAK LALGPDGLGN SPDHIAHIMS GSMEALIHHF KLVTEGFRVP AGQAYAAVES AKGELGCYAV SDGSTRPHRV HFRDPSFTHL QAVAAMCEGG TVADVIAAVA SIDPVMGGVD R
|
| |