Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0021 |
Symbol | |
ID | 9243848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 26261 |
End bp | 29515 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003677979 |
Protein GI | 297559005 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.12847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTAC GCACGTGGAT CGAGTCGTGG CCGGTGTACC GGCAGTTCAC GGGGGACGAC CCGACGGGGC GCGGGGCCGC CGCGAAGTCC CGCCGGTCGG AGAACCTCAG GTCCCGCACC GAGGAGGCCG ACCGGGTCGT GAAGTCGGTC TGCCCCTACT GCGCGGTGGG GTGCGGGCAG AACGTCTACG TCAAGGACGA GAAGGTCGTC CAGATCGAGG GCGACCCGGA CTCGCCGATC AGCCGCGGGC GGTTGTGCCC CAAGGGGTCG GCCAGCCTCC AGCTCACCAC CGGCTCCGCG CGCGAGGAGA GGGTCCTGTA CCGGCGCCCG TACGGCACCG AATGGGAGGA ACTCGACCTC GACACCGCCA TGCGGATGGT CGCCCACCGC GCGGTCCGCA CGCGCGAGCA GACCTGGGAG CGGGAGCACG AGGGGCAACG GGTGAACCGC ACGCTGGGGA TCGCCAGCCT CGGCGGCGCC ACGCTGGACA ACGAGGAGAA CTACCTCATC AAGAAGCTCC TGACCTCGCT CGGCGTCGTC CAGGTGGAGA ACCAGGCCCG CGTGTGCCAC AGCTCCACGG TCGCCGGGCT CGGGACGTCC TTCGGGCGCG GCGGGGCGAC CATGTTCCTC CAGGACCTGC AGAACTCCGA CTGCATCGTC ATCGAGGGCT CCAACTTCGC CGAGGCCCAC CCGGTCGGCT TCCAGTGGGT CATGGAGGCC AAGGCCCGCG GCGCGGTGGT CATCCACGTC GACCCCCGGT TCAGCCGCAC CAGCGCGCTC GCCGACATGC ACGTGCCCAT CCGCGCCGGC ACCGACATCG CCTTCCTCGG CGCGATCATC AACCACGTCC TCACCGAGGA GAAGTACTTC CTCGACTACG TGCGCGCCTT CACCAACGCC GCCACGATCG TCGGCGAGGA CTTCCAGGAC ACCGAGGACC TGGACGGCCT GTTCTCCGGC TTCTCGGAGG AGGACAAGAG CTACGACGCC AGCACCTGGA GCTACGAGGG CGCCGAGGTC GCCGCCGCCT CCGGCAACCG CAACCAGCTG TTCAGGGAGC GGCTGGAGGA GAACCTCGGC ACCAGCCACT CCGGGCGGCC CGAACAGCAG GGGTCGGGCG GCGCGGTCAT CCGGGAGAAG CCCAGGGAGG ACCCGACGCT CACCGACCCG CGCTGCGTGT TCCAGGTCCT CAAACGCCAC TACGCGCGCT ACACGCCGGA GCTGGTCGAG GAGGTCTGCG GTATTCCGAA GGAGACCTTC CGGAGGGTGT GCGACCACCT CACCGAGAAC TCCGGCCGGG ACCGCACCAG CGCCTTCTGC TACGCCGTGG GCTGGACGCA GCACACGGTC GGCTCCCAGT ACATCCGGGC CGCCTCCATC CTCCAGCTGC TGCTGGGCAA CATCGGCCGC CCGGGGGGAG GCATCCAGGC GCTGCGCGGA CACGCCAGCA TCCAGGGCTC CAGCGACGTG CCCACCCTGT TCGACCTGCT CCCGGGCTAC CTGCCGATGC CCCACGCCCA CGAGGAGCAG TCCCTGGAGA CCTACATCAT GGCCGCCGGC AACCCCAGGA AGGGGTTCTG GGACGGGATG GAGGCCTACA CCGTCAGCCT GCTCAAGGCG TGGTGGGGCG AGCACGCGAC GGCCGAGAAC GACTACTGCT TCGACCACCT GCCCCGGCTC ACCGGCTCCC ACAGCCACTA CGACACCGTG ATGGGGCAGA TCGCCGGAAA GTGCAAGGGC TACTTCCTCA TGGGCGAGAA CCCCGCCGTG GGATCGGCCA ACAGCAGGGC CCAGCGCATG GGCATGGCCG AACTGGACTG GCTGGTCGTG CGCGACTTCT CGCTGATCGA GAGCGCCACC TGGTGGAAGG ACGGGCCGGA GATCGAGTCC GGCGAGATGC GCACCGAGGA CATCGGCACC GAGGTGTTCT TCTTCCCGGC CGCCGCGCAC ACCGAGAAGT CGGGCACCTT CACCAACACC AACCGGCTGC TCCAGTGGCA CGACCGGGCC GTGAGCCCGA GCGGTGACCA GCGCAGCGAC CTGTGGTTCA TGTACCACCT GGGCCGCGAG ATCCGCGGTA TCCTCGCCGG CTCCGAGGAC CCCAAGGACC GCCCCGTCCT CGACCTCACC TGGGACTACC CCCTGGAGGA GGACGGGGAG CCCGACGCCG CCGCGGTCCT GCGCGAGATC AACGGACACG ACGCCGAGGG CCGCCCGCTC ACCGTCTACA CCGAGCTGCG CGACGACGGG TCGACCTCGT GCGGCTGCTG GATCTACTGC GGCGTCTTCA AGGACGGGGT CAACCAGGCG GCGCGCAGGA AGCCCCACAC CGAACAGGAC TGGATCGCCG GGGAGTGGGC CTGGGCGTGG CCGGACAACC GGCGCGTCCT GTACAACCGC GCCTCCGCAG ACGAGAACGG GGAGCCCTGG AGCCCCCGCA AGAGCCTCGT GTGGTGGGAC GCGGAGCAGG GGCGCTGGGT CGGGCACGAC ACCCCCGACT TCGAGAACCG CAAGGCGCCC GACTACGAGC CCGAGGAGGA CGCCGAGGGC GTCGCGGCGC TGACCGGTCG GGACGCCTTC ATCATGCAGG CCGACGGCAA GGGGTGGCTG TACGCGCCCG CGGGCCTCAA CGACGGCCCG ATGCCCACGC ACTACGAGCC GCAGGACACC CCGTTCGAGA ACCCGCTGTA CGGGCACCAG CGCAACCCGA CGCGTCTGCT GTACCCGCAC GAGTACAACC GGTACCACCC CGCCCCCGGC ACGCCCGGGT CCGACGTCTT CCCGTACGTG GTGACGACCT ACCGGCTCAC CGAGCACTTC ACCGCGGGCG GGATGAGCAG GTGGACGCCC TACCTGGCCG AACTCCAGCC CGAGTTCTTC TGCGAGGTGG GGCCCGAGCT GGCCGAGGAG CGCGGTCTGG AGCACGGCGG CTGGGCCACC GTCGTCACGG CGCGCAACGC CATCGAGGCC CGGGTCATGG TCACCGACCG GATGGCGCCG CTGCGGGTGC AGGGACGGGT GGTCCACCAG ATCGGGATGC CCTACCACTG GGGGCCCAAC GGGTACTCGA CCGGGGACGC GGTCAACGAG CTGATGCCCA TCGCGCTCGA CCCCAACGTG CACATCCAGG AGGTCAAGGC GATCACGGCC GACATCCGGC CGGGACGCAG GCCCCGGGGC GTCGAGCGCC TCGACCTGGT GCGCGAGTAC CGGGAGCGGG CCGGAATCAC GGAACAGACG GGACTGGAGG TGTGA
|
Protein sequence | MGVRTWIESW PVYRQFTGDD PTGRGAAAKS RRSENLRSRT EEADRVVKSV CPYCAVGCGQ NVYVKDEKVV QIEGDPDSPI SRGRLCPKGS ASLQLTTGSA REERVLYRRP YGTEWEELDL DTAMRMVAHR AVRTREQTWE REHEGQRVNR TLGIASLGGA TLDNEENYLI KKLLTSLGVV QVENQARVCH SSTVAGLGTS FGRGGATMFL QDLQNSDCIV IEGSNFAEAH PVGFQWVMEA KARGAVVIHV DPRFSRTSAL ADMHVPIRAG TDIAFLGAII NHVLTEEKYF LDYVRAFTNA ATIVGEDFQD TEDLDGLFSG FSEEDKSYDA STWSYEGAEV AAASGNRNQL FRERLEENLG TSHSGRPEQQ GSGGAVIREK PREDPTLTDP RCVFQVLKRH YARYTPELVE EVCGIPKETF RRVCDHLTEN SGRDRTSAFC YAVGWTQHTV GSQYIRAASI LQLLLGNIGR PGGGIQALRG HASIQGSSDV PTLFDLLPGY LPMPHAHEEQ SLETYIMAAG NPRKGFWDGM EAYTVSLLKA WWGEHATAEN DYCFDHLPRL TGSHSHYDTV MGQIAGKCKG YFLMGENPAV GSANSRAQRM GMAELDWLVV RDFSLIESAT WWKDGPEIES GEMRTEDIGT EVFFFPAAAH TEKSGTFTNT NRLLQWHDRA VSPSGDQRSD LWFMYHLGRE IRGILAGSED PKDRPVLDLT WDYPLEEDGE PDAAAVLREI NGHDAEGRPL TVYTELRDDG STSCGCWIYC GVFKDGVNQA ARRKPHTEQD WIAGEWAWAW PDNRRVLYNR ASADENGEPW SPRKSLVWWD AEQGRWVGHD TPDFENRKAP DYEPEEDAEG VAALTGRDAF IMQADGKGWL YAPAGLNDGP MPTHYEPQDT PFENPLYGHQ RNPTRLLYPH EYNRYHPAPG TPGSDVFPYV VTTYRLTEHF TAGGMSRWTP YLAELQPEFF CEVGPELAEE RGLEHGGWAT VVTARNAIEA RVMVTDRMAP LRVQGRVVHQ IGMPYHWGPN GYSTGDAVNE LMPIALDPNV HIQEVKAITA DIRPGRRPRG VERLDLVREY RERAGITEQT GLEV
|
| |