Gene Ndas_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0021 
Symbol 
ID9243848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp26261 
End bp29515 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content70% 
IMG OID 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_003677979 
Protein GI297559005 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.12847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTAC GCACGTGGAT CGAGTCGTGG CCGGTGTACC GGCAGTTCAC GGGGGACGAC 
CCGACGGGGC GCGGGGCCGC CGCGAAGTCC CGCCGGTCGG AGAACCTCAG GTCCCGCACC
GAGGAGGCCG ACCGGGTCGT GAAGTCGGTC TGCCCCTACT GCGCGGTGGG GTGCGGGCAG
AACGTCTACG TCAAGGACGA GAAGGTCGTC CAGATCGAGG GCGACCCGGA CTCGCCGATC
AGCCGCGGGC GGTTGTGCCC CAAGGGGTCG GCCAGCCTCC AGCTCACCAC CGGCTCCGCG
CGCGAGGAGA GGGTCCTGTA CCGGCGCCCG TACGGCACCG AATGGGAGGA ACTCGACCTC
GACACCGCCA TGCGGATGGT CGCCCACCGC GCGGTCCGCA CGCGCGAGCA GACCTGGGAG
CGGGAGCACG AGGGGCAACG GGTGAACCGC ACGCTGGGGA TCGCCAGCCT CGGCGGCGCC
ACGCTGGACA ACGAGGAGAA CTACCTCATC AAGAAGCTCC TGACCTCGCT CGGCGTCGTC
CAGGTGGAGA ACCAGGCCCG CGTGTGCCAC AGCTCCACGG TCGCCGGGCT CGGGACGTCC
TTCGGGCGCG GCGGGGCGAC CATGTTCCTC CAGGACCTGC AGAACTCCGA CTGCATCGTC
ATCGAGGGCT CCAACTTCGC CGAGGCCCAC CCGGTCGGCT TCCAGTGGGT CATGGAGGCC
AAGGCCCGCG GCGCGGTGGT CATCCACGTC GACCCCCGGT TCAGCCGCAC CAGCGCGCTC
GCCGACATGC ACGTGCCCAT CCGCGCCGGC ACCGACATCG CCTTCCTCGG CGCGATCATC
AACCACGTCC TCACCGAGGA GAAGTACTTC CTCGACTACG TGCGCGCCTT CACCAACGCC
GCCACGATCG TCGGCGAGGA CTTCCAGGAC ACCGAGGACC TGGACGGCCT GTTCTCCGGC
TTCTCGGAGG AGGACAAGAG CTACGACGCC AGCACCTGGA GCTACGAGGG CGCCGAGGTC
GCCGCCGCCT CCGGCAACCG CAACCAGCTG TTCAGGGAGC GGCTGGAGGA GAACCTCGGC
ACCAGCCACT CCGGGCGGCC CGAACAGCAG GGGTCGGGCG GCGCGGTCAT CCGGGAGAAG
CCCAGGGAGG ACCCGACGCT CACCGACCCG CGCTGCGTGT TCCAGGTCCT CAAACGCCAC
TACGCGCGCT ACACGCCGGA GCTGGTCGAG GAGGTCTGCG GTATTCCGAA GGAGACCTTC
CGGAGGGTGT GCGACCACCT CACCGAGAAC TCCGGCCGGG ACCGCACCAG CGCCTTCTGC
TACGCCGTGG GCTGGACGCA GCACACGGTC GGCTCCCAGT ACATCCGGGC CGCCTCCATC
CTCCAGCTGC TGCTGGGCAA CATCGGCCGC CCGGGGGGAG GCATCCAGGC GCTGCGCGGA
CACGCCAGCA TCCAGGGCTC CAGCGACGTG CCCACCCTGT TCGACCTGCT CCCGGGCTAC
CTGCCGATGC CCCACGCCCA CGAGGAGCAG TCCCTGGAGA CCTACATCAT GGCCGCCGGC
AACCCCAGGA AGGGGTTCTG GGACGGGATG GAGGCCTACA CCGTCAGCCT GCTCAAGGCG
TGGTGGGGCG AGCACGCGAC GGCCGAGAAC GACTACTGCT TCGACCACCT GCCCCGGCTC
ACCGGCTCCC ACAGCCACTA CGACACCGTG ATGGGGCAGA TCGCCGGAAA GTGCAAGGGC
TACTTCCTCA TGGGCGAGAA CCCCGCCGTG GGATCGGCCA ACAGCAGGGC CCAGCGCATG
GGCATGGCCG AACTGGACTG GCTGGTCGTG CGCGACTTCT CGCTGATCGA GAGCGCCACC
TGGTGGAAGG ACGGGCCGGA GATCGAGTCC GGCGAGATGC GCACCGAGGA CATCGGCACC
GAGGTGTTCT TCTTCCCGGC CGCCGCGCAC ACCGAGAAGT CGGGCACCTT CACCAACACC
AACCGGCTGC TCCAGTGGCA CGACCGGGCC GTGAGCCCGA GCGGTGACCA GCGCAGCGAC
CTGTGGTTCA TGTACCACCT GGGCCGCGAG ATCCGCGGTA TCCTCGCCGG CTCCGAGGAC
CCCAAGGACC GCCCCGTCCT CGACCTCACC TGGGACTACC CCCTGGAGGA GGACGGGGAG
CCCGACGCCG CCGCGGTCCT GCGCGAGATC AACGGACACG ACGCCGAGGG CCGCCCGCTC
ACCGTCTACA CCGAGCTGCG CGACGACGGG TCGACCTCGT GCGGCTGCTG GATCTACTGC
GGCGTCTTCA AGGACGGGGT CAACCAGGCG GCGCGCAGGA AGCCCCACAC CGAACAGGAC
TGGATCGCCG GGGAGTGGGC CTGGGCGTGG CCGGACAACC GGCGCGTCCT GTACAACCGC
GCCTCCGCAG ACGAGAACGG GGAGCCCTGG AGCCCCCGCA AGAGCCTCGT GTGGTGGGAC
GCGGAGCAGG GGCGCTGGGT CGGGCACGAC ACCCCCGACT TCGAGAACCG CAAGGCGCCC
GACTACGAGC CCGAGGAGGA CGCCGAGGGC GTCGCGGCGC TGACCGGTCG GGACGCCTTC
ATCATGCAGG CCGACGGCAA GGGGTGGCTG TACGCGCCCG CGGGCCTCAA CGACGGCCCG
ATGCCCACGC ACTACGAGCC GCAGGACACC CCGTTCGAGA ACCCGCTGTA CGGGCACCAG
CGCAACCCGA CGCGTCTGCT GTACCCGCAC GAGTACAACC GGTACCACCC CGCCCCCGGC
ACGCCCGGGT CCGACGTCTT CCCGTACGTG GTGACGACCT ACCGGCTCAC CGAGCACTTC
ACCGCGGGCG GGATGAGCAG GTGGACGCCC TACCTGGCCG AACTCCAGCC CGAGTTCTTC
TGCGAGGTGG GGCCCGAGCT GGCCGAGGAG CGCGGTCTGG AGCACGGCGG CTGGGCCACC
GTCGTCACGG CGCGCAACGC CATCGAGGCC CGGGTCATGG TCACCGACCG GATGGCGCCG
CTGCGGGTGC AGGGACGGGT GGTCCACCAG ATCGGGATGC CCTACCACTG GGGGCCCAAC
GGGTACTCGA CCGGGGACGC GGTCAACGAG CTGATGCCCA TCGCGCTCGA CCCCAACGTG
CACATCCAGG AGGTCAAGGC GATCACGGCC GACATCCGGC CGGGACGCAG GCCCCGGGGC
GTCGAGCGCC TCGACCTGGT GCGCGAGTAC CGGGAGCGGG CCGGAATCAC GGAACAGACG
GGACTGGAGG TGTGA
 
Protein sequence
MGVRTWIESW PVYRQFTGDD PTGRGAAAKS RRSENLRSRT EEADRVVKSV CPYCAVGCGQ 
NVYVKDEKVV QIEGDPDSPI SRGRLCPKGS ASLQLTTGSA REERVLYRRP YGTEWEELDL
DTAMRMVAHR AVRTREQTWE REHEGQRVNR TLGIASLGGA TLDNEENYLI KKLLTSLGVV
QVENQARVCH SSTVAGLGTS FGRGGATMFL QDLQNSDCIV IEGSNFAEAH PVGFQWVMEA
KARGAVVIHV DPRFSRTSAL ADMHVPIRAG TDIAFLGAII NHVLTEEKYF LDYVRAFTNA
ATIVGEDFQD TEDLDGLFSG FSEEDKSYDA STWSYEGAEV AAASGNRNQL FRERLEENLG
TSHSGRPEQQ GSGGAVIREK PREDPTLTDP RCVFQVLKRH YARYTPELVE EVCGIPKETF
RRVCDHLTEN SGRDRTSAFC YAVGWTQHTV GSQYIRAASI LQLLLGNIGR PGGGIQALRG
HASIQGSSDV PTLFDLLPGY LPMPHAHEEQ SLETYIMAAG NPRKGFWDGM EAYTVSLLKA
WWGEHATAEN DYCFDHLPRL TGSHSHYDTV MGQIAGKCKG YFLMGENPAV GSANSRAQRM
GMAELDWLVV RDFSLIESAT WWKDGPEIES GEMRTEDIGT EVFFFPAAAH TEKSGTFTNT
NRLLQWHDRA VSPSGDQRSD LWFMYHLGRE IRGILAGSED PKDRPVLDLT WDYPLEEDGE
PDAAAVLREI NGHDAEGRPL TVYTELRDDG STSCGCWIYC GVFKDGVNQA ARRKPHTEQD
WIAGEWAWAW PDNRRVLYNR ASADENGEPW SPRKSLVWWD AEQGRWVGHD TPDFENRKAP
DYEPEEDAEG VAALTGRDAF IMQADGKGWL YAPAGLNDGP MPTHYEPQDT PFENPLYGHQ
RNPTRLLYPH EYNRYHPAPG TPGSDVFPYV VTTYRLTEHF TAGGMSRWTP YLAELQPEFF
CEVGPELAEE RGLEHGGWAT VVTARNAIEA RVMVTDRMAP LRVQGRVVHQ IGMPYHWGPN
GYSTGDAVNE LMPIALDPNV HIQEVKAITA DIRPGRRPRG VERLDLVREY RERAGITEQT
GLEV