Gene Ndas_3681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3681 
Symbol 
ID9247550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4418017 
End bp4421676 
Gene Length3660 bp 
Protein Length1219 aa 
Translation table11 
GC content69% 
IMG OID 
Product2-oxoglutarate dehydrogenase, E1 subunit 
Protein accessionYP_003681585 
Protein GI297562611 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGTCTG AGGCGTCTCA ACCCCTGACA GATTTCGGTC CCAACGAGTG GTTGGTCGAG 
GAGCTTTATC AGAAGTACTT GAACGACCCG AACTCCGTTG ACAAGGCCTG GTGGAACTTC
TTCGCGGACT ACAAGCCCGC GGAGACTGCT GGCGCGAAGG GCGGGAGCGG CGCGAAGGCG
CAGGGCGAGA AGGCCCCCGC GGCCTCCGCC CCCAAGAAGA AGCCGACCGC CCCCGCGGCC
CCGGCTCCGT CCGCCAAGCC GAAGAGCAAG GACGACAAGG AGGGCAAGGA CGAGGCCCTC
CAGGTCAAGC AGGAGCGTCT GCGCGGCGCA CCCGCGCGGA CCGCGACCAA CATGGAGTCG
AGCCTGGGCC TGCCCACCGC CACCAGCGTG CGCGCGGTGC CGGTCAAGCT GCTCTTCGAC
AACCGCATCG TCATCAACAA CCACCTCCGG CGCGGCCGTG GCGGGAAGGT CTCCTTCACC
CACCTCATCG GTTACGCCAT GGTCAAGGCC CTCAAGGCCC TGCCGGAGAT GAACCACTCC
TACGTGGAGG TGGACGGCAA GCCCGGTGTC GCCAAGCCCG AGCACGTGAA CTTCGGCCTG
GCGATCGACC TCCAGAAGCC CGACGGCTCC CGGCAGCTGG TCGTGCCCTC CATCAAGAAG
TCCGACGAGA TGGACTTCAC GGAGTTCTGG AGCGGCTACG AGGACCTGGT CCGCAAGGCC
CGCAGCAACA AGCTGGGCGT GCCGGACTTC CAGGGCACCA CCATCAGCCT CACCAACCCC
GGCGGCATCG GCACCGTGCA CTCGGTCCCG CGCCTGATGC CGGGCCAGGG CACCATCCTG
GGCGTGGGCG CGATGGAGTA CCCGGCCGAG TTCCAGGGCG CCTCCTCCGA CACCCTGGCC
GAGCTGGGCA TCAGCAAGGT CATGACGCTG ACCTCCACCT ACGACCACCG CATCATCCAG
GGTGCGCAGT CCGGTGAGTT CCTGCGCCGG ATCCACCAGC TGCTGCTGGG CGAGGACGGT
TTCTACGACG AGATCTTCAA CTCGCTGCGC ATCCCCTACG AGCCGGTGCG CTGGGTGCAG
GACATCCACG TCAACAAGGC CACCCAGCTC GACAAGACCA CCCGGGTCCA GGAGCTGATC
CACGCCTACC GCGTGCGCGG CCACCTGATG GCCGACACCA ACCCGCTCGA CCACGAGCAG
CGCAAGCACC CCGACCTGGA CGTGCTCGAA CACGGCCTCA CCCTGTGGGA CCTGGACCGC
GAGTTCCCCA CCGGGGGCTT CGGCGGCAAG CAGGTCATGA AGCTGCGCGA CGTGCTGGGC
GTGCTGCGCG ACACCTACTG CCGCACGGTG GGTATCGAGT ACATGCACAT CCAGAGCCCG
GAGGAGCGGG AGTGGATCCA GGCGCACGTC GAGCGCGAGC ACGAGAAGCT CGACCGCGAC
GAGCAGCTGC ACATCCTGCA CCGGCTCAAC AGCGCCGAGG CCTTCGAGAC CTTCCTCCAG
ACCAAGTACG TGGGCCAGAA GCGCTTCTCC CTGGAGGGCG GCGAGTCGCT GATCCCGCTG
CTGGACGGCG TGATCTCCAA GGCCGCCAAG GCCGAGCTGG ACGAGGCCGT CATCGGCATG
GCCCACCGAG GCCGCCTGAA CGTGCTGGCC AACATCTGCG GCAAGTCCTA CGCGCAGATC
TTCGGTGAGT TCGAGGGCAA CCTCGACCCG CGCAGCGCGC ACGGCTCGGG CGACGTCAAG
TACCACCTGG GCACCGAGGG CACCTTCGAG ACCCACGACG GCCAGAAGAT CCGCATCTCG
CTGGCCGCCA ACCCGTCCCA CCTGGAGACG GTCGACCCGG TCGCCGAGGG CATCGTCCGC
GCCAAGCAGG ACGTGCTCAA CAAGGGCCCG CAGGGCTTCA CCGTCCTGCC GATCCTCATC
CACGGCGACG CCGCGTTCGC CGGGCAGGGC GTGGTCGCCG AGACGCTGAA CCTGTCGCAG
CTGCGCGGTT ACCGGACCGG CGGCACGGTG CACGTCATCG TGAACAACCA GGTGGGCTTC
ACCACCTCGC CGTCCGACAG CCGCTCCAGC GTCTACGCCA CGGACGTGGC GCGGATGGTG
CAGGCGCCGA TCTTCCACGT CAACGGGGAC GACCCCGAGG CCGTGGTCCG GGTCGCGCAC
CTGGCCTTCG CCTACCGCCA GGCGTTCAAC AAGGACGTCG TCATCGACCT GGTCTGCTAC
CGGCGCCGCG GCCACAACGA GGGCGACAAC CCCGCCTTCA CCCAGCCGCT GATGTACGAC
GTCATCGACG CCAAGCGCTC CACCCGCAAG CTGTTCACCG AGGCCCTCAT CGGCCGCGGC
GACATCACGG TGGAGGAGGC GGAGTCGGCG CTGCGCGACT ACCAGTCGGA GCTGGAGCGG
GCCTTCACCG AGACCCGCGA GGTCGAGAAG AAGCCGATCG AGCCCGGCTC CGTGGTCAAG
CCCGAGGTGT TCACCGAGGG CCGCCTGGAG CACTCCGCCG TCGAGACGGC GATCAGCACC
GAGACGGTCA AGCGGGTCAT CGACACCCAG GTGTCGCTGC CCGAGGGCTT CACGCCGCAC
CCGCGGCTGG CCCCCCAGCT CTCGCGCCGC GCGACGATGG TCGAGACCGA CGCGATCGAC
TGGGCCACCG GTGAGCTGCT GGCCTTCGGG TCGCTGCTGC TGGACGGCCA CCCCGTCCGC
CTCATCGGCC AGGACAGCCG CCGCGGCACC TTCGGCCAGC GCCACGCCAC GCTGATGGAC
CGCGAGACCG GCGAGACCCA CACGCCGCTC AAGCAGTTCG ACGACGGCAT CACCCGGTTC
CACGTGCACG ACTCCCTGCT GAGCGAGTAC GCGGCCCTGG GCTTCGAGTA CGGCTACTCG
CTGACCCGTC CCGACGCCCT GGTGATGTGG GAGGCGCAGT TCGGCGACTT CGTCAACGGC
GCGCAGACCA TCATCGACGA GTACATCAGC GCGGGCGAGC AGAAGTGGGG ACAGCGCTCC
AGCGTGACCC TGCTGCTGCC GCACGGCTAC GAGGGCCAGG GGCCGGACCA CTCCTCCGCC
CGCATCGAGC GGTTCCTGCA GCAGTGCGCG CAGGAGAACA TGACGGTGGC CCACCCCAGC
ACCCCGGCGA GCTACTTCCA CCTGCTGCGC TGGCAGGCCA AGTCGCCGCT GGAGCGCCCG
CTGGTGGTGT TCACGCCCAA GTCGCTGCTG CGCCTGAAGG CCGCCACCTC GGCCGCCGCG
GACTTCACCT CGGGCCACTT CGAGCCGCTG ATCAAGGACG ACTCGATCGC GCCGGACAAG
GTGCGGCGCG TGGTGCTGTG CTCGGGCAAG ATCTACTACG ACCTGGACGC GGCGCGCCGC
AAGAGCGGTG ACAAGCACAC CGCGATCATC CGGGCGGAGC GGCTCTACCC GCTGCCGATC
GAGGAGATCC GCGAGCAGCT CAAGGCCTAC CCCAACGCGG GCGAGGTCCT GTGGGTGCAG
GAGGAGCCGG CGAACATGGG CCCGTGGCCC TTCGTCGCGC TGGTCTTCTC CGAGCAGCTC
GACCGCCCGT TCACCCGGAT CTCGCGTCCG GCCTCCTCCG CGCCCGCGGC CGGTTCGGCC
AAGCGCCACG AGGCCGAGCA GCGGGCACTG GTGGACACGG TCTTCCCGCC CGCGGACTAG
 
Protein sequence
MSSEASQPLT DFGPNEWLVE ELYQKYLNDP NSVDKAWWNF FADYKPAETA GAKGGSGAKA 
QGEKAPAASA PKKKPTAPAA PAPSAKPKSK DDKEGKDEAL QVKQERLRGA PARTATNMES
SLGLPTATSV RAVPVKLLFD NRIVINNHLR RGRGGKVSFT HLIGYAMVKA LKALPEMNHS
YVEVDGKPGV AKPEHVNFGL AIDLQKPDGS RQLVVPSIKK SDEMDFTEFW SGYEDLVRKA
RSNKLGVPDF QGTTISLTNP GGIGTVHSVP RLMPGQGTIL GVGAMEYPAE FQGASSDTLA
ELGISKVMTL TSTYDHRIIQ GAQSGEFLRR IHQLLLGEDG FYDEIFNSLR IPYEPVRWVQ
DIHVNKATQL DKTTRVQELI HAYRVRGHLM ADTNPLDHEQ RKHPDLDVLE HGLTLWDLDR
EFPTGGFGGK QVMKLRDVLG VLRDTYCRTV GIEYMHIQSP EEREWIQAHV EREHEKLDRD
EQLHILHRLN SAEAFETFLQ TKYVGQKRFS LEGGESLIPL LDGVISKAAK AELDEAVIGM
AHRGRLNVLA NICGKSYAQI FGEFEGNLDP RSAHGSGDVK YHLGTEGTFE THDGQKIRIS
LAANPSHLET VDPVAEGIVR AKQDVLNKGP QGFTVLPILI HGDAAFAGQG VVAETLNLSQ
LRGYRTGGTV HVIVNNQVGF TTSPSDSRSS VYATDVARMV QAPIFHVNGD DPEAVVRVAH
LAFAYRQAFN KDVVIDLVCY RRRGHNEGDN PAFTQPLMYD VIDAKRSTRK LFTEALIGRG
DITVEEAESA LRDYQSELER AFTETREVEK KPIEPGSVVK PEVFTEGRLE HSAVETAIST
ETVKRVIDTQ VSLPEGFTPH PRLAPQLSRR ATMVETDAID WATGELLAFG SLLLDGHPVR
LIGQDSRRGT FGQRHATLMD RETGETHTPL KQFDDGITRF HVHDSLLSEY AALGFEYGYS
LTRPDALVMW EAQFGDFVNG AQTIIDEYIS AGEQKWGQRS SVTLLLPHGY EGQGPDHSSA
RIERFLQQCA QENMTVAHPS TPASYFHLLR WQAKSPLERP LVVFTPKSLL RLKAATSAAA
DFTSGHFEPL IKDDSIAPDK VRRVVLCSGK IYYDLDAARR KSGDKHTAII RAERLYPLPI
EEIREQLKAY PNAGEVLWVQ EEPANMGPWP FVALVFSEQL DRPFTRISRP ASSAPAAGSA
KRHEAEQRAL VDTVFPPAD