Gene Ndas_0683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0683 
Symbol 
ID9244525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp837523 
End bp839127 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content70% 
IMG OID 
Productmalate synthase A 
Protein accessionYP_003678634 
Protein GI297559660 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.69761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCCA CGCACGGGGT CGAGATCACC GGCCCCCTCC ACGAACGGTT CGACGAGATC 
CTCACCGAGG ACGCCCTCGC CCTCGTCGCC GAGCTGCACC GCGCCTTCGA GGCGCGCCGC
CAGGAGCTCC TCGAAGCCCG GGCGGCCAGG CAGGAGCAGA TCTCGGCCGG GGCCGACCTC
GACTTCCTCC CCGAGACCAA GCACATCCGT GAGGACGACA GCTGGCGGGT CGCCCCTCCC
GCCCCGGGCA TCACGGACCG CCGCGTGGAG ATCACCGGCC CCACCGACCG CAAGATGACC
ATCAACGCGC TCAACTCCGG CGCCAAGGTG TGGCTGGCCG ACTTCGAGGA CGCCAACACC
CCGCTGTGGG AGAACATGAT CGGGGGCCAG CTCAACCTGC GCGACGCCCT CGACCGCACC
ATCGACTTCA CCTCCCCCCA GGGCAAGACC TACGCCCTGA AGGAGGACGG CGAGCTCGCC
ACCGTCGTCG TGCGCCCGCG CGGCTGGCAC CTGGACGAGA AGCACGTCCT CGTGGACGGC
CAGCGCGTCA GCGGCGGCCT GCTGGACTTC GCCCTCTACT TCTTCCACTG CGCCCAGCGC
CAGATCGACA AGGGCAGGGG ACCCTACTTC TACCTGCCCA AGATGCAGAG CCACCTCGAG
GCGCGCCTGT GGAACGACGT CTTCGTCCTG GCCCAGGAGC GCCTGGGCAT CCCGCGCGGC
ACCATCCGCG CGACCTGCCT CATCGAGACC ATCCCGGCCG CGTTCGAGAT GGAGGAGATC
CTCTACGAGC TGCGCGAGCA CTCCGCGGGC CTCAACGCGG GCCGCTGGGA CTACCTGTTC
AGCATCATCA AGACGCACCG CACCCGCGGC CGCAGGTTCC TGCTGCCCGA GCGCAACGCC
GTCACGATGA CCGCGCCGAT GATGCGCGCC TACACCGAAC TGCTGGTCAA GACCTGCCAC
AAGCGCGGCG CCCACGCCAT CGGCGGCATG GCGGCCTTCA TCCCCTCCCG CAGGGACGAG
GAGGTCAACA GGACCGCCTT CGCCAAGGTC CGCGACGACA AGTCCCGCGA GTCCGGCGAC
GGCTTCGACG GCTCCTGGGT CGCCCACCCG GGTCTGGTCC CGGTGGCCAT GGAGGTCTTC
GACGGCGTCC TGGGCGAGCG CCCCCACCAG ATCGACAAGC AGCGCCCCGA GGTCGAGGTC
TCGGCCGAGG ACCTGCTGGC CGTGGACAGG ACCCCGGGCG GCGTCACCCT CGCGGGCCTG
CGCGGCAACG TCAACGTCGC CCTGCAGTAC CTGGCGACGT GGATGGGCGG CAACGGCGCG
GTGGCCATCC ACAACCTCAT GGAGGACGCC GCGACCGCCG AGATCTCGCG CTCCCAGGTC
TGGCAGTGGC TGCACAACGA CATCACGCTC GACAACGGCC CCAAGGTCAC CGCCGACCTG
GTCCGGGGGA TCATCGACGA GGAGCTCGCC GCCATCCGCG AACAGCTGGG CGCGGACTTC
GACGAGGACC TGTACCAGCA GGCCGGCGAG CTGTTCACCG AGGTGGCCCT GGCCGACGAG
TACGTCGACT TCCTGACGCT GCCCGCCTAC GAGCGCATGC CGTAG
 
Protein sequence
MGATHGVEIT GPLHERFDEI LTEDALALVA ELHRAFEARR QELLEARAAR QEQISAGADL 
DFLPETKHIR EDDSWRVAPP APGITDRRVE ITGPTDRKMT INALNSGAKV WLADFEDANT
PLWENMIGGQ LNLRDALDRT IDFTSPQGKT YALKEDGELA TVVVRPRGWH LDEKHVLVDG
QRVSGGLLDF ALYFFHCAQR QIDKGRGPYF YLPKMQSHLE ARLWNDVFVL AQERLGIPRG
TIRATCLIET IPAAFEMEEI LYELREHSAG LNAGRWDYLF SIIKTHRTRG RRFLLPERNA
VTMTAPMMRA YTELLVKTCH KRGAHAIGGM AAFIPSRRDE EVNRTAFAKV RDDKSRESGD
GFDGSWVAHP GLVPVAMEVF DGVLGERPHQ IDKQRPEVEV SAEDLLAVDR TPGGVTLAGL
RGNVNVALQY LATWMGGNGA VAIHNLMEDA ATAEISRSQV WQWLHNDITL DNGPKVTADL
VRGIIDEELA AIREQLGADF DEDLYQQAGE LFTEVALADE YVDFLTLPAY ERMP