Gene Ndas_0726 details

Gene Information

Locus tag: Ndas_0726
Symbol:
ID: 9244568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 890186
End bp: 891886
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: 2-isopropylmalate synthase
Protein accession: YP_003678677
Protein GI: 297559703
COG category:
COG ID:
TIGRFAM ID:
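
The start and end coordinates, the gene length, and the protein length listed above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(890186, 891886)
print(length, protein_length(length))  # prints: 1701 566
```

Both values match the Gene Length and Protein Length fields of this record.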


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.719959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCCGC AGCAAAAGCC CAGCCCCATG CCCTTCCACC GCTACAAGCC CTTCGCGCCG 
GTGGACCTGC CCGACCGCAC CTGGCCCTCC AAGAGCATCA CCGAGGCCCC GCGCTGGCTG
TCCACGGACC TGCGCGACGG CAACCAGGCG CTGATCGAGC CGATGGACCC CGCCCGCAAG
CGCGAGATGT TCGAGCTGCT GGTGCGGATG GGCTACAAGG AGATCGAGGT AGGTTTCCCG
GCCGCCAGCC AGACCGACTT CGACTTCGTC CGGTCCCTGA TCGAGGGCGA CGCGATCCCC
GACGACGTGC AGATCTCGGT GCTGACCCAG GCCCGTGAGG ACCTCATCGA GCGCACCGTG
CAGAGCCTGG TCGGGGCGAA GCGCGCCACC GTGCACCTGT ACAACGCCAC CGCGCCCACC
TTCCGCCGTG TCGTCTTCCG CGTGGACCGC GAGGCCTGCA AGGACATCGC CGTCCAGGGC
ACCCGGCACG TCATGCGCTT CGCCGAGCAG TACCTGGGCG AGACCGAGTA CTTCGGCTAC
GAGTACTCGC CCGAGATCTT CATCGACACC GAGCTGGACT TCGCCCTGGA GGTCTGCGAG
GCCGTCATGG ACGTCTGGCA GCCCGGCCCG GGCCGCGAGA TCATCCTCAA CCTGCCCGCG
ACCGTCGAGC GCTCCACGCC CAACGTCTAC GCCGACCAGA TCGAGTGGAT GAGCCGCAGC
CTGTCCCGGC GCGAGCACGT GGTCGTCTCG GTGCACCCGC ACAACGACCG CGGCACCGGC
GTGGCCTCGG CCGAGCTGGC CGTCATGGCC GGGGCCGACC GCGTCGAGGG CTGCCTGTTC
GGGCACGGCG AGCGCACCGG CAACGTCTGC CTGGTCACCC TGGGCATGAA CCTGTTCAGC
CAGGGCGTGG ACCCCCGGAT CGACTTCTCC GACATCGACG AGATCCGCCG CACCGTCGAG
CACTGCACCC AGCTGCCGGT CGCCCCGCGC CACCCCTACG GCGGCGACCT GGTCTACACC
GCCTTCTCCG GCTCCCACCA GGACGCCATC AAGAAGGGCT TCGCCGCCCA GCAGGAGGCC
GCCGACGCCG CGGGGACCCC GGTGGAGGAG CACGTCTGGG ACGTGCCCTA CCTGCCCATC
GACCCCAAGG ACGTGGGCCG CAACTACGAG GCCGTCATCC GGGTCAACAG CCAGTCCGGC
AAGGGCGGCG TCTCCTACAT CATGCAGCGC GACCACTCGC TGGACCTGCC GCGCCGCCTC
CAGATCGAGT TCTCCCAGGT CATCCAGAAG TTCACCGACG CCGAGGGCGG CGAGTTCGCC
GCCGGGCGCA TCTGGGAGAT CTTCTCGCAG ACCTACCTGG CCGAGGGCGG ACCGGTCGCG
GTGCTGGCGC ACCGCTCCAC CACCGACAGC GACGGCACCT ACCGGATCGA GGCCGACGCC
CGGGTCAACG GCGAGATCCG CGAGCTGACC GGCACCGGCA ACGGCCCCAT CTCCGCGTTC
TGCGACGCCC TGACCGACGT CGACGTCAAG GTCCGCGTCA TGGACTACGT GGAGCACTCC
ATGGGCGGGG ACGGCGACGC CCGCGCCGCC GCCTACGTGG AGGCCGAGAT CGACGGCCGC
GTGGTGTGGG GCGTGGGCAT CCACAGCAGC ATCACCACGG CCTCGCTCAA GGCGCTGTGC
AGCGCCATCG CGCGCGTCTG A
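
The GC content reported for this gene can be recomputed from the sequence above once the spaces and line breaks used for display are stripped. The sketch below is an illustration only, not the database's own method. `gc_percent` is a hypothetical helper, and only the first sequence line is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied with its display spacing.
first_line = "ATGGTGCCGC AGCAAAAGCC CAGCCCCATG CCCTTCCACC GCTACAAGCC CTTCGCGCCG"
print(f"{gc_percent(first_line):.1f}% GC in the first sequence line")
```

Passing the full 1701 bp sequence instead of `first_line` gives the value to compare against the GC content field.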
 
Protein sequence
MVPQQKPSPM PFHRYKPFAP VDLPDRTWPS KSITEAPRWL STDLRDGNQA LIEPMDPARK 
REMFELLVRM GYKEIEVGFP AASQTDFDFV RSLIEGDAIP DDVQISVLTQ AREDLIERTV
QSLVGAKRAT VHLYNATAPT FRRVVFRVDR EACKDIAVQG TRHVMRFAEQ YLGETEYFGY
EYSPEIFIDT ELDFALEVCE AVMDVWQPGP GREIILNLPA TVERSTPNVY ADQIEWMSRS
LSRREHVVVS VHPHNDRGTG VASAELAVMA GADRVEGCLF GHGERTGNVC LVTLGMNLFS
QGVDPRIDFS DIDEIRRTVE HCTQLPVAPR HPYGGDLVYT AFSGSHQDAI KKGFAAQQEA
ADAAGTPVEE HVWDVPYLPI DPKDVGRNYE AVIRVNSQSG KGGVSYIMQR DHSLDLPRRL
QIEFSQVIQK FTDAEGGEFA AGRIWEIFSQ TYLAEGGPVA VLAHRSTTDS DGTYRIEADA
RVNGEIRELT GTGNGPISAF CDALTDVDVK VRVMDYVEHS MGGDGDARAA AYVEAEIDGR
VVWGVGIHSS ITTASLKALC SAIARV
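
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows. It assumes that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not handle specially. `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTGCCGC AGCAAAAGCC CAGCCCCATG"))  # prints: MVPQQKPSPM
```

The output matches the first ten residues of the protein sequence. The final sequence line, ending in the stop codon TGA, translates to the last six residues, SAIARV.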