Gene Ndas_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0171 
Symbol 
ID9244002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp217429 
End bp219189 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content71% 
IMG OID 
Productacetolactate synthase, large subunit, biosynthetic type 
Protein accessionYP_003678127 
Protein GI297559153 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.458212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.847135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC AGATGACCGG CGCCCAATCG CTGATCAGGT CGCTGGAACA GGTCGGCGTT 
GACACTGTCT TCGGGATTCC CGGCGGCGCC ATCCTCCCGG CCTACGACCC GCTCTACGAC
TCGGCCAAGG TGCGCCACAT CCTCATGCGC CACGAGCAGG GGGCCGGTCA CGCCGCCGAG
GGCTACGCCT ACGCCACCGG CCGTCCGGGG GTGTGCATGG CCACCAGCGG GCCCGGCGCC
ACCAACCTGG TCACCCCGCT CGCCGACGCG CACATGGACT CGGTCCCGAT GGTCGCCATC
ACCGGCCAGG TCGCCTCCCC GATGATCGGC ACCGACGCCT TCCAGGAAGC CGACATCTGC
GGCATCACCA TGCCGATCAC CAAGCACAAC TTCCTGGTCA AGGACGTGCG GGAGATCCCG
CGGATCGTCG CCGAGGCCTT CCACATCGCC TCCAGCGGAC GCCCCGGCCC CGTCCTGGTC
GACATCTCCA AGGACGCCCT GCAGAACTCC GCCGAGTTCG TCTGGCCCGA GCGCCTCGAA
CTGCCCGGCT ACCGCCCGGT CACCAAGCCG CACGGCAAGC AGGTCCGCGA GGCCGCCCGC
ATGATCGCCG AGGCCCGGCG CCCGGTGTTC TACGTCGGCG GCGGCGTCCT GCGCGCCGGC
GCCTCCGCCG AGCTGCGCGT CCTGGCCGAG CTCACCGGCG CCCCCGTCGT CTCCACCCTG
ATGGCCCGCG GCGTCTTCCC CGACAGCCAC CCGCAGTCGG TGGGCATGCC CGGCATGCAC
GGCGACGTCG CCGCCGTCGG CGCCCTCCAG AAGGCCGACC TCATCGTCGC CCTGGGCGCG
CGCTTCGACG ACCGCGTCAC CGGGCGCCTG GACAGCTTCG CCCCCGACGC CAAGATCGTC
CACGCCGACA TCGACCCGGC CGAGATCTCC AAGAACCGCC ACGCCGACGT GCCCATCGTC
GGCGACTGCC GCGAGGTCCT CGCCGACCTG GTCGTGGCCG TGCGCGCCGA CCAGGAGAAG
GGCCGCCAGG GCGATTACGA GGCCTGGTGG CAGCAGCTCA ACCGCCTGCG CAACACCTAC
CCCAAGGGTT ACGAGGCGCC CGAGGACGGC AGCCTGTCCC CGCAGGCCGT CATCGAGCGC
CTGGGCAAGG TCGTCGGCCC GGAGGCCACC TACGTCGCGG GCGTGGGCCA GCACCAGATG
TGGGCCGCCC AGTTCATCGA CTACGAGCGC CCCGGCTCCT TCGTCAACTC CGGCGGCCTG
GGCACCATGG GCTTCTCCGT CCCCGCCGCG CTCGGCGCCA AGGTCGGCGA CCCCGACCGG
ACCGTGTGGT CGGTCGACGG CGACGGCTGC TTCCAGATGA CCAACCAGGA ACTGGCCACC
TGCGCGATCG AGGGCATCCC GGTCAAGGTC GCCGTGGTCA ACAACGGCAA CCTGGGCATG
GTCCGCCAGT GGCAGACCCT GTTCTACGAG GGCCGCTACT CCAACACCGA CCTCCAGACC
TCCCCGCGCG ACGAGAAGGT CCGCATCCCC GACTTCGTCC GCCTCGCGGA GGCGTACGGT
TGTGTCGGCC TGCGCTGCGA GCGAGCCGAG GACATCGACG CCACCATCGA GAAGGCCATG
GCGATCGACG ACGCCCCCGT CGTCGTGGAC TTCACCGTCA ACCACGACGC CATGGTCTGG
CCGATGGTCG GCCCCGGCGT CAGCAACGAC AACATCCAGT ACGCGCGCGA CATGGCGCCG
AACTGGGAAC ACGAGGACTA A
 
Protein sequence
MTEQMTGAQS LIRSLEQVGV DTVFGIPGGA ILPAYDPLYD SAKVRHILMR HEQGAGHAAE 
GYAYATGRPG VCMATSGPGA TNLVTPLADA HMDSVPMVAI TGQVASPMIG TDAFQEADIC
GITMPITKHN FLVKDVREIP RIVAEAFHIA SSGRPGPVLV DISKDALQNS AEFVWPERLE
LPGYRPVTKP HGKQVREAAR MIAEARRPVF YVGGGVLRAG ASAELRVLAE LTGAPVVSTL
MARGVFPDSH PQSVGMPGMH GDVAAVGALQ KADLIVALGA RFDDRVTGRL DSFAPDAKIV
HADIDPAEIS KNRHADVPIV GDCREVLADL VVAVRADQEK GRQGDYEAWW QQLNRLRNTY
PKGYEAPEDG SLSPQAVIER LGKVVGPEAT YVAGVGQHQM WAAQFIDYER PGSFVNSGGL
GTMGFSVPAA LGAKVGDPDR TVWSVDGDGC FQMTNQELAT CAIEGIPVKV AVVNNGNLGM
VRQWQTLFYE GRYSNTDLQT SPRDEKVRIP DFVRLAEAYG CVGLRCERAE DIDATIEKAM
AIDDAPVVVD FTVNHDAMVW PMVGPGVSND NIQYARDMAP NWEHED