Gene Ndas_3253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3253 
Symbol 
ID9247110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3887830 
End bp3889887 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content72% 
IMG OID 
Productacetoacetyl-CoA synthase 
Protein accessionYP_003681165 
Protein GI297562191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGA ACAGCGTGCC CCGGGTGCTC TGGAAGCCCG ACGAGGAGAG GGCCGCGTCC 
TCCAACATCG TCGCGTTCGC CCGATGGGCC GAGCGCGAGA AGGGGGTCCG GGCCTTCGGC
GACGCCTCCG CCGCGTCCTT CGACTACGAC TCCCTGTGGC GCTGGTCGGT CACCGACGTC
GACGGCTTCT GGGAGTCGGT CTGGGAGTAC TACGGCGTGC GCTCCGAGAC CCCCTACGAG
CGGGTCCTGG GCGAGCGCAC CATGCCCGGA GCCGAGTGGT TCCCCGGGGC CACCCTCAAC
TACGCCCGGC ACGTCTTCGA GGGCCGCGAC GACGACCGCG TCGCCATCCG CCACGCCACC
GAACTGCGCG GACTCGGCGA GTGGACCTGG GGCGACCTGC GCCGCCGCAC CGCCGCCATC
GCCGCCGGCC TGCGCGGACT GGGCGTGGGT CCCGGCGACC GCGTCGTGGC CTACCTGCCC
AACCTGCCCG AGACCGTCGC CGCCTTCTAC GCCGCGGCCT CCCTGGGCGC GGTGTGGTCC
TCCTGCTCCC CGGACTTCGG CGTGCGCAGC GTCATCGACC GCTTCGCCCA GATCGAACCC
AAGGTCCTGC TCGCCGTGGA CGGCTACCGC TACGGGGGCA AGGACTTCGA CCGCCGCCCC
GTCGTGGAGG AGCTGCGCGC GGCCCTGCCC ACCGTCGAGC ACACCGTCCT CCTGGACTAC
CTCCACCCCG GAGAGTTCCT CGACGGCACC CTGCCCTGGG GTCGTCTGGA GGAGTCCGGG
GCAGGGGCCG AGCTGGAGTT CGCGTCCGTC CCCTTCGACC ACCCCCTGTG GGTGCTCTAC
TCCTCCGGAA CCACCGGCCT GCCCAAGGCC ATCGTCCACG GGCACGGCGG CATCCTGCTC
GAACAGCTCA AGAACCTCCA CCTGCACCTG GACGCCCAGG AGCACGACCG GGTGTTCTGG
TTCACCACCA CCGGCTGGAT GATGTGGAAC TTCCTGGTCA GCGTGCTGCT GACGAAGGCC
TCCATCGTCC TGTACGACGG CAGCCCCGCC CACCCGTCGC CCTCCACCAC CCTCAACGTA
CGGCCTTCGG CCGCAGCAAC TCCCCAGCCC GACCTGGGCG CCCTGTGGGA CCTGGCCGCC
GAGGCCGGGG TCACCGTCTT CGGCACCAGC GCCGGGTTCC TCTCCTCCTG CATGAAGGAG
GGCGTCCACC CCCGCCGGGG CCGCGACCTG TCCGCGCTCA AGGCCATCGG CTCCACCGGC
TCGCCGCTCA GCCCCGAGGC CTTCTCCTGG GTCTACGAGG AGTTCGGCGA GGACCTGTGG
CTGTTCTCCA CCTCCGGCGG CACCGACGTG TGCAGCTGCC TGGTCGGCGG CACCCCGACC
CTGCCCGTCC ACGAGGGCGA GATCCAGTCC CGGGCCCTGG GCATGGCCGT GGCCTCCTGG
GACCCCGACG GCAAGGAGCT GGTCAACGAG GTCGGCGAAC TCGTCGTCAC CGAGCCCGCC
CCGTCCATGC CGCTGTTCCT GTGGGGCGAC GACAGCGGCG AGCGGCTGCG CGAGAGCTAC
TTCTCCGTCT ACCCGGGCGT CTGGCGGCAC GGCGACTGGA TCGCGATCAC CGACCGCGGC
ACCGCGGTCA TCTACGGCCG CTCCGACTCC ACCATCAACC GCGGCGGCGT GCGCATGGGC
ACCAGCGAGA TCTACCGCGC GGTCCTGGCT CTGGACGAGG TCGTCGACGC GCTGGTCGTG
GACGTGCCCC AGGCCGACGG CTCCTCCCGG ATCGAGCTGT TCACCGTGCT GCGGGAGGGC
GCCGACCTGG AGGGCGACCT GCCCCGGGAG ATCGCCCGCC GCATCCGCAC CGACTGCTCG
CCCCGCCACG TGCCCGACCG CGTCCGCGTC ATCGGCGCGG TGCCCCGGAC CCTGTCGGGC
AAGGTCCTGG AGGTGCCGGT CAAGCGCATC CTCATGGGGG AGCGGCCCGA CCGGGTCGCC
AGCCGCGACT CACTGGCCAA CCCCGAGGCG CTCGACTTCT TCAGCACCCT GGCCGGGGAG
CGGGAGAACA CGGGCTGA
 
Protein sequence
MTENSVPRVL WKPDEERAAS SNIVAFARWA EREKGVRAFG DASAASFDYD SLWRWSVTDV 
DGFWESVWEY YGVRSETPYE RVLGERTMPG AEWFPGATLN YARHVFEGRD DDRVAIRHAT
ELRGLGEWTW GDLRRRTAAI AAGLRGLGVG PGDRVVAYLP NLPETVAAFY AAASLGAVWS
SCSPDFGVRS VIDRFAQIEP KVLLAVDGYR YGGKDFDRRP VVEELRAALP TVEHTVLLDY
LHPGEFLDGT LPWGRLEESG AGAELEFASV PFDHPLWVLY SSGTTGLPKA IVHGHGGILL
EQLKNLHLHL DAQEHDRVFW FTTTGWMMWN FLVSVLLTKA SIVLYDGSPA HPSPSTTLNV
RPSAAATPQP DLGALWDLAA EAGVTVFGTS AGFLSSCMKE GVHPRRGRDL SALKAIGSTG
SPLSPEAFSW VYEEFGEDLW LFSTSGGTDV CSCLVGGTPT LPVHEGEIQS RALGMAVASW
DPDGKELVNE VGELVVTEPA PSMPLFLWGD DSGERLRESY FSVYPGVWRH GDWIAITDRG
TAVIYGRSDS TINRGGVRMG TSEIYRAVLA LDEVVDALVV DVPQADGSSR IELFTVLREG
ADLEGDLPRE IARRIRTDCS PRHVPDRVRV IGAVPRTLSG KVLEVPVKRI LMGERPDRVA
SRDSLANPEA LDFFSTLAGE RENTG