Gene Ndas_5331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5331 
Symbol 
ID9249231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp496771 
End bp498741 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content70% 
IMG OID 
Productacetate/CoA ligase 
Protein accessionYP_003683217 
Protein GI297564244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.151979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGACA GCACTCCCGC GAGCCAGGAG ACACTGTCCA ACCTTCTCCA CGAGACGCGC 
AGCTTCCCTC CCCCGGAGGC GCTCGCCGCC GACGCCAACG TCAAGGCCGA CGCCTACGCC
AGGGCCGACG AGGACCGTCT CGGTTTCTGG GAGGAGCAGG CCCGCAGGCT CCAGTGGGAA
CAGCCCTGGG ACACCGCCCT GGAGTGGAAC CCGCCCTTCG CCAAGTGGTT CGTCGGCGGG
AGGATCAACG CCTCCGTCAA CTGCGTGGAC CGGCACGTGG CCAACGGCCT CGGCGACCGG
GTCGCCTACC ACTGGGAGGG CGAGCCCGGC GACACCCGCA CCATCACCTA CGCCGAGCTC
AAGGACCTGG TCTCCCAGGC GGCCAACGCC ATGACCGGAC TCGGCGTGCG CAAGGGCGAC
CGCGTCGCCA TCTACATGCC GATGATCCCC GAGACCGTCG TGGCCATGCT CGCCTGCGCC
CGTCTCGGCG CCGTCCACAT GGTCGTGTTC GGCGGCTTCT CCGTGGACGC CCTGGGCCAG
CGCCTCGACG ACAGCCAGGC CAAGCTCGTG GTCACCGCCG ACGGAGGCTA CCGCCGCGGC
AAGGCCAGCG CCCTCAAGCC CGCCGTGGAC GCCGCCGTCG CCGACCGCCC CGCGGTCGAG
AACGTCCTCG TCGTCCGCCG CACCGGCCAG GACGTCGAGT GGACCGACCG CGACGTGTGG
TGGCACGACG TCGTGGAGTC CCAGAGCACC GAGCACACGC CCGAGGCGCA CGACGCCGAG
GACCCGCTGT ACATCATGTA CACCAGCGGC ACCACGGCCA AGCCCAAGGG CATCCTGCAC
ACCACCGGCG GCTACCTCAC CCAGGTCGCC TACACCCACT GGGCGGTCTT CGACCTCAAG
CCCGAGACCG ACGTCTACTG GTGCGCCGCC GACATCGGCT GGGTCACCGG CCACTCCTAC
ATCGTCTACG GCCCGCTCGC CAACGCGGCC ACCACCGTCC TGTACGAGGG CACCCCGGAC
ACCCCGCACA AGGGCCGCTT CTGGGAGATC ATCGAGAAGT ACAAGGTCAC CATCGCCTAC
ATGGCCCCCA CGGCGATCCG CACCTTCATG AAGTGGGGCG ACGACATCCC CGCCAAGTTC
GACCTGTCCA GCCTGCGCGT CATCGGCTCG GTGGGCGAGC CCATCAACCC CGAGGCCTAC
GTCTGGTACC GCAAGAACAT CGGCGGCGAC CGCACTCCGG TCGTGGACAC CTGGTGGCAG
ACCGAGACCG GCGCCGTCAT GGTCAGCCCC CTGCCGGGCG TGACCTCCGG CAAGCCCGGC
GCGGCCATGC GGGCCATCCC CGGCATCGTC GCCGACGTCG TGGACGAGCA GGGCGAGTCC
GTCCCCGACG GCGAGGGCGG CTTCATCGTC ATCCGCGAGC CGTGGCCGTC CATGCTCCGC
GGGATCTGGG GCGACCCCCA GCGCTACAAG GACACCTACT GGTCGCGCTT CGAGGGCCTG
TACTTCCCCG GTGACGGCGC CAAGAAGGAC GCCGACGGCG ACCTGTGGCT GCTGGGCCGC
GTGGACGACG TCATGCTCGT CTCCGGCCAC AACATCTCCA CCACCGAGGT CGAGTCCGCG
CTGGTCTCCC ACCCGCGCGT GGCGGAGGCC GCGGTCGTCG GCGCCACCGA CAAGGTCACC
GGCCAGGCCA TCGTCGGCTT CGTGATCCTG CGCGGCGGCG AGGAGGAGGT CCCCGAGGAC
CTGGTCCAGG AACTGCGCAA CCACGTCGGC ACCTCGCTCG GCCCGATCGC CAAGCCCGCC
CGCCTCCTGG CGGTGCCCGA GCTGCCCAAG ACCCGCTCGG GCAAGATCAT GCGGCGCCTG
CTGCGCGACA TCGCCGAGAA CCGGGCCGTC GGCGACACCT CCACGCTCAC CGACTCCTCG
ATCATGGAGG TCATCGCCAA GCAGCTGCCC TCCGCCAAGC GGGAGGACTG A
 
Protein sequence
MADSTPASQE TLSNLLHETR SFPPPEALAA DANVKADAYA RADEDRLGFW EEQARRLQWE 
QPWDTALEWN PPFAKWFVGG RINASVNCVD RHVANGLGDR VAYHWEGEPG DTRTITYAEL
KDLVSQAANA MTGLGVRKGD RVAIYMPMIP ETVVAMLACA RLGAVHMVVF GGFSVDALGQ
RLDDSQAKLV VTADGGYRRG KASALKPAVD AAVADRPAVE NVLVVRRTGQ DVEWTDRDVW
WHDVVESQST EHTPEAHDAE DPLYIMYTSG TTAKPKGILH TTGGYLTQVA YTHWAVFDLK
PETDVYWCAA DIGWVTGHSY IVYGPLANAA TTVLYEGTPD TPHKGRFWEI IEKYKVTIAY
MAPTAIRTFM KWGDDIPAKF DLSSLRVIGS VGEPINPEAY VWYRKNIGGD RTPVVDTWWQ
TETGAVMVSP LPGVTSGKPG AAMRAIPGIV ADVVDEQGES VPDGEGGFIV IREPWPSMLR
GIWGDPQRYK DTYWSRFEGL YFPGDGAKKD ADGDLWLLGR VDDVMLVSGH NISTTEVESA
LVSHPRVAEA AVVGATDKVT GQAIVGFVIL RGGEEEVPED LVQELRNHVG TSLGPIAKPA
RLLAVPELPK TRSGKIMRRL LRDIAENRAV GDTSTLTDSS IMEVIAKQLP SAKRED