Gene Ndas_5474 details


Gene Information

Locus tag: Ndas_5474
Symbol:
ID: 9249377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 668005
End bp: 669294
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: esterase/lipase
Protein accession: YP_003683359
Protein GI: 297564386
COG category:
COG ID:
TIGRFAM ID:
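
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The Python sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length(668005, 669294)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both results agree with the Gene Length and Protein Length fields of the record.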


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.777851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCGAC GGAACCGCGC CGCCGCGGCC GCCCGCTGGC TGGCGACCGC CCTCGCGGGC 
ACGACCGCGC TTGTCGGCGC GGGCGCCGCC GCGCTCGTCC TGCTGCCGCC GCCCTCCTGG
GACGCGTGGC AGTACGGACT GCTGGTACTG GAGTTCAGCC TGGTGTTCGC CGCCCTGGCC
GTCGTCGGCC TGGTCCCGGC CCTGTTCGCG GCGGGGAGGC CGCGCCTCCG CGAGGGCACC
GGGCGGCGTC CGGACGCGGG AAGCAGGGGA GCGGTGTGGC GTCGGCGCCT CGCCGCGGCG
GCCGCCACGG CCAACGGCGC GGTACTGGTC GCCGCGCTCG TCCCGCCCGG GACGATCTGG
TCCACCGCAC ACGCGGAGGG AGCCCGGCTC GACCTCGGCG AGTACGCCGC CGGACTCGCC
ACCAGCGCCG ACCGCGCACC CGAGACCCGC GTCTACCTGC GCACCGGCGA GTCGGCGGTC
GGGGACGCCT CCGACCCGCA GGACCTGGAA CTGGACGTGT GGGAGCCCGA GCGGCGGCGG
GGGGACGCCC CGCCACCGAT CGTGGTGAAC GTCCACGGAG GCGCGGACGA CCTGCCGCAG
AGCCTCCTGC CCCGCTGGGA CGTGTGGCTG GCCGACAACG GGCACGTGGT GTTCGACGTC
GACTACCGCT ACTTCCCCGA CGGCGACTGG TCGGTGCCGG TCTCCGACGT CAAGTGCGCG
ATCGGCTGGG CCCGCGAGCA CGCCGCGGAG TACGGGGCCG ACCCCGGCCG GATCGCCGTC
ACCGGCCAGT CGGCGGGCGG CCTGCTCGCC CTCCTGGCCG CGTACAGCTC GGACGAGGAG
ATCCCCCCGA GCTGCGACGT GCCCGACACC GGCGTCGACG CGGTCGTGGC CTGGTACGCG
GTGGCCGACG GTACCGCCGA GGCGCCCGAG CTGCCCTGGC GCCAGCGCAA CTCCCCGATG
GGCGGCGACC TCCTGGAGGA GAGCGAGCGC CTGATGGGCG GCTCCGTGGC GGAGCTGCCC
GAGGAGTACG CGATGAGCTC GCCCATCACG TACGTCTCGC CGGAGGTTCC GCCCACGATG
CTGATCACGC CCGGCCACGA CCTGTTCGTG GGCCCGGAGG ACAACCGCCG CCTCGCCGCC
CGGCTCGACG CGGCCGGGGT CCCGCACCGG CACCTGGAGA TCCCCTGGGC GGAGCACATG
TTCGACCTCA ACTGGGGAGG GTTCGCCAGC CAGGTCACCC GGCACGGCCT GGACGGGTTC
CTCGACGAAC ACCTCGCCGC CTCGCCGTGA
 
Protein sequence
MDRRNRAAAA ARWLATALAG TTALVGAGAA ALVLLPPPSW DAWQYGLLVL EFSLVFAALA 
VVGLVPALFA AGRPRLREGT GRRPDAGSRG AVWRRRLAAA AATANGAVLV AALVPPGTIW
STAHAEGARL DLGEYAAGLA TSADRAPETR VYLRTGESAV GDASDPQDLE LDVWEPERRR
GDAPPPIVVN VHGGADDLPQ SLLPRWDVWL ADNGHVVFDV DYRYFPDGDW SVPVSDVKCA
IGWAREHAAE YGADPGRIAV TGQSAGGLLA LLAAYSSDEE IPPSCDVPDT GVDAVVAWYA
VADGTAEAPE LPWRQRNSPM GGDLLEESER LMGGSVAELP EEYAMSSPIT YVSPEVPPTM
LITPGHDLFV GPEDNRRLAA RLDAAGVPHR HLEIPWAEHM FDLNWGGFAS QVTRHGLDGF
LDEHLAASP
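
The relationship between the gene sequence and the protein sequence above can be sketched in Python using only the standard library. This is a minimal illustration, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids and stop codons as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 60 bases of the gene are used as input.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = "ATGGACCGAC GGAACCGCGC CGCCGCGGCC GCCCGCTGGC TGGCGACCGC CCTCGCGGGC"
print(translate(first_line))  # MDRRNRAAAAARWLATALAG
print(f"{gc_fraction(first_line):.1%}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions should reproduce the complete protein and allow a comparison with the GC content field.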