Gene Ndas_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1391 
Symbol 
ID9245241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1707126 
End bp1708169 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content76% 
IMG OID 
ProductPectinesterase 
Protein accessionYP_003679329 
Protein GI297560355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.161325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00169458 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCCGCC GCAGTTCCCC CGCCGCCCCC GGAGCGGAGG GCCGGGTGAT CACCGTGGCC 
GCCGACGGTT CCGGCGACCA CACCGGGGTC CAGGCCGCGA TCGACGCCGT GCCCGCGGGC
GGCGACGAAC GCGTCACCAT CCGCGTCGGG GCGGGCGTCT ACCGCGAGCC GGTGGTGGTT
CCCGCGGACA AACCCGGGAT CACCCTGCTC GGCGCGACCG GCGACCCCCG GGACGTGGTC
CTCACCTACG ACCGTGCGGC GGGCACCCCC GGGCCCGGCG GGGGCGTCCA CGGCACGTCC
GGCAGCGCCA GCGTCCTCAT CTCCGGGGAC GGCACGCACG CCCGCGACCT GACCTTCGCG
AACTCCTGGC TGCGCGAGGA GCACCCCGGC GTCACCGGAA CCCAGGCGGT CGCGCTGCGC
GCCACCGGGG ACCGGCTGGT CTTCGACAAC GTGCGCTTCC TGGGCCACCA GGACACGCTG
TACGCGGACT CGCCGGACGC GGACACCCCC GCGCGGCAGT ACTACCGCGG CTGCTACGTC
GAGGGCGACG TGGACTTCGT CTTCGGCCGG GCCACGGCCG TGTTCGACGG GTGCGTGTTC
CACTCCCTGG GCCGGGGCAG CGACACCGAC AACGGCTACG TGACCGCGCC GAGCACCCGG
CCCGGCCGGG AGTTCGGCTT CCTGGTCACC CGCGGCCGCT TCACCGGTGA CGCCCCCGCC
GGGACCGTCT ACCTGGGCCG CCCGTGGGTG CCCAGCTCGC ATCCGGACGC CGAGCCGCGG
GTGCTGGTGC GCGACTCCTG GATGGGCCGC CACTTCCGCG GGGAGGGCTG GATCGCGATG
GCCTCCGGCC ACGACTGGCG CCGGTTCCGG ATGCTGGAGT ACCGCAACTC CGGTCCCGGC
GCGCTGGTCA CCGCGGACCG ACCGCAGATG GACCCGACCG AGGCCGCCCG GCACACCATT
GAGGCCTACC TGGCCGGGGA CGACGGGTGG AACCCGGCGC GGGAGCGCAC GGGGCGCCCG
GAGTCCGCCA CCCGGGCACG CTGA
 
Protein sequence
MVRRSSPAAP GAEGRVITVA ADGSGDHTGV QAAIDAVPAG GDERVTIRVG AGVYREPVVV 
PADKPGITLL GATGDPRDVV LTYDRAAGTP GPGGGVHGTS GSASVLISGD GTHARDLTFA
NSWLREEHPG VTGTQAVALR ATGDRLVFDN VRFLGHQDTL YADSPDADTP ARQYYRGCYV
EGDVDFVFGR ATAVFDGCVF HSLGRGSDTD NGYVTAPSTR PGREFGFLVT RGRFTGDAPA
GTVYLGRPWV PSSHPDAEPR VLVRDSWMGR HFRGEGWIAM ASGHDWRRFR MLEYRNSGPG
ALVTADRPQM DPTEAARHTI EAYLAGDDGW NPARERTGRP ESATRAR