Gene Ndas_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3574 
Symbol 
ID9247443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4286459 
End bp4287601 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content76% 
IMG OID 
ProductPolyprenyl synthetase 
Protein accessionYP_003681481 
Protein GI297562507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.504985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACCCG GGGAGGACAC CGTCAATCCC ATCCTGAACA CCCACAGGCA CCCGCGCCCG 
CCGGACGCCG AAGACGCCGA CGAGCGCTCC GCCCTGGAGG CGTCCCTCAA GGACTACCTG
GAGCGCAGGC TCCGCGACTC GGAGGACCTC GACGGAGATT TCGGCCGGGA CCTGGCCGGC
ACGACCGTCC GCTTCACCCT GGGCGGCGGC AAACGGATGC GGCCCCTGCT CGCCTGGTGG
GGGTGGCTCG CGGGCGGAGG CGCCCCCTCG GGGGAGACGG CCCGCGCCGC CCGCCAGGCC
TGCGCCGCGG TCGAACTCGT CCAGACCTGC GCGCTCGTCC ACGACGACGT CATGGACGGC
TCGCCGACCC GCCGGGGCCG CCCCTCGGTG CACGCCGCGC ACGCCCTCGA ACACGAGCGG
GACGGCCACG TCGGCGACTC CCGCCGCTAC GGGGAGGCGC TGGCGGTCCT CGTGGGCGAC
CTGGCCCTCG TCTGGGCCGA CGACATGCTC AACGAGGCCC TGCCGGGCGT GCCCGAGCCC
GTCCGGGCGC GGGCCGTGTG GCGGGACCTG CGCACCGAGA TCATGGCCGG ACAGTTCCTC
GACGTGCGCG CCCAGGCGCG CCGGGAGCGC TCGGAGGAGG CCGCCCTGCG TGTGGACCTG
CTCAAGACGG CCTCCTACAG CGTCGAACGC CCCCTGCACC TGGGCGCGGC GATGGCCGGG
GCCGACCCGG CGGCGGTGGG GGCCCTGCGC GGCTACGGCC GGGACGTGGG CATCGCCTTC
CAGCTCAGGG ACGACCTGCT CGGCGTCTAC GGGGACAGCT CCCGGACCGG CAAACCCGTG
GGCGAGGACA TCCGCGAGGG CAAGTGCACG CTGCTGCTCG TGATCGGCAC CCGCCTGGCC
CGCGAGCGCG GGGACGACGC CGCGCTGCGG CTGCTCGACC GGATCGGCCT GCCCGGCGAG
GACGTGGACC CCGCGGAGGC GGCCGACGCG CTGGACCGCC TCGGCGCACG CGACCTGGTC
GGAGCCAGGT GCCGTGAGCT CGCCGAACGG GGCCGGGCGC ACCTGGAGGG GCTCGACGCG
CCAGCCCACG TGCTGGAGGG GCTGGGCGGC CTGGCCTCCC GGATCGCGCG TGACCGTGTG
TGA
 
Protein sequence
MLPGEDTVNP ILNTHRHPRP PDAEDADERS ALEASLKDYL ERRLRDSEDL DGDFGRDLAG 
TTVRFTLGGG KRMRPLLAWW GWLAGGGAPS GETARAARQA CAAVELVQTC ALVHDDVMDG
SPTRRGRPSV HAAHALEHER DGHVGDSRRY GEALAVLVGD LALVWADDML NEALPGVPEP
VRARAVWRDL RTEIMAGQFL DVRAQARRER SEEAALRVDL LKTASYSVER PLHLGAAMAG
ADPAAVGALR GYGRDVGIAF QLRDDLLGVY GDSSRTGKPV GEDIREGKCT LLLVIGTRLA
RERGDDAALR LLDRIGLPGE DVDPAEAADA LDRLGARDLV GARCRELAER GRAHLEGLDA
PAHVLEGLGG LASRIARDRV