Gene Ndas_1712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1712 
Symbol 
ID9245562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2083404 
End bp2084594 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content76% 
IMG OID 
Productprotein of unknown function DUF664 
Protein accessionYP_003679647 
Protein GI297560673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.833609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAG ACAAGAACAC CTCGCCCAGC CGGGAGTTCT GGGAACCCCG CTACCGGGGC 
GGGGACCCCT CGACGCCGCC GCCCGGCCCC AACGCGGCCT TCGCCCGCCT CGCCGGGGAA
CTGGCCCTGG TGCCACCGCC GGAACCGGAG CGCGGGGCGG ACGCGCGCCG CGCCCTCGAA
CTCGCCTGCG GCCGGGGCGG GGACGCCCTG TGGCTGGCGG GCCGGGGATG GGACGTCACG
GCCGTCGACG TCGCGGAACA CGCCCTGGCC GTGCTGGCCG AGCGGGCCCG CCGGGCCGGG
GTAGGGGACC GCCTCACCAC GCGGCGCCAC GACCTGGCGC TGTCGGTGCC CGACGCCGGA
CCGTGGGACC TGGTCTACGC GAACTACTTC CACACCCCGG TGGACATCGA CCGTGACGCC
GTGCTGCGCC GGGTCTCGCG GTCGGTGGGC GGGGGCGGGC TGCTGGTCGT GATCGACCAC
GCGTCCAGTG CGCCCTGGTC CTGGGAACAG CGCGACGACT TCCCCGCCCC CGAGGAGCTG
TGGCGGTCGC TGGACCTGGG CGCGGACTGG ACCGGCCTCG TGTGCGAGCG GCGTTCGCGG
CTGGCCCACG GCCCCGACGG CCGCAGCGCG CGGGTGAGCG ACAACGTGGT GGTGGCCCGG
CGCCGTACGG GGGCGACGCC GAAGGGTTCG ACGAGGACGG CCGATGCACC CTCGCGCCGA
CGCGACCAGC CCCCGCCCGG GACGGGGTCC GCGGAGAAGG AGGTCCTCAC GGGGTTCCTG
GCCTACCTGC GCGAGAGCGT CCTCGCCAAG CTGGACGGCG CGCCGGAGCA GCACGTGCGC
ACTCCGGGCG TCGCGTCCGG CACGAACCTG CTCGGACTGG TCAAGCACCT GGCCCACGTC
GAGCGCGCCC TCTTCCTCGG GGAGGAGGTC GGCGACTGGC AGGCCACGTT CCACGCCGAC
ACCGGTGAGA CCACGGCTGG CGTTCTGGAG GGATACCGCG CGGCCGTGGC CGCCGCCGAC
CGTGCCATCG CCGACTGCGA CGACCTCGGC GGGCCCGCGC ACGCGGGGCG CTGGAACGGA
CCGGCCCCGT CGATGCGGTG GGCGCTCGTG CACATGATCG AGGAGACCGG CCGCCACGCC
GGGCACCTGG ACATCCTCCG CGAACTGGTG GACGGCCGGA CCGGTCGCTG A
 
Protein sequence
METDKNTSPS REFWEPRYRG GDPSTPPPGP NAAFARLAGE LALVPPPEPE RGADARRALE 
LACGRGGDAL WLAGRGWDVT AVDVAEHALA VLAERARRAG VGDRLTTRRH DLALSVPDAG
PWDLVYANYF HTPVDIDRDA VLRRVSRSVG GGGLLVVIDH ASSAPWSWEQ RDDFPAPEEL
WRSLDLGADW TGLVCERRSR LAHGPDGRSA RVSDNVVVAR RRTGATPKGS TRTADAPSRR
RDQPPPGTGS AEKEVLTGFL AYLRESVLAK LDGAPEQHVR TPGVASGTNL LGLVKHLAHV
ERALFLGEEV GDWQATFHAD TGETTAGVLE GYRAAVAAAD RAIADCDDLG GPAHAGRWNG
PAPSMRWALV HMIEETGRHA GHLDILRELV DGRTGR