Gene Ndas_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0501 
Symbol 
ID9244342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp618506 
End bp619603 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content75% 
IMG OID 
ProductDomain of unknown function DUF2394 
Protein accessionYP_003678454 
Protein GI297559480 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.569038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGAAT CGGCGGACAG CTCGGTGGAC GGCAAACGGC TCCTGCTGGT GGGCACCTAC 
ACCCCCGACT CCGACCCTCC CGGGGAGGGC GAGGGGATCT ACCGCGTCTG GTTCGACCCC
ATGACCGGCG AGATGACCCA CGGCGGCGCC GCCGCCCGTA CCCCGGGCCC CTCCTTCCTC
GCCTTCCGCG AGGACCCGCC CACGGTCTAC GCGGTCAACG AGCGCGAGAA GGGCACCGTC
ACCGCCTTCC GGATCGACGG CGCAGCCGGG CTCACCGAAC TTGGCCAGTC CCCGACCGGC
GGCGGGTCGC CCTGCCACGT GCTCGCGCGC GGCTCCGAAC TGGCGGTGAC CAACTACGCC
AACGGCGTGG CCACGCTGTA CGCCCTGGCC GAGGACGGCT CCCTCGACGG AACGGCGGCG
GAGTTCGCGC ACTCCGGAAG CGGCCCGGTC ACCGACCGCC AGGAGGGGCC GCACGCGCAC
AGCACCGCCG CCCCCGACGA CCATCACCTG CTGGTGGCCG ACCTGGGCAC CGACGAGCTG
CGCGTCCTGC GCGGCGGCGA GGAGGTCGGC GCCGTCTCCC TGCCCCCGGG CACCGGCCCC
CGGCACACGG CCGTCCTCGG CGAGTACCTC TACGTAGCGG GTGAGCTGGA CTCGCGCGTG
CACGTCCTGC GATGGAACCC CGACGAAGGC ACCGCCGAGC ACCTGGGCTC CGTCGAGGCC
ACCGGAGAGG AGGCCGCAGG CGAGAACTTC CCCGCCGAGA TCCTCAGCAA CGGCGACCAC
GTGTACGTGT CCAACCGGGG CGCGGACACG ATCGCCACCT TCGCCGTCCG CGACGGCGGC
GCCCGTCTGG AGCACGTCGC CGACACCCCG GCCGGAGGGC CGTGGCCGCG CAACTTCACC
GTCGTGCGCG GCCACCGCGA GGAACCCGAC CACCTGGTCG TGGCCGCCCA GAACGGCGGC
TCGCTGGCCT CGCTTCTCCT GGACCCCGGC ACGGGCGTCC CGGCCGACAC CGGCCACCGG
CTGCGCCTGC CCGTCCCCGT GTGCGTGCTC CCGGTCCCGA TCACCCGCAT CCGCCGCGCC
GGGGGAACCC GGGGCTGA
 
Protein sequence
MGESADSSVD GKRLLLVGTY TPDSDPPGEG EGIYRVWFDP MTGEMTHGGA AARTPGPSFL 
AFREDPPTVY AVNEREKGTV TAFRIDGAAG LTELGQSPTG GGSPCHVLAR GSELAVTNYA
NGVATLYALA EDGSLDGTAA EFAHSGSGPV TDRQEGPHAH STAAPDDHHL LVADLGTDEL
RVLRGGEEVG AVSLPPGTGP RHTAVLGEYL YVAGELDSRV HVLRWNPDEG TAEHLGSVEA
TGEEAAGENF PAEILSNGDH VYVSNRGADT IATFAVRDGG ARLEHVADTP AGGPWPRNFT
VVRGHREEPD HLVVAAQNGG SLASLLLDPG TGVPADTGHR LRLPVPVCVL PVPITRIRRA
GGTRG