Gene Ndas_0028 details


Gene Information

Locus tag: Ndas_0028
Symbol:
ID: 9243855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 36038
End bp: 37195
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: peptidase M20
Protein accession: YP_003677986
Protein GI: 297559012
COG category:
COG ID:
TIGRFAM ID:
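
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 36038
END_BP = 37195


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(f"Gene length: {GENE_LEN} bp")      # Gene length: 1158 bp
print(f"Protein length: {PROTEIN_LEN} aa")  # Protein length: 385 aa
```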


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.36117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGA CGGTGACCGC GAGGTTGGCG GACGCGGCTG AGCGCGCCCT GCCCGGCATG 
ATCGACGACC TGCGCGCCCT GGTGGAGCTG GAGACCCCCA GCGGCGACAG GGACCTGCTG
TCCGCCGGAC TGGACGGCAT CGAGGGGTGG CTGGCCCGGC GGCTGGGCGC GCCCGAGACC
CGGGTCCGCT ACGACGGCGG CTCCTTCGGC GACGTGCTGG AGGTGTCCTA TCCCGGTACG
GGGGCGGGGA CCGTGCTCTT CGTCAGCCAC TACGACACCG TCTGGCCCGC CGGAACCCTG
GCCGGGTGGC CCGTCACGGT CGAGGGCGAC CGGTTCAGCG GCCCCGGCTG CTTCGACATG
AAGGCCGGGA TCGTGCAGAG CGCCTGGGCT CTGCGGCTCC TGCGCGAACT GGACCTGCCC
CGGCCCGCCG TGCGGATGCT GCTCACCGGG GACGAGGAGA TCGGCAGCCC CGCGTCGCGC
CCGCACATCG AGCGGGCCAG CGAGGGCGTG GACCTGACCC TGGTCCTCGA ACCCAGCCGG
GAGGGCATGC CCAAGACCCG ACGCAAGGGC ATGGGGATCT TCGACGTGGA CGTGCGCGGC
GTGGAGTCCC ACGCGGGCCT GGACCCCGCA GCGGGGGCGA GCGCCGTGCA CGCCCTGGCC
CAGGTCGTGC CCGCGCTCAC CGCCCTGTCC GCGCCGGAGC TGGGCACCAC GGTGAACGTG
GGCCTGGTCT CCGGGGGGAC CGGGTACAAC GTCGTCGCCG GGCACGCCCG CTGCGGGGTG
GACGTGCGGG TGCAGGACCC CGCCGAGATG GCCCGCGTGG ACGCCGGGCT GGCCGCGCTC
GCCGCCGCCG ACCCGCGCGT GGCGGTCCGG GTCACCGGCG GGTGGAACCG CCCTCCGATG
AACCCCAACC CGCCCTCGGA GAAGGCGTTC GGCCTGCTGC GCGAGGTGGC CGGGGAACTG
GGCGCCTCCC TGGAGGAGGT GTCGGTGGGC GGGGCCAGCG ACGCCAACTT CGTCTCCGCG
CTGGGCCGCC CGGTGCTCGA CGGGCTGGGC GCGGTGGGCG CCGGACCGCA TTCGCGCGAC
GAGCACGTCC TGGTCGGCGG GACGCCGCGC CAGGTCGCCC TGGTGGCGGG CCTGATGGAG
CGGATCGCGG GGAGGTAG
 
Protein sequence
MSQTVTARLA DAAERALPGM IDDLRALVEL ETPSGDRDLL SAGLDGIEGW LARRLGAPET 
RVRYDGGSFG DVLEVSYPGT GAGTVLFVSH YDTVWPAGTL AGWPVTVEGD RFSGPGCFDM
KAGIVQSAWA LRLLRELDLP RPAVRMLLTG DEEIGSPASR PHIERASEGV DLTLVLEPSR
EGMPKTRRKG MGIFDVDVRG VESHAGLDPA AGASAVHALA QVVPALTALS APELGTTVNV
GLVSGGTGYN VVAGHARCGV DVRVQDPAEM ARVDAGLAAL AAADPRVAVR VTGGWNRPPM
NPNPPSEKAF GLLREVAGEL GASLEEVSVG GASDANFVSA LGRPVLDGLG AVGAGPHSRD
EHVLVGGTPR QVALVAGLME RIAGR
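
As a rough cross-check of the two sequences, the sketch below strips the 10-character grouping, computes a GC fraction, and translates codons until the first stop codon. It assumes that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in which start codons are permitted). All names are illustrative. Only the first and last codons of the gene sequence are used as examples; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGAGCCAGA CGGTGACCGC GAGGTTGGCG"  # start of the gene sequence
last_codons = "CGGATCGCGG GGAGGTAG"                # end of the gene sequence
print(translate(first_codons))  # MSQTVTARLA
print(translate(last_codons))   # RIAGR
```

The two printed fragments match the first ten and last five residues of the protein sequence in the record.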