Gene Ndas_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3211 
Symbol 
ID9247068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3840452 
End bp3841858 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content76% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003681125 
Protein GI297562151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCACA TCGCCACTTC CTTCCCAGAG AGAACCGACG GCTTCCCGGA GGGGGCCGAC 
GGCCTCGCGG CGGCCGGGGA GGACGCCGTC CACCTGGCCC GAGGGCTCCT CCGCCGCGAC
TCCACCAACC ACGGCGGCGG TCAGGGCGAC GAGCGCGAGG CCGCCGAGTA CGTCGCCGAG
GCCCTGGGCG ACGCCGGACT GGACCCCCTG CTGCTGGAGT CGGCCCCCCG GCGCGCGAAC
GTGGTCGTGC GCGTCCCCGG CACCGACCCC TCGGCCCCGG CGCTGCTGGT GCACGGCCAC
CTGGACGTGG TGCCCGCCGA CGCGGCCGGC TGGACGCTGC CCCCCTTCGC CGGTGAGGTC
GCCGACTGCC CCGTCACCGG TGTGCCCGCG CTGTGGGGGC GCGGCGCGGT GGACATGAAG
AACACCATCG CCACGGTCAC CGCCGTGGTG CGCCACTGGG CGCGCCACGG GCTGCGGCCC
CGGCGCGACA TCGTGCTGGC CTTCGTCGCC GACGAGGAGG ACAGCGCCGC CTACGGCGCC
GACTACCTGG TCCGCGAGCA CGCCGAGCTG TTCGAGGGCT GCACCACCGC GATCGGCGAG
GGCGGCGGGG AGACCATCCA CGCCCGCACC GCCTCCGGGG AGCCGGTGCG CCTGTACCCG
GTGGGCGCGG CCGAGCGCGG CAGCGCCTGG CTGAACCTGC GCGCCCAGGG CACGGCCGGG
CACGGTTCCC GACCGCCGCG CGACAACGCG ATCGGCGCCC TGGCCGCGGC CCTGGCCCGG
ATCGACGGCT ACGAGTGGCC GCTGCACCTG ACCCCCGTCA CCCGTGCGGC GATCGACGCG
ATCGCCGCGG CCCTGGAGGT GGAGCGCTTC CCCGGTGACA CGGCGACCGC GGAGGCGGTG
GACGCCCTGG TCGCCAGCCT GGGGACGGCC GCGCCGCTGA TCGGCCCGAC CACGCGCAAC
AGCGCGACGC CCACGATGTT CTCGGCCGGG TACAAGGTGA ACGTGGTGCC GGGGGAGGCC
ACGGCGGGTG TGGACGGCCG GGTGCTGCCC GGCGCGGAGG AGCAGTTCGC CGCGGTCATG
GAGGAGCTCA CCGGCGACCG GGTGACCTGG GAGTACGCGC ACGGTTCGCC GCCGGTGTCC
GCGCCCGTGG ACTCCCCCGC GTTCGCGGAG CTGCGCGAGG CCCTGCTGCT GCACGACCCC
GGGGCCCACG TGGTGCCGGT GTGCCTGTCC GGGGGCACCG ACGCCAAGGT GTTCTCCCGG
CTGGGCATCG ACTGCTACGG ATTCTCGCCC CTGGCCCAGC CCGAGGGCCT GGACTACTCG
GGTCTGCTGC ACGGCGTGGA CGAGCGGGTG CCGCTGGAGG GGCTGCGCTT CGGGGTGCGC
GCGCTGGACA CGTTCCTGCG CGCCTGA
 
Protein sequence
MGHIATSFPE RTDGFPEGAD GLAAAGEDAV HLARGLLRRD STNHGGGQGD EREAAEYVAE 
ALGDAGLDPL LLESAPRRAN VVVRVPGTDP SAPALLVHGH LDVVPADAAG WTLPPFAGEV
ADCPVTGVPA LWGRGAVDMK NTIATVTAVV RHWARHGLRP RRDIVLAFVA DEEDSAAYGA
DYLVREHAEL FEGCTTAIGE GGGETIHART ASGEPVRLYP VGAAERGSAW LNLRAQGTAG
HGSRPPRDNA IGALAAALAR IDGYEWPLHL TPVTRAAIDA IAAALEVERF PGDTATAEAV
DALVASLGTA APLIGPTTRN SATPTMFSAG YKVNVVPGEA TAGVDGRVLP GAEEQFAAVM
EELTGDRVTW EYAHGSPPVS APVDSPAFAE LREALLLHDP GAHVVPVCLS GGTDAKVFSR
LGIDCYGFSP LAQPEGLDYS GLLHGVDERV PLEGLRFGVR ALDTFLRA