Gene Ndas_3906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3906 
Symbol 
ID9247777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4678365 
End bp4679441 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content75% 
IMG OID 
Productpeptidase M50 
Protein accessionYP_003681809 
Protein GI297562835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCGC CCGAGCCGCC GCGCAAGGCG GAGACGACCG ACGTGCCCGA GGCCGGCGGG 
GCAGTCGCCG ACCCGTGGGA GCAGATGCCC GAGGACGCCC TCAACCGGCC GGACCTGGGG
GAGGACACCG GGGGCCCCGG CAGGGCCGGG GACTCCGGGA CCGGCGCCCG GCCGCTGGAC
GCCGCGGGGA CGAAGGACGC GGGACCCGCG TCCGCGCGCC CGGACGCCTC CGGGGACGCC
GACCCGGACG CCTCCCGGGG CGGCCCCGAC GCCGCGCGCT CCGGGAGCTG GGCGGACTTC
CTGCCCAGCC CGGTGTTCGT GCTGCTGCTG GGCCTGGCGG GGTTCGCGGG CTGGCTGTCG
TGGACCGCCG CGGAGCTGGA GTGGGCCGCC GAGGGCACCA GCGTCACCCC GCTGGTCCCG
CCGCTGTTCA TCCTGCTCTG CTGGATCGTC TCCCTGGCCG TGCACGAGTT CGCGCACGCG
CTCGCCGCCT ACCTGGCCGG TGACCGCTCC CTGCGCGGCA GCGCCTACCT GCGGCTCAAC
CCGTTCGCCT ACCGGCATGC CTTCGCCGGG CTGGTCCTGC CCTCGGCCTA CCTGGGCCTG
GGCGCCTTCG GCATGACCGG TCCGCCCACC TACGTGGACT GGGACCGCAT CCCGTCCCGG
GGCCGCCGCG TCCTGGTGGC ACTCGCCGGA CCGCTGGCCA GCCTCCTGGT GGCCGCCGCG
TTCGCGGTCA CCGTGTCCGT TCTGGTCCCC CCGGGCAACG ACACCACCAA CTGGGCGATC
TCGGCGATGG CCTTCCTGTG CTTCGCGAAC CTGACGGCCG CCCTGATCAA CCTGCTGCCC
GTCCCCGGCC TGGACGGCTT CGAGGTGCTG GCCGCCTGGA CGCGCGGGAA GTGGGTCACC
GCGGCCCGCG ACAACGCGCT GTTCGGCTCG GTGGCCGTGT TCGCGGTCCT GTGGTTCCCG
GGCCTGAACG ACCTGCTGGT GAACGCGGTG TACGGCCTGT TCGACCTGGT GCTGCCCAAC
CCGGTGTTCC GCGGCATCGC CTTCTACGGC GAGCTGCTCC TCCAGTTCTG GGCCTGA
 
Protein sequence
MPAPEPPRKA ETTDVPEAGG AVADPWEQMP EDALNRPDLG EDTGGPGRAG DSGTGARPLD 
AAGTKDAGPA SARPDASGDA DPDASRGGPD AARSGSWADF LPSPVFVLLL GLAGFAGWLS
WTAAELEWAA EGTSVTPLVP PLFILLCWIV SLAVHEFAHA LAAYLAGDRS LRGSAYLRLN
PFAYRHAFAG LVLPSAYLGL GAFGMTGPPT YVDWDRIPSR GRRVLVALAG PLASLLVAAA
FAVTVSVLVP PGNDTTNWAI SAMAFLCFAN LTAALINLLP VPGLDGFEVL AAWTRGKWVT
AARDNALFGS VAVFAVLWFP GLNDLLVNAV YGLFDLVLPN PVFRGIAFYG ELLLQFWA