Gene Ndas_3977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3977 
Symbol 
ID9247848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4756751 
End bp4757884 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content73% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003681880 
Protein GI297562906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.745092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGGCC TCGCCGCGGC CGGCGCCCTC TCCGTCCCCC TGGCCCTGTC GGGCGCCGTC 
TCGGCCGGCG CCGACGAGCT CGCCCCCCTC CACACCGTCG GCGCCTCGGC CACCGGCGAC
TACTTCGTCG TCCTGGAGGA CACGCTCAGC GCCGCCTCCG TCACCCCCTC GTCCCTCGGC
ATCTCCTCCG ACCAGGTCAA CCACACCTAC GACGAGGTCG TCAACGGCTA CTCGGCCAGC
CTGAGCGCGG CGGAGGTGCG CGAACTGCGC GGGCAGTCGG GCGTCGCCTA CGTCGAGGAG
GTCGGCGTCG CGCACACCAC CGTCACGTGG GGCCAGGACC GCATCGACCA GGAGGACCTC
CCCCTGGACG GCTCCTACGG CACCACTGGC GACGGGGCCG GCGTCTCCGC CTACATCATC
GACAGCGGCA TCGACGCCTC GCACCCCGAC TTCGGCGGTC GCGCCTCCTC CGCCTTCGAC
GCCTACGGCG GCGACGGCTC CGACGGCAAC GGCCACGGGA CCCACGTCGC GGGCACCATC
GGCTCCGCCA CGTACGGCGT CGCCCCCGCC GCCGACCTCT TCGGCGTCAA GGTGCTCGAC
GACAACGGCT CGGGCTCCTA CGACGACGTC ATCGCCGGTA TCGACTGGGT GGCCGCCAAC
GCCGGATCCA ACGCCGTGGC CAACCTGTCC CTGGGCGGCC CGTCCTCCCC GACCCTGGAC
GAGGCCGTCA ACGGCCTGGC CGAGTCCGGC GTGTTCGTCG CGGTCGCGGC GGGCAACGAG
GGCCAGGACG CGGGCAACAC CTCTCCGGGC GGCGCCGAGG GCGTGACCAC GGTCGGCGCC
TCCGACGCGA CCGACGCCGC GGCCGTCTTC TCCAACCACG GCCCGTCCGT CGACATCTAC
GCCCCCGGCG TGGACGTGGA GTCGACCGTC CCCGGCGGCG GAACCGACTC CTATGACGGC
ACCTCGATGG CCAGCCCGCA CGTGGCCGGG GCCGCCGCCC TGTACAAGAG CGTGAACGGC
GACGACGACC AGGCCACCAT CCAGGACTGG CTCGTCTCCA ACGCCGGCGT GGACAAGCTC
AGCGGCGTTC CCGCGGGCAC GGTCAACCTG CTGCTCAACG TCCAGGGCCT CTGA
 
Protein sequence
MLGLAAAGAL SVPLALSGAV SAGADELAPL HTVGASATGD YFVVLEDTLS AASVTPSSLG 
ISSDQVNHTY DEVVNGYSAS LSAAEVRELR GQSGVAYVEE VGVAHTTVTW GQDRIDQEDL
PLDGSYGTTG DGAGVSAYII DSGIDASHPD FGGRASSAFD AYGGDGSDGN GHGTHVAGTI
GSATYGVAPA ADLFGVKVLD DNGSGSYDDV IAGIDWVAAN AGSNAVANLS LGGPSSPTLD
EAVNGLAESG VFVAVAAGNE GQDAGNTSPG GAEGVTTVGA SDATDAAAVF SNHGPSVDIY
APGVDVESTV PGGGTDSYDG TSMASPHVAG AAALYKSVNG DDDQATIQDW LVSNAGVDKL
SGVPAGTVNL LLNVQGL