Gene Ndas_0405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0405 
Symbol 
ID9244243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp496533 
End bp497882 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content74% 
IMG OID 
Productpeptidase M16 domain protein 
Protein accessionYP_003678359 
Protein GI297559385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.445293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACA AGTCGCGTCC GGACCTCGGT GCCGTAGCGT CCTACACCTT TCCCAAGCCC 
CGCCGCCTCA CCGTCGGCGG CGGCACGGTC GTCGCCATCG ACGTACCCGG CCAACAGCTC
GTCTCCCTCC GCCTCGTGCA CCCCCACGGC GGTGCCGCCG AGCCGCTCGA CGCCATGGGC
GTCAGTGCTC TGACCAGCGA GGTCCTGGAG GACGGTCCCA ACGGCAACAG CTCCCTGGCG
CCCGCCCTGG AGCGCCACGG CGCCGAGTGG GTCTCCCGCG TCAACTGGGA CGCCTTCATC
ACCGGCCTGG ACGCCCCCGC GGGCCGCCTG CCCGAGGCGG TGCGACTGTT CGCCGACGCG
GTGCGCCGTC CCGCGCTGAA CCCCGACGAC GTCGTCCGCC GCCGCGACCA GCTGCTGGAG
CGGTTCTGGC TGGAGGCGGC CTCGGCCAGC ACCCTGGCCA TGCGCAGCCT GGGCGGGCAG
CTGTTCACCG GCCGCTACGC CACACCGCTC GCGGGCGGCC CGGTCAAGCT GGCCGACGTC
ACCCCCGAGA CCGTGGCGGC CTTCCACGCC GACTCCGTCG CCTCGGTGGC CGGAACCCTC
GTGGTCGTCG GCGACCTCAC CGGGATCGAC CTGGAGGACC TGGGCAAGAC CGTGTTCGGC
GACGCCGCCG CCGTGCGCGC GCCCGAGCCG ACCGAGCCCG CCCCGCCGCC CGGGGAGCTG
CCCCGCGTCC TGATCGTGGA CCGGCCCGGC TCCGTGCAGT CCGCCCTGGT CATCGCGCAC
CGCGCCCCCT CCCGCTCGCA GGTGGACCTG CCGCGCGCCG AGGGCGTCAG CGAGGTCCTG
GGCGGCATGT TCACCTCCCG GCTCAACCTG GAGCTGCGCG AGCGCCTGGG CTACACCTAC
GGCGCCGGGT CCCGCTTCGA CCTGCGCCGC GACAGCGGGG TGTTCTTCAT GTCCGCCCAG
GTCGAGGCGG ACACCACCGC CCACTCGGTC ACCTCCTCGC TGGAGCAGGT CGCCAAGCTG
CGCGAGTCGG GGGTCACCGA GGAGGAGCTG GCCGCCGTGC GCGACTCCAA CACGGTGGGC
CTGCCGGTCA CCTACGCCAC GGCGCGCGCC ATGGCCGGGG CCCTGGTCGA CATGGTGGTG
CACGACCTGC CCGAGGACCA CGTGGACCGC GTCCGGGCCG GGTACGAGAG GCTCACCAAG
GAGTCCCTGG ACAGCGCCGC CACCGAGTAC CTGCGCCCCG AGGAGTCGGT CGTGGTGGTC
GTCGGCGACG CCGAGCGCCT GCGCCAGCCG CTCACCGACA CCGGGGTCGG CCCGGTCGAG
GTGCGCACCC CCGACTCGCT CTGGACCTGA
 
Protein sequence
MPHKSRPDLG AVASYTFPKP RRLTVGGGTV VAIDVPGQQL VSLRLVHPHG GAAEPLDAMG 
VSALTSEVLE DGPNGNSSLA PALERHGAEW VSRVNWDAFI TGLDAPAGRL PEAVRLFADA
VRRPALNPDD VVRRRDQLLE RFWLEAASAS TLAMRSLGGQ LFTGRYATPL AGGPVKLADV
TPETVAAFHA DSVASVAGTL VVVGDLTGID LEDLGKTVFG DAAAVRAPEP TEPAPPPGEL
PRVLIVDRPG SVQSALVIAH RAPSRSQVDL PRAEGVSEVL GGMFTSRLNL ELRERLGYTY
GAGSRFDLRR DSGVFFMSAQ VEADTTAHSV TSSLEQVAKL RESGVTEEEL AAVRDSNTVG
LPVTYATARA MAGALVDMVV HDLPEDHVDR VRAGYERLTK ESLDSAATEY LRPEESVVVV
VGDAERLRQP LTDTGVGPVE VRTPDSLWT