Gene Ndas_2658 details


Gene Information

Locus tag: Ndas_2658
Symbol:
ID: 9246509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3166334
End bp: 3168262
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003680581
Protein GI: 297561607
COG category:
COG ID:
TIGRFAM ID:
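
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The sketch below assumes the coordinates are inclusive at both ends and that the final codon is a stop codon that encodes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 3166334
end_bp = 3168262

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1929 642
```

Both results agree with the lengths listed in the record.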


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.55127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.367777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCTC CGTCGAAGCT GATCACCGTC GAGGAGTTCT TCGGCCCGCC CGTCCGCAGC 
CGGGCCCTTC TGTCACCCGA CGGCACGAGG GTCGCCTACC TCGCTCCCTG GCGCGGTCGG
CTCAACGTGT TCGTCCGCGA CCCGGACTCG GACTGGACCG CCCCCGACCA GGTACACGAG
GCCGACGCCC CGGGCGTCCG GCGCGTCACC TCCGACACCC GGCGCAACAT CGACGCCTTC
TTCTGGACCG CCGACGGCCG CTACCTGCTG TTCCAGCAGG ACACCGACGG CGACGAGAAC
TGGCACCTGC ACCGCGTGGA CCCGAACCGG CCCGACGAGC CCGCGGTCGA CCTGACCCCC
TTCGAGGGCG TGCGGCTGCT CGGCGCGCAA CTCCCGCCGG ACCGCCCCGG CACCGCCTTC
GTACAGCTCA ACATGCGCCG TCCCGACCTG GCCGACCTGT TCGAGCTCGA CCTGGAGACC
GGCCGCCTGA CCACCGCCGC GGAGAACCCC GGCGACGTCC TCTCCTGGCT GCGCACCCCG
GACCGGCTGC TGGCGTTCAC CATGGAGGAG GGAGGCGACC ACGTGCTGTC GGAGCACACC
GAGGGCGCAC GGCGCGCGAT CGCGCGGTTC CCCGGCACCG ACGCCCTCTT CGGCATCCTC
CCGGCTGTGC TCACCCCGGA CGGGAACGGA CTGTGGATCG GTTCGTCGCG GGGTTCCGAC
CGCACCCGCC TGGTCCGGCT CGACCTGGAG ACCGGCGAGC AGGCCGGCGT GGACAGCCAC
CCCGTGTTCG ACCTGGACAC CCCGCGCCCC GAGGCCGACC CGCGTTTCCC GTCCTCGCTG
ATCCTGCACC CGGGAACCGG GGACCTGCTC GGCGCCCGCT ATCTCGGCAC CCGTCAGGAG
ATCCACGCGC TCGACCCGCG CTTCGCCGAG GTCCTGCCGC GGCTGGCCGA GCTGTCCGAC
GGCGACCTGG CCCACGTCTC CTGCGACACC GCGGCGCGGC GCTGGGTGGT GGACTTCACC
CACGACCGCG ACCCCGGCGT CACCTGGTTC TACGACCACG CCACGGGACG GGCGCGCCGC
CTCTTCCGGC CCTTCCCCCA CCTGGACCCG GCCGAGTTGG CCCCGGTCAC CCCGGTCACC
GTCAGCGCCC GCGACGGCCT GACCCTGCCC TGCCACCTCA CCCTGCCGGT CGGGGTCGAA
CCGCGCGACC TGCCGACCGT GCTGCTGGTG CACGGCGGAC CGTGGTACCG CGACAGCTGG
TGCTACGACC CGGAGGTGCA ACTCCTGGCC AACCGCGGTT ACGCGGTGCT GCAGGTCGAC
TTCCGCGGCT CCACCGGCTA CGGCAAGGCC CACACACAGG CCGCGATCGG CCAGTTCGCC
GGGCGCATGC ACGACGACCT GATCGACGCC CTCGACTGGG CGGTCGAACA GGGCTACACC
GACCCGGACC GGGTGGCGGT CTACGGCTGC TCCTACGGCG GTTACGCGGC GCTGGTCGGA
GCGGCGTTCA CCCCGGACAG GTTCGCCGCC GCGGTCAGCT ACACCGGAAT GTCCGACCTG
GTCGACCTCG TCGAGTCGGT CGTCCCGTTC GCCCGCCGTA CCGTCGAGAA CAGCTACCTG
CGCTACATCG GCGACCCGGA CGACCCCCGC CAGAGGGCCG ACATGCTCGC CCGCTCGCCC
ATCAGCCGGG TCGACGACAT CACCGCGCCG GTTCTGCTGA TCCACGGCGC CAACGACGTC
CGCGTCCACC GGCGCAACTC CGACCGGGTC TTCGACGCGC TCCGCTCCCG CGGCGCCGAG
GTCGAGTACC TGCTGAACGA GACCGAGGGC CACTGGTTCA CCAACCCGGA CAGCAACATC
GAGTTGTACG GGAGGCTGGA GCGCTTCCTG GCCCGCCACC TGGGCGGGCG GTCCGCGACC
GGGTCCTGA
 
Protein sequence
MTAPSKLITV EEFFGPPVRS RALLSPDGTR VAYLAPWRGR LNVFVRDPDS DWTAPDQVHE 
ADAPGVRRVT SDTRRNIDAF FWTADGRYLL FQQDTDGDEN WHLHRVDPNR PDEPAVDLTP
FEGVRLLGAQ LPPDRPGTAF VQLNMRRPDL ADLFELDLET GRLTTAAENP GDVLSWLRTP
DRLLAFTMEE GGDHVLSEHT EGARRAIARF PGTDALFGIL PAVLTPDGNG LWIGSSRGSD
RTRLVRLDLE TGEQAGVDSH PVFDLDTPRP EADPRFPSSL ILHPGTGDLL GARYLGTRQE
IHALDPRFAE VLPRLAELSD GDLAHVSCDT AARRWVVDFT HDRDPGVTWF YDHATGRARR
LFRPFPHLDP AELAPVTPVT VSARDGLTLP CHLTLPVGVE PRDLPTVLLV HGGPWYRDSW
CYDPEVQLLA NRGYAVLQVD FRGSTGYGKA HTQAAIGQFA GRMHDDLIDA LDWAVEQGYT
DPDRVAVYGC SYGGYAALVG AAFTPDRFAA AVSYTGMSDL VDLVESVVPF ARRTVENSYL
RYIGDPDDPR QRADMLARSP ISRVDDITAP VLLIHGANDV RVHRRNSDRV FDALRSRGAE
VEYLLNETEG HWFTNPDSNI ELYGRLERFL ARHLGGRSAT GS
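
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how that cleanup and a GC fraction calculation might look; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used here as an excerpt.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCGCTC CGTCGAAGCT GATCACCGTC GAGGAGTTCT TCGGCCCGCC CGTCCGCAGC "
last_line = "GGGTCCTGA"

excerpt = clean_sequence(first_line)

# The excerpt covers only 60 bases, so its GC fraction need not match the
# 72% reported for the full gene.
print(len(excerpt), round(gc_fraction(excerpt), 3))

# The gene begins with an ATG start codon and ends with a TGA stop codon.
print(excerpt.startswith("ATG"), clean_sequence(last_line).endswith("TGA"))
```

Applying `gc_fraction` to the complete cleaned gene sequence, rather than the excerpt, would be the way to compare against the GC content field.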