Gene Ndas_2888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2888 
Symbol 
ID9246739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3447101 
End bp3449137 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content74% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003680805 
Protein GI297561831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0760896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTCCGTT CCTGGTATGA ACTGCCGGAG TACGTCCGAC TTCGCCGTGT CAACGGACTG 
CGCCTCTCGC CGGACGGAAC CCGTCTGGTC GCGCCCGTCA GCGGCCCCGC CCCCGACGCC
AAGTCCTTCC GCAGCGCCCT GTGGGAGATC GACACCGCCC CCGAGTCCGA GGGCGGGCGC
GCCCCCAGAC GCCTCACCCG CTCGGCCAAG GGCGAGAGCG GCGTCGGCTT CCTCCCCGAC
GGCTCTGTCC TGTTCACCAC CGGCCGCCCC GACCCCGAGG CCGAGGCCGA CGCCAAGGCC
CGCACCGCCC TGTGGCTCCT GCCCGCAGAC GGCGGCGAGG CCCGCCAGGT CGCCTCCCGC
CCCGGCGGCA TCGGCTCCTT CACGGTCGCC CGCGACTCCG GACTGGTCGC CCTCACCGCC
GACACCCTCC CCCGCAGCGA GGACGAGGAG GCCGACAGGG AGGCGCGCAA GGCCCGCGAG
GAGGCCGGGG TCACCGCCAT CCTGCACGAG GCGCTGCCCG TGCGCTCCTG GGACAGCGAC
ATCGGCCCCG GACACCCCCG CTACCTGGTC GCCGAACCGC CCGCGGACGA GGACTCCAGG
CTCGGCGACC CGCGCGACCT GACCCCCGAC GCGGGAATGG CGCTCCTGGG CGCGAGCGGC
GATCTCACCC CCGACGGCAC CACCCTGGTC ACCGAGTGGC AGGTCCCCGT CGGCCGGGGC
GCCCGGCGCA CCGACGTCGT CGCCATCGAC ACCGCCACGG GCCGGCGCCG CACCCTGGCC
ACCGACCCCG CCCACGACTT CGACAGCCCG CTCGTCTCCC CCGACGGCCG CCACGTCCTG
CTCGTGCGCA GCTCCCAGGG CGACTACGAC GGCGAACCCC GGGACGAGAC CGCCTGGCTG
GTCGACCTGG CCACCGGACA GGGCCGCGAC CTGCTCGCCG ACCACGAGCT GTGGCCCCGC
GAACTCGCCT GGGCCGCCGA CTCCGGCGCG GTGTTCCTCG TCGCCGACCA CGGCGGCCGC
CGGCCCGTCT TCCGGATCGA CCTGGCCAGC GGCGCGCTCA CCCGCGTCAC CGGCGACCAC
GGCGCCTACA GCAACCTCAA CCCGTCCCCC GACGGCCGCC ACGTCTACGC CCTGCGCGAC
GCCTGGGACG CCCCGCCCGC GCCCGTGCGC CTGGCCGCCG ACGCCGCCGA CGGCCAACCC
GTGCACCTGC GCACCCCCGG CTCCGAGCTC ACCATGCCCG GCACCCTCAC CGAGATCGAG
GCCACCGCCG ACGACGGCAC CCCGATCCGC TCCTGGCTGG TGCTGCCCGA GGAGGCCTCG
GCCGACTCGC CCGCCCCGCT CATGCTCTGG GTCCACGGCG GCCCCTACAT GAGCTTCAAC
GGCTGGTCCT GGCGCTGGAA CCCGTGGCTG CTCGCCGCGC GCGGCTACGC CGTCCTGCTG
CCCGACCCCG CCCTGTCCAC CGGCTACGGC CAGGACATGC TCCGCCGCGC CTGGGGCCAG
TGGGGGCCGC GCACCTTCGC CGACGTCATG GCCGTCACCG ACGCCGCCGA GGCGCGCGAG
GACATCGACG CCGAGCGCAC CGCCATGATG GGCGGCTCCT TCGGCGGCTA CATGGCCAAC
TGGATCGCCG GGCACACCGA CCGGTTCAGG GCCATCGTCT CCCACGCCTC CCTGTGGGGC
CTGGACGGCT TCAACGGCAC CACCGACTAC CCGCCGGTGT GGGAGCGCGA GTTCGGCACC
CCGCTGGAGC GCCCCGAGCG CTACACCCTC AACTCCCCGC ACCTGCACGC GGACCGCATC
CGCACCCCGA TGCTCGTCAT CCACGGCGAC AAGGACTACC GGGTGCCCAT CTCCGAGGGC
CTGCGCCTGT GGCGCGACCT GATGCTGCAC GAGGTGGACG CCAAGTTCCT GTACTTCCCG
GACGAGAACC ACTGGATCCT CACCCCGGGG AACGCCCGGA TCTGGTACGA GACGGTCTTC
GCCTTCCTCG ACCACCACGT GCACGGCAAG GAGTGGAGCA GGCCCGAACT GCTCTGA
 
Protein sequence
MVRSWYELPE YVRLRRVNGL RLSPDGTRLV APVSGPAPDA KSFRSALWEI DTAPESEGGR 
APRRLTRSAK GESGVGFLPD GSVLFTTGRP DPEAEADAKA RTALWLLPAD GGEARQVASR
PGGIGSFTVA RDSGLVALTA DTLPRSEDEE ADREARKARE EAGVTAILHE ALPVRSWDSD
IGPGHPRYLV AEPPADEDSR LGDPRDLTPD AGMALLGASG DLTPDGTTLV TEWQVPVGRG
ARRTDVVAID TATGRRRTLA TDPAHDFDSP LVSPDGRHVL LVRSSQGDYD GEPRDETAWL
VDLATGQGRD LLADHELWPR ELAWAADSGA VFLVADHGGR RPVFRIDLAS GALTRVTGDH
GAYSNLNPSP DGRHVYALRD AWDAPPAPVR LAADAADGQP VHLRTPGSEL TMPGTLTEIE
ATADDGTPIR SWLVLPEEAS ADSPAPLMLW VHGGPYMSFN GWSWRWNPWL LAARGYAVLL
PDPALSTGYG QDMLRRAWGQ WGPRTFADVM AVTDAAEARE DIDAERTAMM GGSFGGYMAN
WIAGHTDRFR AIVSHASLWG LDGFNGTTDY PPVWEREFGT PLERPERYTL NSPHLHADRI
RTPMLVIHGD KDYRVPISEG LRLWRDLMLH EVDAKFLYFP DENHWILTPG NARIWYETVF
AFLDHHVHGK EWSRPELL