Gene Ndas_3423 details


Gene Information

Locus tag: Ndas_3423
Symbol:
ID: 9247290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4092172
End bp: 4094265
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Prolyl oligopeptidase
Protein accession: YP_003681334
Protein GI: 297562360
COG category:
COG ID:
TIGRFAM ID:
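
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length_bp(4092172, 4094265)
print(length, protein_length_aa(length))  # prints "2094 697"
```

Under these assumptions the result agrees with the Gene Length (2094 bp) and Protein Length (697 aa) fields.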


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAAC TCCGCTACCC CCAGGCCGCA CGGCTGGACA TCATCGAGAA CCTGCACGGT 
CACACGGTCG GTGACCCCTA CCGGTGGTTG GAGGACGGCG ATTCGGCGCA GACCAAGGAG
TGGGCTGCCG CGGAGGACGC CCTGTACACC GAGGTCTCAT CACGGTTCAC CACAAGGAAC
TGGTTCGCCG ACCAGGTGCG GTCCCTGCTG GGCGCCGGAA GCGTCGGGGC GCCCGTGTGG
CGGGGTGAGC GCCGCTTCTT CGTCCGCCGC ACCGCCGAAC AGGAGCACGG TGTGCTCTAC
GTCCAGGACG GCGACGGCCC CGAGCGGGTG CTGGTGGACC CCGGGGCGCT CGACCCCCAG
GGTCTGACCA CACTGGACGC CTGGAAGCCC GACCGCTCCG GCGCGCTCCT GGCCTACCAG
CTCTCCGAGG GCGGCGACGA GGAGTCGGTG CTGCGCGTCG TGGACGTGGC CAGCGGAGAC
CTGGTGGACG GACCCGTCGA TCGGGTCCGG TACAGCCCGA TCGCGTGGCT GCCCGACTCC
TCCGCCTTCT ACTACGTGCG CCGGCTCCCC CGGGACCAGG TGCCCGAGGG TGAGGAGAGC
TACCACCGCC GCGTCTACCT GCACCGCCTG GGCACCCCCG CCGAGCAGGA CACCCTGGTC
TTCGGCGAGG GCCGCGACAA GACCGAGTAC TTCACCCCGG TCGTGAGCCG GGACGGGCGC
TGGCTGGTGC TGCTGGCCAA CCGCGGCACC TCGGCGGCCA CCGACGCGTG GATCGCCGAC
CTGTCCGACG GCGGCCTGGA CGCGCCGCGC CTGGTCCCCT TCCAGGAGGG CGTGGACGCC
TCCCTGTACC CGCACGTGGG CCGGGACGGG CGCCTGTACC TGTTCACCGA CCGGGACGCC
CCGCGCGGAC GCCTGTGCGT GGCCGACCCC GCCGACCCCG GCTACGGGAA CTGGACCACT
CTGGTCCCGG AGGACCCCGA GGCGGTGCTG GAGGGGTACG CGGTCCTGGA CGGCCCCGGG
ATGGAGCGGC CCGAACTGCT GGTCTCGCGC AGCCGCCACG CCGTCAGCGA GATCACCCGG
CACGCCCTGG CCACCGGCGA ACTCCTGGGC GGGCTGGACA TGCCCGGCCT GGGCACCGTG
ACCGGGCTGC TCGAACGCCC CGAGGGCGGG CACGAGGCGT GGTTCGGCTA CACCGACAAC
ACCCACCCCT CGTCGGTGTA CCGCTACGAC GCGCTCTCGG GCGAGGTGTC GCTGTGGGCC
GACGCCCCCG GTGACATCGA CCTGCCCGAC GTGCGGGTGG AGCAGGTCAC CTACACCTCC
CGAGACGGCA CGGCGGTGCG CATGCTCGTG GTCTCGCCCC AGGCGCCCGC CGAAGGACCG
CGGCCGACCA TCCTGTACGG CTACGGGGGC TTTCGCATCT CCCTCACTCC CGCCTACTCG
GCGGTGGCGC TGGCCTGGGT GCGCGCGGGC GGCGTGTACG CCATCGCCAA CCTGCGCGGC
GGACTGGAGG AGGGCGAGGA GTGGCACCGG GCGGGCATGT TCGCCGACAA GCAGAACGTC
TTCGACGACT GCGCGGCGGC GGCCGAGCAC CTGGTCGCCA CCGGGGTGAC CGACCCCGGG
CAGCTGGCCG TCATGGGCGG CAGCAACGGC GGACTGCTGG TGGGGGCCAT GGTCACGCAG
CGGCCGGACC TGTTCACCGC CGCGGTCTGC TCGGCCCCGC TGCTGGACAT GGTCCGCTAC
GAGCGGTTCG GACTGGGCCA GTTGTGGAAC GTCGAGTACG GCACCGCCGA CGACCCCGAG
CAGCTGGGGT GGCTGCTGGG GTACTCGCCC TACCACAACG TGCGCGAGGG CACCCGCTAC
CCGGCGACGC TGTTCACCGT GTTCGACAAC GACACCCGCG TGGACCCGCT GCACGCGCGC
AAGCTGTGCG CGCTGATGCA GCACGCCACG GGGGCCGCGC CCGAGGAGCG GCCGATCCTG
CTGCGCCGGG AGTCGGAGGT GGGCCACAGC TCGCGGTCGG TGAGCCGCAG TGTGGCGCTC
AACTCCGAGC AGCTGGCCTT CCTCGCCCAC TACCTGGGGC TCCGCGTGGA CTGA
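
The Gene Length and GC content fields can be recomputed from a sequence block laid out as above (groups of ten bases separated by spaces). This is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Only the first line of the gene sequence is shown here; pass the whole
# block to reproduce the record-level values.
first_line = "ATGCGTGAAC TCCGCTACCC CCAGGCCGCA CGGCTGGACA TCATCGAGAA CCTGCACGGT"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}%")
```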
 
Protein sequence
MRELRYPQAA RLDIIENLHG HTVGDPYRWL EDGDSAQTKE WAAAEDALYT EVSSRFTTRN 
WFADQVRSLL GAGSVGAPVW RGERRFFVRR TAEQEHGVLY VQDGDGPERV LVDPGALDPQ
GLTTLDAWKP DRSGALLAYQ LSEGGDEESV LRVVDVASGD LVDGPVDRVR YSPIAWLPDS
SAFYYVRRLP RDQVPEGEES YHRRVYLHRL GTPAEQDTLV FGEGRDKTEY FTPVVSRDGR
WLVLLANRGT SAATDAWIAD LSDGGLDAPR LVPFQEGVDA SLYPHVGRDG RLYLFTDRDA
PRGRLCVADP ADPGYGNWTT LVPEDPEAVL EGYAVLDGPG MERPELLVSR SRHAVSEITR
HALATGELLG GLDMPGLGTV TGLLERPEGG HEAWFGYTDN THPSSVYRYD ALSGEVSLWA
DAPGDIDLPD VRVEQVTYTS RDGTAVRMLV VSPQAPAEGP RPTILYGYGG FRISLTPAYS
AVALAWVRAG GVYAIANLRG GLEEGEEWHR AGMFADKQNV FDDCAAAAEH LVATGVTDPG
QLAVMGGSNG GLLVGAMVTQ RPDLFTAAVC SAPLLDMVRY ERFGLGQLWN VEYGTADDPE
QLGWLLGYSP YHNVREGTRY PATLFTVFDN DTRVDPLHAR KLCALMQHAT GAAPEERPIL
LRRESEVGHS SRSVSRSVAL NSEQLAFLAH YLGLRVD
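
The protein sequence can be re-derived from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to build this record. It assumes the gene sequence is given on the coding strand and uses the standard codon assignments, which translation table 11 shares. Table 11 additionally allows alternative start codons, which are read as methionine at the first position; this sketch does not handle that case, and this gene starts with ATG. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    residues = []
    # Step through complete codons; trailing bases that do not fill a codon are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First two groups of the gene sequence and its final codons.
print(translate("ATGCGTGAAC TCCGCTACCC"))  # prints "MRELRY"
print(translate("TACCTGGGGC TCCGCGTGGA CTGA"))  # prints "YLGLRVD"
```

The two outputs match the start (MRELRY) and the end (YLGLRVD) of the protein sequence listed above.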