Gene Ndas_5250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5250 
Symbol 
ID9249147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp411796 
End bp413145 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content71% 
IMG OID 
Productprotein serine/threonine phosphatase 
Protein accessionYP_003683136 
Protein GI297564163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.383211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCG CTCTCCGATA CGCGGCGTAC TCCGACGTAG GATGCCTCCG CGAAGGCAAC 
GAAGACTCCG GCTACGCCGG CCAGAACCTC CTCGCGGTCG CCGACGGCAT GGGCGGTTAC
GCCGGCGGCG AGGTGGCCAG CTCCATCGCG ATCTCCTCGA TCCGCCGCCT CGACTCCGAG
CCGCCCCGGT CCGACGAGAT GGCCGAGGTG CTCCAGCGCG CCGTCGAGCA GGCCAACGCC
TCCCTGTCGC GCAGGATCAT GGAGGAGCCC CAGCTGGAGA ACATGGGCAC CACCCTGACG
GCCATGCTCT GGGCCGGTCC GCGGGTGGCG CTCATCCACA TCGGCGACTC CCGCGCCTAC
CTCATGCGCG GCCCCCGCTT CGAGCAGATC ACCCACGACC ACACCCTGGT GCAGACCCTG
GTGGACGAGG GCAAGATCAC CGAGGAGGAG GTCGCCACCC ACCCGCAGCG CTCCCTCATC
CTGCGCGCCC TGGACGGCAA GAGCCCGGTC GACCCCGACA TCTCCATCAG CGAGGCCAAG
GCGGGCGACC GCTACCTGCT CTGCTCCGAC GGGCTCTCCG GCGTGGTCAG CAAGAAGACC
ATCCACGAGA CCCTCGCCAC CGAGGCCGAC CCGCGCAGCG CGGCCAAGAA GCTCATCGAG
CTGGCCATCC GCGGCGGCGG CCCGGACAAC ATCACCGCGG TCGTCGCCGA CGTCATCGAG
GCCGAGACCG ACAGCGAGGG GCCCACCCGC GCCTCCCAGG TGGTCGGCGC CGCGGACCAG
CGCCGCGAGA ACGTCGACCA GGGCAACGAC ACACCCGCCA GACGGGCGCA GGAACTGCGC
GGGGGATCCG GCGACACCGC CGAGATGGAC CCCGTCCGTG ACGAGCCCGG CCCGGACGCC
TACGCCTCGG GCGGCGCCTA CCAGGAGTCC TACAACGGCG ACTACGACGA CTACGAGGCC
CCGCCGGCCG ACCGGCGGGG CCGTCCCGAA CCCGAGTACC GCCGCAGGCG CTGGTGGCCG
ATGGTGCTGG TGTTCCTGGT CGTGGTCGCC GTCGTCGCGG GGGCCACGTA CTACTTCGGC
AGCCGCTACG TGAACAGCCA GTACTACGTG GGCCCCTCCC CCTCGGGGGA CACCGTCAGC
ATCTACCAGG GCATCAACAC CGACATCGCG GGCTTCAGCC TGTCGGAGGA GGTGGAGGAG
ACCGGGATCA CCCTGGACTC GCTCTCGGAG GCCGACCGCG GATCGGTGGA GAACACCCTG
CCCGCGGAGA GCCTGGACGA CGCCCGGGCG AGCGTGGACG TGCTGAGCGA GGGTACGGCC
GGGGCCCGGA CCGAGGAAGG GTCCGGGTGA
 
Protein sequence
MTIALRYAAY SDVGCLREGN EDSGYAGQNL LAVADGMGGY AGGEVASSIA ISSIRRLDSE 
PPRSDEMAEV LQRAVEQANA SLSRRIMEEP QLENMGTTLT AMLWAGPRVA LIHIGDSRAY
LMRGPRFEQI THDHTLVQTL VDEGKITEEE VATHPQRSLI LRALDGKSPV DPDISISEAK
AGDRYLLCSD GLSGVVSKKT IHETLATEAD PRSAAKKLIE LAIRGGGPDN ITAVVADVIE
AETDSEGPTR ASQVVGAADQ RRENVDQGND TPARRAQELR GGSGDTAEMD PVRDEPGPDA
YASGGAYQES YNGDYDDYEA PPADRRGRPE PEYRRRRWWP MVLVFLVVVA VVAGATYYFG
SRYVNSQYYV GPSPSGDTVS IYQGINTDIA GFSLSEEVEE TGITLDSLSE ADRGSVENTL
PAESLDDARA SVDVLSEGTA GARTEEGSG