Gene Ndas_2171 details


Gene Information

Locus tag: Ndas_2171
Symbol:
ID: 9246021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2592630
End bp: 2593799
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: protein serine/threonine phosphatase
Protein accession: YP_003680099
Protein GI: 297561125
COG category:
COG ID:
TIGRFAM ID:
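
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (the record does not state its convention) and that the gene ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2592630
END_BP = 2593799
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2593799 - 2592630 + 1 = 1170 bp, and 1170 / 3 - 1 = 389 aa, which agrees with the listed gene and protein lengths.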


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.529271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000569605
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGTGC AACACGCGTC CAACGAGCTG ATGGTCTCCC TCGTGCGGGC CGGACACCTG 
GCCACCTTCG AGGAACTACC CGCACTCGTG GCCAAGAAGG CCGACAGCGC CGGGCTCTCC
CAGGCGCGCA TCTACCTGGC CGACCACCAG CAGCAGGTCC TGCGCGAGGT CACGGGAGAG
GGCATCGACG CCCACCGGGG CGGTGAGGAC CTGCTGGTGG ACGACTCCCC GGCCGGTCAG
GCCTACGTCA CGGGTGTGAC GGTGCGGATG GAGGACGAGC AGCGCTACTG GGTGCCGGTC
CTGGACGGCG CCGAGCGCCT GGGCGTGCTG CACGTGAGCT ACCCGGGCGA CCCCGACCGC
TCGGCGATGC GCGACCTGGC CTCCATGGTG GCGCTGCTGG TCATCGCCAA GCGCTCCAAC
AGCGACGCCT ACGCCCGGCT GATCCGCACC AAGCCCATGT CGGTCTCGGC GGAGATGCAG
TGGACGCTCA TGCCGCCCGG CACCTTCGCC GACTCGCGGG TGACGATCTC GGCCGCCACC
GAACCGGCCT ACGACAACGC CGGGGACTCC TTCGACTACG CCCTGGACGG GGAGACCGCC
CACCTGGCGA TGTTCGACGC CATGGGCCAC GACACCGCCG CCGGGCTCAT CGCGAACCTG
GCCGTGGGGG CCTTTCGCAA CGAGCGCCGC AAGGGCACCC CGCTGGTCGA CGTGTGCCGG
GGGGTGGAGC ACACCCTGAT CCAGGAGTTC GTGCGCACCC GATTCGCCAC CGCGATCCTG
GCCGAGCTGA ACATGGCCAC CGGGGAGCTG TACTGGGTCA ACTGCGGGCA CCTGCCGCCG
GTGCTCATCC GGGGCGAGGA GGTCCGCGAC CTGGAGTGCG AGCCCTCCCA CCCGCTGGGG
ATGGACCTGG GGCTGCCGGT GACGGTGTGC CGCGAACAGC TCGAACCCGG CGACCGGCTG
CTGCTGTACA CCGACGGCAT CATCGAGGCG CGCGACTCCG AGGGGCGCGA GTTCGGTGTG
GAGCGGTTCG TGGACTTCGT CATCCGCCAC CAGGCCGACA ACATGCCGGT TCCCGAGACG
CTGCGGCGCC TGGTGCACGC GGTGCTGGAG TACCACCACG GCAGGTTCGG CGACGACGCC
ACGGTGCTCT TCTGCGAGTG GCACGGCTGA
 
Protein sequence
MTVQHASNEL MVSLVRAGHL ATFEELPALV AKKADSAGLS QARIYLADHQ QQVLREVTGE 
GIDAHRGGED LLVDDSPAGQ AYVTGVTVRM EDEQRYWVPV LDGAERLGVL HVSYPGDPDR
SAMRDLASMV ALLVIAKRSN SDAYARLIRT KPMSVSAEMQ WTLMPPGTFA DSRVTISAAT
EPAYDNAGDS FDYALDGETA HLAMFDAMGH DTAAGLIANL AVGAFRNERR KGTPLVDVCR
GVEHTLIQEF VRTRFATAIL AELNMATGEL YWVNCGHLPP VLIRGEEVRD LECEPSHPLG
MDLGLPVTVC REQLEPGDRL LLYTDGIIEA RDSEGREFGV ERFVDFVIRH QADNMPVPET
LRRLVHAVLE YHHGRFGDDA TVLFCEWHG
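
The gene and protein sequences above can be compared by translating the DNA codon by codon. The following is a minimal Python sketch, not part of the original record. It builds the standard codon table, whose amino-acid assignments are shared by translation table 11 (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases of the gene sequence listed above.
print(translate("ATGACGGTGC AACACGCGTC CAACGAGCTG"))  # MTVQHASNEL
# Last 30 bases of the gene sequence, ending in the TGA stop codon.
print(translate("ACGGTGCTCT TCTGCGAGTG GCACGGCTGA"))  # TVLFCEWHG*
```

The first output matches the first ten residues of the listed protein sequence, and the second matches its last nine residues followed by the stop codon. Because each line of the gene sequence holds 60 bases, every line starts in frame and can be translated on its own.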