Gene Ndas_4404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4404 
Symbol 
ID9248279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5238439 
End bp5239851 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content77% 
IMG OID 
Productprotein serine/threonine phosphatase 
Protein accessionYP_003682299 
Protein GI297563325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.275687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGG TGCGGACCTG CCCCGCCTGC TCGGACAGGG TGTCCACGGA GGACGCCTTC 
TGCGAGGGGT GCGGACGCCC CCTCCCCGAG GGCGCGGAGA ACGCGGGGCA CCCCGACAGC
ATGCCCACCG CCCCGCAGGC GAGCCTGGCC GGTGCGGTAC CGCCCGGCGG GTGGTCCCGG
GACGGGGCCG ACGGCGGCGC GCGGAACGGG GGAGCCCCGG GTCCGTCCGG TGCCGTGTCG
CCCGGCGCCG ACGCGCGTCC CGGCCTGGTG CCCGACGACG GCGCACCCAC GGCTCCGCAG
GTGAGCCTGC GGTCCCTCGA CACGGGCGGG CGGGACGCGC CCGCGCCGCA GGCGCTGCCC
GACTCCGCCC CGCACGCGGC CGACTCCATG GTCACCCAGC CGATACGCCG GGACAGCCTG
CCCTCGTTCG CGTCCTCCGC GCCCCCGGCC GCCCAGCCGG AGGCCGTCCC CGACTGGCCG
CCGCCCGCCA CCGGGAGCAA CCCGGTGCGG CCCGCCAACC CCGGCCTGTG CGCGTGGTGC
CCCGGAGCGG TCAGCGACGG CTACTGCGAG CGGTGCGGCC TCCTCCAGCC CACCGGGCGC
GACCACGTCG AGGTGCGCAC GCGCGCCGCC GTCGGCGTCA GCGACCGCGG GCTGCGGCAC
AGGCGCAACG AGGACGCCAT GGCGATCCGT GTGATCGACG CCGACCACCC CCGCGCACCC
GGCGTGGTCT GCGCCGTGGT CTGCGACGGG GTGTCCAGCT CGCCGCGCTC GGACGAGGCC
TCCCGCGTCA CCGCCGAGAC CGGAGTGGCC GTCCTCGCCG AGCGCGTCAG CCAGGGCGCC
GACCCCCGCG AGGCCACCGG CGCGGCGATG ATCCGGGCCG CCGAGGCGGT CGCCGGGATC
GCCGACTCGC CCCGCTCCGC GCCCGCGTGC ACCTTCGTGT CGGCGGTCTA CGACCCCGCC
GCGGGCACCG TCACCGTCGG CTGGGTCGGC GACAGCCGCG CCTACTGGCT CTCCGGAGGC
CCCACTTCCA GCGCTTCGGC CCTGCTGACC AGGGACGACT CCTGGAGCGA GGCGATGGTG
CAGATGGGGG CGCTCTCCCG CGAGGAGGCG ATGCGCTCCT CCAACGCCCA CGCCCTCGTC
GCGTGGATGG GCGCCGACTC CGGCGAGATC GACGCCCACA TCTCCACCGT GACCCCGACC
GGCCCCGGCG CGGTCGTGCT GTGCAGCGAC GGCCTGTGGA ACTACTTCCC CGAGGCGCAG
GCGCTCACCG ACGCCGTCCC GGGGGCGGGG GCCAGACCCC ACGAGGCCGC GCGCGCCTAC
GTCGACCTCG CCCTGGAGGC GGGCGGCAAG GACAACATCA CCGTCGTGAT CGTTCCCGTG
CCCGCTGGGG GTCCCCGTGC CCGACACGAC TGA
 
Protein sequence
MTKVRTCPAC SDRVSTEDAF CEGCGRPLPE GAENAGHPDS MPTAPQASLA GAVPPGGWSR 
DGADGGARNG GAPGPSGAVS PGADARPGLV PDDGAPTAPQ VSLRSLDTGG RDAPAPQALP
DSAPHAADSM VTQPIRRDSL PSFASSAPPA AQPEAVPDWP PPATGSNPVR PANPGLCAWC
PGAVSDGYCE RCGLLQPTGR DHVEVRTRAA VGVSDRGLRH RRNEDAMAIR VIDADHPRAP
GVVCAVVCDG VSSSPRSDEA SRVTAETGVA VLAERVSQGA DPREATGAAM IRAAEAVAGI
ADSPRSAPAC TFVSAVYDPA AGTVTVGWVG DSRAYWLSGG PTSSASALLT RDDSWSEAMV
QMGALSREEA MRSSNAHALV AWMGADSGEI DAHISTVTPT GPGAVVLCSD GLWNYFPEAQ
ALTDAVPGAG ARPHEAARAY VDLALEAGGK DNITVVIVPV PAGGPRARHD