Gene Ndas_2180 details


Gene Information

Locus tag: Ndas_2180
Symbol:
ID: 9246030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2604052
End bp: 2605206
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: Dual specificity protein phosphatase
Protein accession: YP_003680108
Protein GI: 297561134
COG category:
COG ID:
TIGRFAM ID:
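
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2604052
end_bp = 2605206

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: one trailing stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1155 384
```

Under these assumptions the result agrees with the listed Gene Length (1155 bp) and Protein Length (384 aa).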


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000036575
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000547398
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCCGCAG TACCCGCGCC CCTGCTCGAC TTCCACGACC CCGCCAGCGA CCGGGCCGTC 
GGCGCCCTGT TCGGCGCCCC GACCGCCGCC GCCCTGGTCG GCACCCCACC CTCCCTGGCC
GCGCACGCCA CCGGACCCGG CACCGCCCAC GCCCGCCTGG TCCGCGCCGC GCTCACCGAC
CTGACCGGAC CCGACCAGCC GCCCCGGCCC CCCGGGGACC CCGTCGAGCA CGCCTGGGCC
GCGGCGCTGC GCACCGCCGC CCGCACCGGC GCCCTGCCCT CCTTCGCCCC CGCCCTGGAG
CACACCACCG CCGACGACCC CCTGGGCGCC TCCTGGCGTG CGTTGACACG CACCCGCCGC
CCCGCCGAGG ACCCCGCCCG CGGCTCCTTC GCCTGCACCC ACCTGGTCGA CTCCGTCTGG
TCGGCCCACG CCGCCGCCGG GGACCGGGCC GCCCTGTACA CCGGCACCCT GGCCACCGCC
CTGTGGGGCG CCTCCGCCCT GCCCCTGCAC GCCCTGCGCC GCCTGTCCGA GACCACCGAC
CCCCACACCC TGACCACCAC CGCCCTGACC ACCGCCCGCG GCCCCCACCC CACCCTGTGG
CCCGAGCACC CCGTCCTGGT CGCCGAACCC CGCCGCCTGC CGCCCTTCGC CGTCCCCCAC
CCCCTGGACC CGGGCGTCCT GCTGGGCAAC CTCGACCACC TGCGCGCCCA CCCCGACACC
GTGGACGCCG CGGTCTCGCT GTGCCGCACC CACCCCAGCG ACGCACCCCA CCTGCCCCCA
GCCGACTGGG TGCGGGTGTG GCTGCACGAC CGCAGGGGCG CCAACTCCCA CCCGCACTTC
ACCCTGGACG AGGCCGCCGC CGCGGTCGCG GCCCTGCGCG CGGAGGGCAA ACGCGTCCTG
CTGCACTGCT GGGCGGGCGC CTCGCGCACC CCGGCCGTGG CCACCCGCTA CGCCGTCACC
GCCCTGGGCG CCCCCGTCCT GCCCACCCTG GCCGCCATGA TCCGCACCGT GGGCGGGCAC
CTGGACAACC CCAGCCTGTC CACCGCCGTC GCCCGGCTCA GCGGCGTGGA CCTGCCCGAC
CCGGCCGCCA CCCTCTTCCC CGAGGGCGTG CCGCCCCGGC GCCCCGAACT GCCCGAACCC
CACATCACCG GGTGA
 
Protein sequence
MPAVPAPLLD FHDPASDRAV GALFGAPTAA ALVGTPPSLA AHATGPGTAH ARLVRAALTD 
LTGPDQPPRP PGDPVEHAWA AALRTAARTG ALPSFAPALE HTTADDPLGA SWRALTRTRR
PAEDPARGSF ACTHLVDSVW SAHAAAGDRA ALYTGTLATA LWGASALPLH ALRRLSETTD
PHTLTTTALT TARGPHPTLW PEHPVLVAEP RRLPPFAVPH PLDPGVLLGN LDHLRAHPDT
VDAAVSLCRT HPSDAPHLPP ADWVRVWLHD RRGANSHPHF TLDEAAAAVA ALRAEGKRVL
LHCWAGASRT PAVATRYAVT ALGAPVLPTL AAMIRTVGGH LDNPSLSTAV ARLSGVDLPD
PAATLFPEGV PPRRPELPEP HITG
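
The gene and protein sequences above are printed in space-separated groups of ten, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned and translated, shown on the first and last lines of the gene sequence only. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not taken from any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ('*' marks a stop codon).
# Assumption: table 11 uses the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 1; stops appear as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCGCAG TACCCGCGCC CCTGCTCGAC TTCCACGACC CCGCCAGCGA CCGGGCCGTC"
last_line = "CACATCACCG GGTGA"

print(translate(clean(first_line)))  # MPAVPAPLLDFHDPASDRAV
print(translate(clean(last_line)))   # HITG*
print(gc_fraction(clean(first_line)))
```

The two translated fragments match the start (MPAVPAPLLD FHDPASDRAV) and end (HITG) of the protein sequence listed above. Passing the whole cleaned gene sequence to `gc_fraction` would give a value comparable to the GC content field.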