Gene Ndas_2934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2934 
Symbol 
ID9246786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3505027 
End bp3506247 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID 
Productprotein serine/threonine phosphatase 
Protein accessionYP_003680850 
Protein GI297561876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG CGACAGTCCC GGCCGCCGAG GCCACGCAGC TGTTCGAGGA ACTGGTCAGG 
TCCGTGCACG GGTGCGCCCC GATCGAGGTC CTCGAAGCCG CCGGCCGCTA CGGCGAGCGG
ATCGGACTCA GCGGCATCTG CGTGTACCTG GTCGACCTGC AACAGCGACT CCTCGTCCCG
CTGCTCGGCG GCCGGGCGCT CAAGGTCGAC TCCAGCGTGG CGGGGGAGGC CTACCGCTCC
GAGACGTTGC GGCTGGTCGA GGGCGGCGAC GGCGAGCTGG GCCTGTGGCT GCCCCTGCGC
GACGGCGCCG ACCGCATGGG GGTCGTGCAC ATCAGCGCTC CCGTGCTCGA CGAGTCCACC
CTGCGCCGCT GCCACGCGCT CGCCTCGCTG CTGGCCCTGG TCGTGACCTC CAAACGCGCC
TACAGCGACA CCTACGTCCG CCACACCCGC ACCCAGGCGA TGGACCTGCG CACCGAGATG
CTGCGGGCCT TCCTGCCGCC CCGCACCCTG GGCACCTCGC GGGGTGTGTC CACCGCCGTC
CTCGAACCCG CCTACCGTCT GGGCGGCGAC GCCTTCGACC ACTCGATCAC CAAGGAGACC
CTGCACGCCG CCATCCTCGA CGCAATGGGG CACGACCTGG CCTCCGGACT GACCGCGTCC
GTGGCCATGG CCGGGATCCG CAACGCCCGG CGCAACGGCG CCGACCTCGC CGAACTCACC
GACAGCGTGG AGGGCGCGCT CACCTCCTGG CTCCCCGACC GCTTCTGCAC CGCCGTCTTC
ACCTCCCTGG ACCTGTCCAC CGGGGAGTTC GCCTGGGTCA ACTGCGCCCA CCCCGCACCC
CTGCTCCTGC GGCGCGGACT CCTGCTGGAG GACGCACTGG AGCGGACCCC CGAGGTGCCG
CTCGGACTCG GCGGTGTGCT CGGCGAGGCC GAACCGCGCA CCGTGCACCG GGTCCTGCTC
GAACCCGGCG ACCGGATCCT GCTCTACACC GACGGGGTGA CCGAGGCGCA CGACAGCCAG
GGGCGGATGT TCGGCCTCGA ACGGTTCGCC GACTTCATCA TCCGCGCCAC CGCCGCCGAC
GAACCCGCCC CGGAGACGCT GCGGCGCCTG GTCCACGCCA TCCACGACCA CCAGCGCGGC
AGCTTCACCG ACGACGCCAC CATCATGCTG CTGGAGTGGC GCCCCGACGG CGGGGTGATG
CCCCGGGTCG AGGGCTGCTG A
 
Protein sequence
MDDATVPAAE ATQLFEELVR SVHGCAPIEV LEAAGRYGER IGLSGICVYL VDLQQRLLVP 
LLGGRALKVD SSVAGEAYRS ETLRLVEGGD GELGLWLPLR DGADRMGVVH ISAPVLDEST
LRRCHALASL LALVVTSKRA YSDTYVRHTR TQAMDLRTEM LRAFLPPRTL GTSRGVSTAV
LEPAYRLGGD AFDHSITKET LHAAILDAMG HDLASGLTAS VAMAGIRNAR RNGADLAELT
DSVEGALTSW LPDRFCTAVF TSLDLSTGEF AWVNCAHPAP LLLRRGLLLE DALERTPEVP
LGLGGVLGEA EPRTVHRVLL EPGDRILLYT DGVTEAHDSQ GRMFGLERFA DFIIRATAAD
EPAPETLRRL VHAIHDHQRG SFTDDATIML LEWRPDGGVM PRVEGC