Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2934 |
Symbol | |
ID | 9246786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3505027 |
End bp | 3506247 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | protein serine/threonine phosphatase |
Protein accession | YP_003680850 |
Protein GI | 297561876 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG CGACAGTCCC GGCCGCCGAG GCCACGCAGC TGTTCGAGGA ACTGGTCAGG TCCGTGCACG GGTGCGCCCC GATCGAGGTC CTCGAAGCCG CCGGCCGCTA CGGCGAGCGG ATCGGACTCA GCGGCATCTG CGTGTACCTG GTCGACCTGC AACAGCGACT CCTCGTCCCG CTGCTCGGCG GCCGGGCGCT CAAGGTCGAC TCCAGCGTGG CGGGGGAGGC CTACCGCTCC GAGACGTTGC GGCTGGTCGA GGGCGGCGAC GGCGAGCTGG GCCTGTGGCT GCCCCTGCGC GACGGCGCCG ACCGCATGGG GGTCGTGCAC ATCAGCGCTC CCGTGCTCGA CGAGTCCACC CTGCGCCGCT GCCACGCGCT CGCCTCGCTG CTGGCCCTGG TCGTGACCTC CAAACGCGCC TACAGCGACA CCTACGTCCG CCACACCCGC ACCCAGGCGA TGGACCTGCG CACCGAGATG CTGCGGGCCT TCCTGCCGCC CCGCACCCTG GGCACCTCGC GGGGTGTGTC CACCGCCGTC CTCGAACCCG CCTACCGTCT GGGCGGCGAC GCCTTCGACC ACTCGATCAC CAAGGAGACC CTGCACGCCG CCATCCTCGA CGCAATGGGG CACGACCTGG CCTCCGGACT GACCGCGTCC GTGGCCATGG CCGGGATCCG CAACGCCCGG CGCAACGGCG CCGACCTCGC CGAACTCACC GACAGCGTGG AGGGCGCGCT CACCTCCTGG CTCCCCGACC GCTTCTGCAC CGCCGTCTTC ACCTCCCTGG ACCTGTCCAC CGGGGAGTTC GCCTGGGTCA ACTGCGCCCA CCCCGCACCC CTGCTCCTGC GGCGCGGACT CCTGCTGGAG GACGCACTGG AGCGGACCCC CGAGGTGCCG CTCGGACTCG GCGGTGTGCT CGGCGAGGCC GAACCGCGCA CCGTGCACCG GGTCCTGCTC GAACCCGGCG ACCGGATCCT GCTCTACACC GACGGGGTGA CCGAGGCGCA CGACAGCCAG GGGCGGATGT TCGGCCTCGA ACGGTTCGCC GACTTCATCA TCCGCGCCAC CGCCGCCGAC GAACCCGCCC CGGAGACGCT GCGGCGCCTG GTCCACGCCA TCCACGACCA CCAGCGCGGC AGCTTCACCG ACGACGCCAC CATCATGCTG CTGGAGTGGC GCCCCGACGG CGGGGTGATG CCCCGGGTCG AGGGCTGCTG A
|
Protein sequence | MDDATVPAAE ATQLFEELVR SVHGCAPIEV LEAAGRYGER IGLSGICVYL VDLQQRLLVP LLGGRALKVD SSVAGEAYRS ETLRLVEGGD GELGLWLPLR DGADRMGVVH ISAPVLDEST LRRCHALASL LALVVTSKRA YSDTYVRHTR TQAMDLRTEM LRAFLPPRTL GTSRGVSTAV LEPAYRLGGD AFDHSITKET LHAAILDAMG HDLASGLTAS VAMAGIRNAR RNGADLAELT DSVEGALTSW LPDRFCTAVF TSLDLSTGEF AWVNCAHPAP LLLRRGLLLE DALERTPEVP LGLGGVLGEA EPRTVHRVLL EPGDRILLYT DGVTEAHDSQ GRMFGLERFA DFIIRATAAD EPAPETLRRL VHAIHDHQRG SFTDDATIML LEWRPDGGVM PRVEGC
|
| |