Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2171 |
Symbol | |
ID | 9246021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2592630 |
End bp | 2593799 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein serine/threonine phosphatase |
Protein accession | YP_003680099 |
Protein GI | 297561125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.529271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000569605 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGTGC AACACGCGTC CAACGAGCTG ATGGTCTCCC TCGTGCGGGC CGGACACCTG GCCACCTTCG AGGAACTACC CGCACTCGTG GCCAAGAAGG CCGACAGCGC CGGGCTCTCC CAGGCGCGCA TCTACCTGGC CGACCACCAG CAGCAGGTCC TGCGCGAGGT CACGGGAGAG GGCATCGACG CCCACCGGGG CGGTGAGGAC CTGCTGGTGG ACGACTCCCC GGCCGGTCAG GCCTACGTCA CGGGTGTGAC GGTGCGGATG GAGGACGAGC AGCGCTACTG GGTGCCGGTC CTGGACGGCG CCGAGCGCCT GGGCGTGCTG CACGTGAGCT ACCCGGGCGA CCCCGACCGC TCGGCGATGC GCGACCTGGC CTCCATGGTG GCGCTGCTGG TCATCGCCAA GCGCTCCAAC AGCGACGCCT ACGCCCGGCT GATCCGCACC AAGCCCATGT CGGTCTCGGC GGAGATGCAG TGGACGCTCA TGCCGCCCGG CACCTTCGCC GACTCGCGGG TGACGATCTC GGCCGCCACC GAACCGGCCT ACGACAACGC CGGGGACTCC TTCGACTACG CCCTGGACGG GGAGACCGCC CACCTGGCGA TGTTCGACGC CATGGGCCAC GACACCGCCG CCGGGCTCAT CGCGAACCTG GCCGTGGGGG CCTTTCGCAA CGAGCGCCGC AAGGGCACCC CGCTGGTCGA CGTGTGCCGG GGGGTGGAGC ACACCCTGAT CCAGGAGTTC GTGCGCACCC GATTCGCCAC CGCGATCCTG GCCGAGCTGA ACATGGCCAC CGGGGAGCTG TACTGGGTCA ACTGCGGGCA CCTGCCGCCG GTGCTCATCC GGGGCGAGGA GGTCCGCGAC CTGGAGTGCG AGCCCTCCCA CCCGCTGGGG ATGGACCTGG GGCTGCCGGT GACGGTGTGC CGCGAACAGC TCGAACCCGG CGACCGGCTG CTGCTGTACA CCGACGGCAT CATCGAGGCG CGCGACTCCG AGGGGCGCGA GTTCGGTGTG GAGCGGTTCG TGGACTTCGT CATCCGCCAC CAGGCCGACA ACATGCCGGT TCCCGAGACG CTGCGGCGCC TGGTGCACGC GGTGCTGGAG TACCACCACG GCAGGTTCGG CGACGACGCC ACGGTGCTCT TCTGCGAGTG GCACGGCTGA
|
Protein sequence | MTVQHASNEL MVSLVRAGHL ATFEELPALV AKKADSAGLS QARIYLADHQ QQVLREVTGE GIDAHRGGED LLVDDSPAGQ AYVTGVTVRM EDEQRYWVPV LDGAERLGVL HVSYPGDPDR SAMRDLASMV ALLVIAKRSN SDAYARLIRT KPMSVSAEMQ WTLMPPGTFA DSRVTISAAT EPAYDNAGDS FDYALDGETA HLAMFDAMGH DTAAGLIANL AVGAFRNERR KGTPLVDVCR GVEHTLIQEF VRTRFATAIL AELNMATGEL YWVNCGHLPP VLIRGEEVRD LECEPSHPLG MDLGLPVTVC REQLEPGDRL LLYTDGIIEA RDSEGREFGV ERFVDFVIRH QADNMPVPET LRRLVHAVLE YHHGRFGDDA TVLFCEWHG
|
| |