Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3034 |
Symbol | |
ID | 9246887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3623015 |
End bp | 3624022 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | HAD-superfamily hydrolase, subfamily IIA |
Protein accession | YP_003680950 |
Protein GI | 297561976 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.224813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.86283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGC TCGCCGCAGA CCGCCCCCTG AACGAGATCC ACGACGCGAT GCTCCTGGAC CTGGACGGGG TCGTCTACAT CGGCCCGAGG GCCGTACCGG CGGCACCGGA GGCGGTGGGC AAGGCGCGGG CCGCCGGAGC GCGGGTGGCG TTCGTGACCA ACAACGCCGG GCGCACCCCG GCGCGCATCG CCGAGCACCT GACCGAACTG GGCGTGGGCG CCGCCCCCGG GGACGTGGTG ACCTCCGCGG AGGCCGCCGC CCGCCTGGTC GGCGAACACC ACCCCGCGGG TTCGGACGTA CTGGTGGTGG GCGACACCGC GCTGCGCCAG GCGGTGCGCA GGATGGGGCT GCGCCCGGTG TCGGTCGACA GCCCCTCGGT GGTGGCCGTG GTGCAGGGCT ACTCCCGGCA CATGACCCGC GACCTGCTCG ACCAGGGCAC GGTCGCGGTC CGGCGCGGCG CGTTCTACGT GGCCAGCAAC AACGACGCCA CCGCACCCAG CGAGTGGGGC CTGACCCCCG GCAACGGGTC CTTCGTCCGG GTCATCGCCA ACGCCACCGG GGTCGAACCC GTCGTCGCCG GAAAGCCCAT GCGCCCCCTG CACGAGGAGG GCATCCTGCG CACCGGAGCG CGCAACCCGC TGATCGTGGG CGACCGGCTG GACACCGACA TCGAGGGCGC GACCGCCCAC GGCGCGGCGG GGATGCTGGT GCTGACGGGG GTGGCCACCC CGATGGACGC CCTCGCCGCG CCCGAGCACC AGCGCCCCAG CTACCTGGCG TGGGACCTGT CGGGCATGAA CCACACGCAC CCGGCCGTCG TCCGCGAGGG CGACCGCACC CGCTGCGCGG GGTGGACGGT GACCGTCACC GGCGGCGCAC CACGCGTCGA GGGGGACGGG GACCGGCTGG ACGGGCTGCG CGCGCTGTGC GTCGCGGTCT GGGCGGACCG GGCGGCGGAC CCGTCCGGCC CGGCCGCACG CGAGGCGCTG TCCCGCCTGG GCTGGTGA
|
Protein sequence | MSLLAADRPL NEIHDAMLLD LDGVVYIGPR AVPAAPEAVG KARAAGARVA FVTNNAGRTP ARIAEHLTEL GVGAAPGDVV TSAEAAARLV GEHHPAGSDV LVVGDTALRQ AVRRMGLRPV SVDSPSVVAV VQGYSRHMTR DLLDQGTVAV RRGAFYVASN NDATAPSEWG LTPGNGSFVR VIANATGVEP VVAGKPMRPL HEEGILRTGA RNPLIVGDRL DTDIEGATAH GAAGMLVLTG VATPMDALAA PEHQRPSYLA WDLSGMNHTH PAVVREGDRT RCAGWTVTVT GGAPRVEGDG DRLDGLRALC VAVWADRAAD PSGPAAREAL SRLGW
|
| |