Gene Information |
Locus tag | Ndas_3699 |
Symbol | |
ID | 9247568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4437326 |
End bp | 4438735 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | metal dependent phosphohydrolase |
Protein accession | YP_003681603 |
Protein GI | 297562629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
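The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4437326, 4438735)
print(length)                  # 1410
print(protein_length(length))  # 469
```

Under these assumptions both results agree with the Gene Length and Protein Length rows of the record.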
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.849083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCATGG GTCCCTACTC GGGCCTGGAT CTGCGCACAC TGGTCCTGAT CGCGATTCTC TTCGTGGTCG CGGAGTCGAT CAGTACGGCG GTCCTGACGG GCGTCAAGGC GGGGATCTCG CCGAGTTCGG CGGTCTCCCT GGCCGCCGTC GTCCTGGTCG GACCGGTCGG GGCGGCGGTG GTCGGTTTCG CCTCCTGCTT CGCCATCCGC AGGCAGAACC TCGTCAAGAG GGCGTTCAAC GGAGCACAGT TCGCACTGGC GGCCTACGCG GCCGGACACG TCTACCTTCT GCTCGGCGGA ACGGTGGGAG AACCCGGCCG GGAGGACTTC CCGGCCATCG TCCTCCCCTA CGTCGCCGCC GTCCTGACCC ACTCCCTGGT CAACTCGGTG CTCGTGGCCG TCCTGATGTG GACCCTGGGC GTCGTGCGCC CCGGCGGCTC GCGTGGCCGG TTCGACTGGC GTCCGCTGGT GGGGGTCGAG CTGTCCTCCA TGGGCTACCA GATGCTCGGC CTGGCCATCG CCGCGGTCTG GGGCGGGGTC GGCCTCATCG CGCCGCTGCT GGTGCTGCTC CTGCTGTTCA TCGCGCGCTG GACCTTCGCC CAGCAGCTGG ACGAGGCGCG CGCGCACGAG GCCACCCTGG CCACCCTGTG CCAGGCGGTG GAGACCAAGG ACTACTACAC CCGGGGCCAC TGCATGCGCG TGGCCGAGGG CGCCGCCATG ATCGCACGCG AACTGGGCAT GCCCGCCGAC CGGGTGCAGA AGATCCGCTA CGCCGGGATG CTGCACGACA TCGGCAAGCT CGGAGTGCCC ACCAAGGTGC TGCAGAAGAC CGGCAAGCTC ACCGACGACG AGTACGCGGC CATCAAGCTG CACCCCACGC GCGGCTACGA GATCGTGCGG GAGATCAGCT TCCTGGACGA GGCGCTGGCC GGGATCCGGC ACCACCACGA GCGCCTGGAC GGGCGCGGGT ACCCCATGGG CCTGGTCGGG ATGGAGATCC CCGAGTCCGC GCGCATCATC AGCGTCGCCG ACGTCTTCGA CTGCCTCACC TCCACGCGCT CCTACCGCAG GGCGTGGTCG GTGGAGGACG CCGTCGCCGA ACTGCGGGCC TGCGCCGGGA CCCAGTTCGA CCCCAGGATG GTCGAGGCGC TCGTGCGCGC CGTCGAACGC GAGGGCTGGG AGACACCCGA CATCGCCGAG CCTCCGGGCG GCTGCTACGG GTCGGAGGCG GCCGAACGCT CCGTGGAGGG GTCCGGCCGA GCCGGGGAAC CGGAGGAGGT GCCCGCCGAC GCGTCCGCCG AGGAGAGCGC CGCCGGGGAG CCGTCCTCCG CAGGGGCGGC GGCCGACGAG GCCGCCTCCG CGGACGCCAC GGCCTCCGCC GAGGCAGTGA CGGGCGGGCG TGAACGATGA
|
Protein sequence | MLMGPYSGLD LRTLVLIAIL FVVAESISTA VLTGVKAGIS PSSAVSLAAV VLVGPVGAAV VGFASCFAIR RQNLVKRAFN GAQFALAAYA AGHVYLLLGG TVGEPGREDF PAIVLPYVAA VLTHSLVNSV LVAVLMWTLG VVRPGGSRGR FDWRPLVGVE LSSMGYQMLG LAIAAVWGGV GLIAPLLVLL LLFIARWTFA QQLDEARAHE ATLATLCQAV ETKDYYTRGH CMRVAEGAAM IARELGMPAD RVQKIRYAGM LHDIGKLGVP TKVLQKTGKL TDDEYAAIKL HPTRGYEIVR EISFLDEALA GIRHHHERLD GRGYPMGLVG MEIPESARII SVADVFDCLT STRSYRRAWS VEDAVAELRA CAGTQFDPRM VEALVRAVER EGWETPDIAE PPGGCYGSEA AERSVEGSGR AGEPEEVPAD ASAEESAAGE PSSAGAAADE AASADATASA EAVTGGRER
|
| |
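The gene sequence above begins with ATG and its first codons match the start of the protein sequence, so it appears to be listed in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to compute GC content and translate such a sequence. It assumes the 10-base blocks are separated only by whitespace, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and that is not modeled here). All names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCTCATGG GTCCCTACTC GGGCCTGGAT"
print(translate(first_blocks))  # MLMGPYSGLD
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content row.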