Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5408 |
Symbol | |
ID | 9249311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 586806 |
End bp | 588710 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Endonuclease/exonuclease/phosphatase |
Protein accession | YP_003683293 |
Protein GI | 297564320 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.521256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGGG ACCTGCTCGC CGCCGCCGTC GGCACCGTGC TGCTCCTGGA CGCCCTCCGG GTGTTCCTGC CGTCCCTGAT CACCGTGTTC GGCCAGGCCG GGACGACCGA TCCCGCGCTG ATGGGCGCGT TCGCGCTCGC CTGGTTCCTG GCCCCGTTCC CCTTCGTCCC CCTCGCCCGC CGGATCCCGC CCACGGCCAT GGCGGGCGCC GCCGTCGTGG TGATGGCGGC GGGGCGCCTG GCGCTCCAGG CCACCGAGGG CGGCGACCCG CAGCTGTACG TCTCCGCGGC CACGGTGGGC GCGGGGACCG TGTGGCTGGT GTGCACCGCG ATGGCGTCCG CCGTCGACGG ACGCCTGTGC GGACGCGAGG TGGTGACCGG GGTCGTCGCC GCGATCGCGG CCACCGCGGT CGTACGCTCC CTCTTCGGCG TGATCGACCT CGTGTGGTGG CCGACGCCCC TGGCGTGGGC GCCGGTCATC GCGGAGGTCG GGCTCCTCCT GCTCCTGCTG CTCAACGTCC GGGACACCGC CCCGGGGCCC GCCCGCCCCG TGCCGCCCCG GGCCTGGCTG GTCCTGGGCC CCCTGTTCTT CCTCAGCGGC CTGTACACCG CCAATCCGGC GGTGGGGCAG ACCCTCGCGG GGTCACCTCT GGGCGCGGCG GCCGTCGCCA CGGGCGCCGT GCTGTCGGTG GGCCTCGTCC GGCGTCCGCT CCTTCCGGGC CGGAGCCGGT GGGCGGCCCC CGTCGTGCTC CTGGCCGCGC TGGCGTGGCT GGCCTGGGCG TCCTCCGACG GCGTGACCCC GGAGGCGGCG GCCCTGCCGT CCGCGGCGGC GCTGGTCGCC GGTCAGCTCG CCCTGGCCGC CTGCGCGGGC CGGGCCGTGT TCGCCCGGCC CGGGCGGGCG TCGGCGGCGC GCAGCGGGCT GGCCGCCGCG TCGGGCCTGC TGGTGTTCGT GGTGCTGGTA TTCGCGTTCT ACTCCGCCTA CGACCTGTAC GTGCCCAACG CGTACGTGCC CTTCCTCGCC GCGGCCCTGC TGCTGCCCGC CGTGTCCTCC CCGGTTCTGG AGACCGGTGG GCCTCCGCGC CCTCGCGCAC TCGCGCTGAC CGCGGGGACG GCCGCGCTCA CCCTGGCGGC CACCGCCCTC TGGCCCGTGT TCGCGGCCCG CACGTTCGCA CCCGCACCCG CGAGCGGGAC CGAGGGGCTG CGCGTGGCCG CCTACAACGT GCGGATGGGC TTCGGCATGG ACGGCCGGTT CTCCGTCACC GAGCAGGCCG GGGCGCTGCG CCGCCTGGAC GCCGACGTCG TCGTCCTCAG CGAGGTGGAC CGGGGCTGGC TGCTCAACGG AGGCAACGAC GTGCTGTCCC GGCTCGCGCT CGAACTCGGC ATGGCCGCGC ACTGGGGACC GGCCGACGGG CCGCTGTGGG GCGACGCCGT GCTCACCTCC CTGCCGGTCA CCCGTGAGCG GCGGCACCCG CTCACGCCGA GCGGTCCCAC CGGGGCCCAG GCGCTGGAGG TGACCGTGGA CCACGGCGGT ACCGGGGTCA CGGTGGTCTC CACCCACGTG CAGCCCGCCG ACCGCGGCTT CCGCGCGGAG TCCTCCCGGC GCCAGCTGCG CGAGATCGCG GAGATCGCCC GCCGGGCGCG GGAGCGCGGA ACGCCCGTCG TGGTGGCGGG GGACCTCAAC ATCGAACCGG ACGACCCCGC GTGGGGCCTG CTCACCGAGC ACGGACTGCT CGACGCGTTC CGGAACACGC GTCCCTTCCC CACTCTGCCG GGGGAGACCG GCTCCGACCA GCAGATCGAC CACGTCCTGC ACACCGGGGA CCTGGCGCCG AGCGATCCGG CCAACCCGGA CGTGCCGCAC TCCGACCACC GGCCGGTGGC CGTCACGCTG ACCCCGGTCG CCTGA
|
Protein sequence | MRRDLLAAAV GTVLLLDALR VFLPSLITVF GQAGTTDPAL MGAFALAWFL APFPFVPLAR RIPPTAMAGA AVVVMAAGRL ALQATEGGDP QLYVSAATVG AGTVWLVCTA MASAVDGRLC GREVVTGVVA AIAATAVVRS LFGVIDLVWW PTPLAWAPVI AEVGLLLLLL LNVRDTAPGP ARPVPPRAWL VLGPLFFLSG LYTANPAVGQ TLAGSPLGAA AVATGAVLSV GLVRRPLLPG RSRWAAPVVL LAALAWLAWA SSDGVTPEAA ALPSAAALVA GQLALAACAG RAVFARPGRA SAARSGLAAA SGLLVFVVLV FAFYSAYDLY VPNAYVPFLA AALLLPAVSS PVLETGGPPR PRALALTAGT AALTLAATAL WPVFAARTFA PAPASGTEGL RVAAYNVRMG FGMDGRFSVT EQAGALRRLD ADVVVLSEVD RGWLLNGGND VLSRLALELG MAAHWGPADG PLWGDAVLTS LPVTRERRHP LTPSGPTGAQ ALEVTVDHGG TGVTVVSTHV QPADRGFRAE SSRRQLREIA EIARRARERG TPVVVAGDLN IEPDDPAWGL LTEHGLLDAF RNTRPFPTLP GETGSDQQID HVLHTGDLAP SDPANPDVPH SDHRPVAVTL TPVA
|
| |