Gene Information |
Locus tag | Ndas_4715 |
Symbol | |
ID | 9248597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5596657 |
End bp | 5597916 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682607 |
Protein GI | 297563633 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
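The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 5596657
END_BP = 5597916


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1260 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields, which is what one would expect for an unspliced CDS on a single strand.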
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.683026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.640764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGTGC TGATCGCGAC GCAGGCGGAG CGGACCCACT TCCTGGGGCT GGTGCCCCTG GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTCCGGGTGG CCAGCCAGCC CGAACTGGAG GCGGTGGTCA CCGGGACGGG CCTGCCCTTC TCCCCCGTGG GCAGGGACCA CCTCCTGCGC AGGGTCATGC GGCAGTACCA CGCGATGACC GGCGGGGAGG ACGACGACTT CGACATGGCC GAGGACCGTG ACGAGGTCCT GACCTGGGAC TACCTCCTGG AGGGCTACCG CCTGACCGTG CAGTGGTGGT GGCGGATGGT CAACGACCCC ATGGTCGACG ACCTGGTCGC CCTCTGCCGC GAGTGGCGCC CCCACCTGGT CGTGTGGGAG CCCATCACCT TCTCCGGGGC GATCGCCGCC GAGGCCTGCG GGGCCGCGCA CGTGCGCTAC CTGTGGGGGG CCGACATCTT CGCCCGCACC CGCGCGCGCT TCCTGGCGCG GATGGGCGAA CAGCCCGCCT CACGGCGCGA GGACCCCCTG GCCGCGTGGC TGGGGACCAG GGCGGCCCGG TACGGGGTGA ACTTCTCCGA GACCCTGGTC CACGGCCAGG CCACCGTCGA GCAGGTCCCC GCGTCCCTGC GGGTGGACAC GCCCGCGCAC CTGGAGTACC TGCCGGTGCG CTACGTGCCC TACAACGGAC GCGCCGTCGT CCCCCACTGG CTGCGCACAC AACCCGACCG CCCCCGGATC GGACTCAGCC TCGGGACCAG CGCGAACGAG TGGTACGGCG GTCACCGGGT CTCCGCCGGG GAGATCCTGG AGGGTCTGGC CGAGCTGGAC GTGGAGGTGG TGGCCACCCT GCCCGCCAGT GAGCAGGCCA AGCTCGGCGC CGTCCCCGGC AACGCCCGCC TGGTCGAGTA CGTCCCCCTG CACGCCCTGG CCCCCACCTG CGCCGCCATG GTCACCCACG GCGGCCCCGG CACCGTCCTG ACCGGCCTCG CCCACGGAGT CCCCCAACTC CTGTCACCCA ACGCGCACAT GTTCGACACG GTCCTGCTGT CCGGGCTGGT GGAGGAGGCC GGGGCGGGCA GGGTCGTGGA CCCCGACCGC CTGGACGCCG CCACCGTCGC CGCAGGCGTG CGCACCCTCC TGGAGGACCC CCGCCACACA AGCGCCGCCC GCGCCCTGCG CGCACGCATG GACGCCATGC CCACCCCCGC CGACCTCGCC CACACCCTCG CCGGCCTCAC CCGCACCTGA
|
Protein sequence | MRVLIATQAE RTHFLGLVPL AWALRAAGHE VRVASQPELE AVVTGTGLPF SPVGRDHLLR RVMRQYHAMT GGEDDDFDMA EDRDEVLTWD YLLEGYRLTV QWWWRMVNDP MVDDLVALCR EWRPHLVVWE PITFSGAIAA EACGAAHVRY LWGADIFART RARFLARMGE QPASRREDPL AAWLGTRAAR YGVNFSETLV HGQATVEQVP ASLRVDTPAH LEYLPVRYVP YNGRAVVPHW LRTQPDRPRI GLSLGTSANE WYGGHRVSAG EILEGLAELD VEVVATLPAS EQAKLGAVPG NARLVEYVPL HALAPTCAAM VTHGGPGTVL TGLAHGVPQL LSPNAHMFDT VLLSGLVEEA GAGRVVDPDR LDAATVAAGV RTLLEDPRHT SAARALRARM DAMPTPADLA HTLAGLTRT
|
| |
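The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, together with a GC fraction calculation and a basic sanity check for a CDS. The helper names are hypothetical, and the start codon set is an assumption: it lists only ATG, GTG and TTG, the most common initiators under translation table 11, not the full set.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common table 11 initiators are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def normalize(seq: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts with a
    listed start codon and ends with a stop codon."""
    seq = normalize(seq)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First and last blocks of the gene sequence shown above.
first_block = "GTGCGCGTGC"
last_block = "CCGCACCTGA"
print(normalize(first_block)[:3] in START_CODONS)  # GTG start codon
print(normalize(last_block)[-3:] in STOP_CODONS)   # TGA stop codon
```

Passing the complete Gene sequence field to `gc_fraction` should give a value comparable to the GC content field, and `looks_like_cds` should accept it, since the sequence begins with GTG and ends with TGA.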