Gene Information |
Locus tag | Ndas_4788 |
Symbol | |
ID | 9248671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5674811 |
End bp | 5676037 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003682678 |
Protein GI | 297563704 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
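The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record for Ndas_4788.
length_bp = gene_length(5674811, 5676037)
print(length_bp)                  # 1227
print(protein_length(length_bp))  # 408
```

Both results match the Gene Length (1227 bp) and Protein Length (408 aa) fields.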
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.123096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGAGTT CGCGGGCGCG GCGGCGCGGG CCGGGCCTCC GACCGGCCAG GTTGTCCGTC TCCGACCGGC TGCGGACCGG GGCGAGCGGC CTGCGTGCCC GGCCGACGCG GGTGGTGCTG TCCGCCCTGG GCATCGCCAT CGGGATCGCG GCCATGGTCG CGGTGGTGGG CGTCTCGGAG TCCGGCAGGG CCGAGCTGGA CGCCCGGATC GGACGCCTGG GCACCAACAT GCTGACCGTC GCCCCGGGCA GCGACCTGTT CGGCGGCACC GCCGTGCTCC CGCCCGAGGC CAAGGGCCGG ATCGACCGTA TGCCGGAGGT GGAGCGCTCC GCCCAGGTGG AGATGGTGAA GGGGGCGGGC GTCTACCGGA GCGACCTCGT CCCCGAGGGC GAGTCCGGGG GGATCGCCGC GTACGGCGTG GAACCGGGCC TCCTGGACAC ACTGCGCGCC CGGGTGGACG AAGGCGTGTG GCTGAACCCG GCCACCACCG ACCACCCGTC GGTCGTGCTG GGGCGGGACG CGGCCGCGAG GCTGGGCGTT ACCCGGGTCA CCCCGGACAC CCTGGTGCTG GTCGGGGACG AGTACTTCGC CGTCGTCGGC ATCCTGGACG CGGTGGAACT GGCCCCCGAG CTGGACAACG CGGTGCTGGT CGGCCAGGAG GTGGCCGAGA GCCTGCTCGG CGCTCGCGGT GAGGCCTCCA CGATCTACGT GCGCCTGGCC CCCGATCGGG TCGCGGACGC ACGATCGCTG GTCGGGCGCA ACGCCAACCC GGAAAACCCC AACGAGGTGA GGGTGTCGCG CCCCTCGGAC GCGCTGGAGG CGCAGCGGGC CGCCGACCAG ACCCTCAACG GGCTGCTGCT GGGACTGGGC GGGATCTCCC TGCTGGTGGG CGGGGTGGGG GTGGCCAACA CCATGGTCAT CTCGGTCCTG GAACGTCGCG GGGAGATCGG GCTGCGCCGG GCCCTGGGCG CCACACGCCG CGACATCCGG ACGCAGTTCC TGGTCGAGGC GGTCGTGCTC TCGGCCCTGG GCGGCGCGGC CGGGAGCGTG CTCGGCGTCC TGACGACGCT GGTCTACGCG GTCCTGCGGA GCTGGCCGTT CGCCGTGCCC TGGTGGGCGG GGGCCGGGGC CCTGGCGGCG ACGGTCGTCA TCGGTGCGGT GGCTGGCCTG GTGCCCGCGC TGCGCGCGGC GGCGCAGCAC CCGACCGAGG CGCTCGGTTC CGCCTGA
|
Protein sequence | MRSSRARRRG PGLRPARLSV SDRLRTGASG LRARPTRVVL SALGIAIGIA AMVAVVGVSE SGRAELDARI GRLGTNMLTV APGSDLFGGT AVLPPEAKGR IDRMPEVERS AQVEMVKGAG VYRSDLVPEG ESGGIAAYGV EPGLLDTLRA RVDEGVWLNP ATTDHPSVVL GRDAAARLGV TRVTPDTLVL VGDEYFAVVG ILDAVELAPE LDNAVLVGQE VAESLLGARG EASTIYVRLA PDRVADARSL VGRNANPENP NEVRVSRPSD ALEAQRAADQ TLNGLLLGLG GISLLVGGVG VANTMVISVL ERRGEIGLRR ALGATRRDIR TQFLVEAVVL SALGGAAGSV LGVLTTLVYA VLRSWPFAVP WWAGAGALAA TVVIGAVAGL VPALRAAAQH PTEALGSA
|
| |
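The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: an alternative start codon at the first position is read as methionine. The sketch below illustrates this and shows one way to compute a GC fraction from a sequence written in 10-character blocks like the one above. It is a minimal illustration under stated assumptions, not the pipeline that produced this record. The helper names are hypothetical, the start-codon set lists only the common ones (the table defines further, rarer ones), and ambiguous bases such as N are not handled.

```python
# Standard codon assignments, which translation table 11 shares for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only the common bacterial start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a start codon at position 0 as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("GTGCGGAGTT CGCGGGCGCG GCGGCGCGGG"))  # MRSSRARRRG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and protein sequence fields to be compared in the same way.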