Gene Information |
Locus tag | Ndas_1398 |
Symbol | |
ID | 9245248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1712806 |
End bp | 1713912 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_003679336 |
Protein GI | 297560362 |
COG category | |
COG ID | |
TIGRFAM ID | |
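The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1712806
end_bp = 1713912

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1107 0 368
```

Under these assumptions the result agrees with the reported gene length of 1107 bp and protein length of 368 aa.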
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.509319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000513734 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGAACACA CCGAGACACG TCCCCGCCGC TCCGTACCGC CCCTCCTGCT CGCCGCCGGA CTGGTGGTGA TGTGGAGCTC CGGCTTCGTC GGCGCGGATC TGGGCACCCG GTACGCGCCC GCGACCACGT TGCTGGCCTG GCGTTTCCTG GTCGTGGCCG CCCTGCTGGC GGGGTGGTGG CTGTGGCGGG GCCCGCGGAT GTCCCGGCGG GACCTGGCGG CGCACGCCGT GCTGGGCCTG CTGGCCCAGT CCGGGTACCT GTACGGGGTG TTCGCCGCCG CGCAGGCGGG CGTGGCCGCC GGGACGAGCG CGCTGGTGGC CGCCCTCCAG CCCCTGGTGG CGACCGCCCT GGCCGTCCCG CTGCTGGGCG AACGGGTGCG GCCGCGCCAG CTGGCGGGCC TCGCGCTGGG GCTGGGCGGG GTCGGCCTGG TGGTGGGCGC GGACCTGTTC CGGCCGGGCG CGGCGCCGTG GTGGGGCTAC CTGCTGCCGT TCGGGGCGAT GCTGTCCCTG GTGGCCGCGA CCCTGCTGGA GCGCCGCGCG CGGCCCGGGG GCTCGGTGGT GCAGGCCCTG GCGGTGCAGT GCGCGGTGAG CGCGGTGCTG TTCACGGGGC TGGCCGCGGT CACCGGGACG CTGGCGCCGC CCGCCGACCC CGGGTTCTGG GCGGCGGTGG TGTGGGTGGT GGTGCTGTCC ACCCTGGGCG GCTACGGCCT GTACTGGGCC GTCCTGGCCC GCTCGGGCGT GGCCCGGGTG TCGGCCCTGC TGTACCTGAC CCCGCCCACC ACGCTGGTGT GGTCGTGGCT GATGTTCGGC GATCCCGTGG GGCCAGCCGC CCTGGCGGGG ATGGCGGTGT GCGCGGTGGC CGTGGTGCTG GTGAGCACCG GGGGAACCGG GAGCCGGGCC GCCCGGACGG ACGCGAAGGC TACCGGCGCT CCCGACCGGG AACCGAAACG CACGGCCGGT TCCACCACGA GCACCCCCGA CCGGCTACCG GGGCGACGGG CCGAGGCCGC CGCCGCTGCT CCCGCCCGGG CCCCGGAACG CGGCACCGGG GGCGCTACGG ACACTCCCGG GCCGGAAGCG GACCGACCCC GTCCGAGGAG ACGGTGA
Protein sequence | MEHTETRPRR SVPPLLLAAG LVVMWSSGFV GADLGTRYAP ATTLLAWRFL VVAALLAGWW LWRGPRMSRR DLAAHAVLGL LAQSGYLYGV FAAAQAGVAA GTSALVAALQ PLVATALAVP LLGERVRPRQ LAGLALGLGG VGLVVGADLF RPGAAPWWGY LLPFGAMLSL VAATLLERRA RPGGSVVQAL AVQCAVSAVL FTGLAAVTGT LAPPADPGFW AAVVWVVVLS TLGGYGLYWA VLARSGVARV SALLYLTPPT TLVWSWLMFG DPVGPAALAG MAVCAVAVVL VSTGGTGSRA ARTDAKATGA PDREPKRTAG STTSTPDRLP GRRAEAAAAA PARAPERGTG GATDTPGPEA DRPRPRRR
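The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The helpers below are a hedged sketch of how one might strip the spacing, compute a GC percentage to compare with the reported GC content, and obtain the reverse complement (relevant because the record lists the strand as "-"). The function names are hypothetical, and the database may compute or round its GC figure differently.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Illustration on the first two blocks of the Gene sequence field only.
first_blocks = "ATGGAACACA CCGAGACACG"
print(len(clean(first_blocks)))          # 20
print(gc_percent(first_blocks))          # 55.0
print(reverse_complement("ATGGAACACA"))  # TGTGTTCCAT
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the 78% reported in the record.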