Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4726 |
Symbol | |
ID | 9248608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5609595 |
End bp | 5610851 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682618 |
Protein GI | 297563644 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGAATCG TCGTCGCGGC TTACGCGGAC AAAGCTTATT TCTTTAGCAT GGTGCCGACG GCGTGGGCAC TGCGCGCCGC CGGGCACGAG GTCCGGATCG TCACCCAGCC CTCCATGACC GAGGCCGCGG CGGAGACGGG CCTGACCGTG GTCCCGGTCG GCGGCGACCA CACCCTGGCC GAGGTGCTGG CCCACGCCCG CGATCAGCAG GGCGAGTCGA TCTTCGACCT GGCCGAGGAG CGGCCGGAGA TGCTGGTGCC GGAGAAACTG CACCACGCCT ACGAGGAGTA CGTCACCTGG TGGTGGAAGC TCGTCAACGA GCCGATGGAG CGGGACCTGG TCGCCTTCTG CCGCGAGTGG CGCCCCGACC TCGTGCTGTG GGAGCCCAAC ACCTACTCCG CGGCGATCGC CGCGGAGGCG TGCGGCGCCG CGCACGGGCG GTTCCTGTGG AGCGTGGACC TCTTCTCGCG CATGCGGCGC CTGTACCTGG GGGCCGCCGA CGCCACCCCC GGACCCGACC CGCTCAGGAG CTGGCTGGAG GAGAGCGCGG ACCGGCACGG GGTGGCGTTC TCCGAGGACC TCGTACTCGG CCAGTTCTCC GTCCACCAGA TCCCGGAGGC GCTGCGGCCG CGCGAACTGG AGAAGACGGG CACCCACCTG AGCGTGCGCC CGGTTCCCTA CGCCGGAAGC GCCGTTCTGC CCTCCTGGGC ACGGGCCGGG TCCGAGCGGC GGCGCGTCCT GGTGGACTGG GGGTCCTGGA GCAGGACGGC CGAGGGCGCC GCCGCCCTGG TGGACGTCAT CGACGCCTGC GCCGAGATCG GCGCCGAGGC CGTCGTCCTC TCCCCCGCCT CCCGCAGGGA CTCCCTGCCC GCCCTGCCCG AGGACGTGGT GGTGACCGAC TCGGGGGCGG CCCACATGCT CATGGGCTCC GGCTCGCTGA TCGTCCACGG CGGCGGCTTC GACGTGTGCT GCAACGCCGT GGTCGAGGGA CTGCCGCAGC TGGTCGTGCT CAACACCGAG CAGTTCGACG CCGCTCCGCT CTCGCGGGCG CTCCGGGAGC GCGGGGCCGC GCGCGTACTG GCGGTGGAGG AGGTCCTCAC CCGGGGCGTG GACGACCTCC TCACGGAGCT CCTGGACAGC GGGGAGGTAC GCGCGGCGGC CGGGCTGGTG CGCGACGAGG CCCTGGCCGT GCCCGCCCCG GACCAGGTGG TGCCGGAGCT GGAGCGCATC GCGGCGGCCC GTGGCGGCGG CGTCTGA
|
Protein sequence | MRIVVAAYAD KAYFFSMVPT AWALRAAGHE VRIVTQPSMT EAAAETGLTV VPVGGDHTLA EVLAHARDQQ GESIFDLAEE RPEMLVPEKL HHAYEEYVTW WWKLVNEPME RDLVAFCREW RPDLVLWEPN TYSAAIAAEA CGAAHGRFLW SVDLFSRMRR LYLGAADATP GPDPLRSWLE ESADRHGVAF SEDLVLGQFS VHQIPEALRP RELEKTGTHL SVRPVPYAGS AVLPSWARAG SERRRVLVDW GSWSRTAEGA AALVDVIDAC AEIGAEAVVL SPASRRDSLP ALPEDVVVTD SGAAHMLMGS GSLIVHGGGF DVCCNAVVEG LPQLVVLNTE QFDAAPLSRA LRERGAARVL AVEEVLTRGV DDLLTELLDS GEVRAAAGLV RDEALAVPAP DQVVPELERI AAARGGGV
|
| |