Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0638 |
Symbol | |
ID | 9244480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 784924 |
End bp | 786264 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF323 |
Protein accession | YP_003678590 |
Protein GI | 297559616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.749121 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCACG AACAGGAGCG CGCCAAGGAG CGCATCGCCG CCGAACTCGA CCGGGCGAGG GACCGCAGCA ACGGGCTGAC CCTCCACGCC CTCGACGAGG GGGAGCTGCT CGCCCAGCAC TCGCCCCTGA TGTCGCCGCT GGTCTGGGAC CTGGCGCACG TGGGCAACTA CGAGGAGCAG TGGCTGCTGC GCGCCGCGGG GGGCCGTGAG GCGCTGCGCC CCGACATCGA CACCCTCTAC GACGCCTTCG AGAACCCGCG CGCCGAACGG GTCAGCCTCC CGCTGCTGCG GCCCGAGGAG GCCCGCGACT ACAACGCGCG CGTGCGCAGG GAGGTGCTCG ACGCGCTGGA GTCCGCCGAC CTCACCCGGG TGGAGACCGG CCCGGACGGC GAACGCTCCC TGCTGGACGC CGGGTTCGTC TTCCACATGG TCATCCAGCA CGAGCACCAG CACGGCGAGA CCATGCTCGC CACCCACCAG CTCCGCAAGG GCGAGCCCGT CCTGCTGGAG GAGGCCGCGC CCGTCACCAC GCTGCGCCCG CCGGTCCGGG ACGAGGTGTT CGTGCCCGAG GGGCCGTTCA CCATGGGCAC CGACGACGAC CCCTGGGCCT ACGACAACGA GCGCCCCGCG CGCACCGTCG ACCTGGGCCC CTACTGGATC GACACCGCCC TCGTGACCAA CGCCGCCTAC CAGGAGTTCA TGGACGACGG CGGCTACCAG ACCCGCCGCT GGTGGACCCG CGACGGCTGG GAGTGGAAGG AGAAGCGGGG AGCGGTCTCC CCTGCCTTCT GGACCCGGGA GGGGACCGGG TGGTCGCGCC GCCGGTTCGG CCGCCAGGAG ATGGTGCCCC CCGACGAGCC CGTGCAGCAT GTGTGCTTCC ACGAGGCCCG GGCCTACGCC GCCTGGGCGG GCAAGCGCCT GCCGAGCGAG CCCGAGTGGG AGAAGGCCGC GCGCTTCGAC CCGGTCAGCG GCCGGTCCCG GCGCTACCCG TGGGGCGACA CCGATCCCGG ACCCGGCCAC GCCAACCTGG GCCAGCGCAG GCTGGGCCCC TCACCCGCCG GGCGCCACCC CGACGGCGCC TCGCCGCTGG GCGTCCAGCA GCTGGTCGGC GACGTGTGGG AGTGGACCTC CACCACCTTC ACCGGCTACC CGGGGTTCCG CGCCTTCCCG TACGAGGACT ACTCGGAGGT GTTCTTCGAC GACGGGTACA AGGTGCTGCG GGGCGGCTCC TGGGCCACCC ACCCCACCGC GGCCCGCGCC ACGTTCCGCA ACTGGGACCA CCCGATCCGC CGGCAGATCT TCAGCGGTTT CCGCTGCGCG CGCGACGCGG AACCCGCCTG A
|
Protein sequence | MNHEQERAKE RIAAELDRAR DRSNGLTLHA LDEGELLAQH SPLMSPLVWD LAHVGNYEEQ WLLRAAGGRE ALRPDIDTLY DAFENPRAER VSLPLLRPEE ARDYNARVRR EVLDALESAD LTRVETGPDG ERSLLDAGFV FHMVIQHEHQ HGETMLATHQ LRKGEPVLLE EAAPVTTLRP PVRDEVFVPE GPFTMGTDDD PWAYDNERPA RTVDLGPYWI DTALVTNAAY QEFMDDGGYQ TRRWWTRDGW EWKEKRGAVS PAFWTREGTG WSRRRFGRQE MVPPDEPVQH VCFHEARAYA AWAGKRLPSE PEWEKAARFD PVSGRSRRYP WGDTDPGPGH ANLGQRRLGP SPAGRHPDGA SPLGVQQLVG DVWEWTSTTF TGYPGFRAFP YEDYSEVFFD DGYKVLRGGS WATHPTAARA TFRNWDHPIR RQIFSGFRCA RDAEPA
|
| |