Gene Information |
Locus tag | Ndas_4611 |
Symbol | |
ID | 9248492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5474405 |
End bp | 5476129 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF88 |
Protein accession | YP_003682503 |
Protein GI | 297563529 |
COG category | |
COG ID | |
TIGRFAM ID | |
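The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the numbers in this record) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5474405
end_bp = 5476129
gene_length_bp = 1725
protein_length_aa = 574


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both flags are True when the record's fields agree with each other.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Because the strand is "-", the coordinates refer to the forward strand of the replicon, while the length check itself is independent of strand.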
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0253473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.838732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCGCT GCGCGTTGTT CGTTGACGCG GGGTACCTTC TCGCGGACGG CGCGATGGCA GTGCACGGAA CCCGCAACCG GGACTCCGTC TCCTGGGACT ACACCGGTCT CGTCCAGTTC CTCAACGAGG TCGCGCGGGA CCGCACCGGC CTGCCGCTCC TGCGCTGCTA CTGGTACGAG GCGGTCGCCG ACGACCGGCG GACCCAGGAG CAGGACGGCA TCGCCGACAT CCCCGGTATC AAGTTCCGCG GCGCGCGGAT AAGGCCCGGC CGCCGCGAGG GCGTCGAGAG CTACGTCCAG CGCGACCTCA CCACGCTGGC CCGTACCGGC GTCCTGTGCG ACGCCGTCCT GGTCAGCGGG GACGAGGACA TGGCGCCGGT CGTGGCCGAC GTACAGGACA TGGGCGTGCG CGTCACGGTC GTGCACGTCT CCGTCGAGGG CAACTGGACC ATCTCCCGGG CCCTGCGCCG CGAGTGCGAC GACCTCATCG AGATCGGCGC GGGCCACCTG CGCCCGCACG TCAACCTGCT CTCCAGCGGC GGCGCCGCCC AGGAGACCGC TGCCAAGACC ACCACGCCGT TCTCCAACGG CCGCGCCCGC TCGGCGCCCG AGACCCCCAG GCAGCCCGTC GCGACGGCGC AGCCCGGCGG CAGCCGGGTG GAGGCGGCCT CCTCGTCCGC CGTGGGCCTG GAGGCGATGT TCGCCGGAAC CGGCGGACAG CCCGCCCCCC AGGGCAGCGC CATGGACCAG CTGCGCGCCA TGCGCAGGTC CCTCGCCCAG CAGCGCGGCG GCGGGGACCA CCTGGCCGGG CCGAACACGG ACAACCAGCA GTCCGGCCCG ATCGACGCCA ACGGGTTCCC CTCCGGCGGC CAGGTCCCGC CGAACGTGGG CGGCAACAGC GCCTATCCGG GCGCCTCGAT GACCGGGGGG CACCCGGCGC CGGGCGGCCA GGGGGGCTAC CCCGTCACCG GGGGGCACCA GTCGCTGCGC GGGGCCGACC AGACGGGCTA CGGGCTTCCG GCGAGACAGG CACCGCCCCC GCAGCAGCAC ATGACGGGGC CGCAGCAGTC CTTCGCCGGC GGGACCGGGC CACAGCAGTC CTTCGCCGGC GGGACCGGGC CGCAGCGCTC CTTCTCCCAG GAGGGGCAGT CCCTGCAGCC GGGCCAGCAC GGCCAGCCGG GCCAGCACGG CGGGCAGCCG GGGCACGGGC AACCGGGGCA CGCGCCACCG CAGAGGAGCG CCCCGCCCCA GGACCCCAGG TTCGGCCCGG GGGACAACCC CGGCTTCGGG GGTGCGAACG ACGCAAACCC TACCTATGGG ACCAGTACGG AAACTGACAC CGCCCAACGC GGTGGGGCGG CCCCCTCCTA CGGGGGACCG CAGCCTACTG GTCCCGAACA AGACAGCCGG TTCGCCGCCG GCGACTACCG GGAACCGCGT CCCAGTCCTG GATATCCCCA GCAGACGCGT CCCGCCACGC ATACCGTTGA CGAAGCGGTG CATGTTGCGC GCAAAGAAGG GAACGACTTC GCGGAGTCGA TCGCCCGCGA GGCGCCCGCC CTGTGGGTCG AGGCCGTCCT CGCCCGCAGG CCCCGGATGC CCTCCGACCT GGAGGCGCGT CTGCTCCAGG GTTCCTCGCT GCCGATCGAC CACCTGTTGC GTGACGAGGT CCGGGACGGA CTGCGCCAGG GGTTCTGGCA GGCACTGGAG CGTGCCAGGC CCTGA
|
Protein sequence | MDRCALFVDA GYLLADGAMA VHGTRNRDSV SWDYTGLVQF LNEVARDRTG LPLLRCYWYE AVADDRRTQE QDGIADIPGI KFRGARIRPG RREGVESYVQ RDLTTLARTG VLCDAVLVSG DEDMAPVVAD VQDMGVRVTV VHVSVEGNWT ISRALRRECD DLIEIGAGHL RPHVNLLSSG GAAQETAAKT TTPFSNGRAR SAPETPRQPV ATAQPGGSRV EAASSSAVGL EAMFAGTGGQ PAPQGSAMDQ LRAMRRSLAQ QRGGGDHLAG PNTDNQQSGP IDANGFPSGG QVPPNVGGNS AYPGASMTGG HPAPGGQGGY PVTGGHQSLR GADQTGYGLP ARQAPPPQQH MTGPQQSFAG GTGPQQSFAG GTGPQRSFSQ EGQSLQPGQH GQPGQHGGQP GHGQPGHAPP QRSAPPQDPR FGPGDNPGFG GANDANPTYG TSTETDTAQR GGAAPSYGGP QPTGPEQDSR FAAGDYREPR PSPGYPQQTR PATHTVDEAV HVARKEGNDF AESIAREAPA LWVEAVLARR PRMPSDLEAR LLQGSSLPID HLLRDEVRDG LRQGFWQALE RARP
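The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned up before analysis is given below; the helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA). The short strings used in the usage lines are the first two blocks and the final characters of the gene sequence in this record.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq_field: str) -> str:
    """Remove the block spacing and any line breaks from a sequence field."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True when the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Usage on short excerpts copied from the gene sequence above.
first_blocks = ungroup("GTGGACCGCT GCGCGTTGTT")
first_codon = first_blocks[:3]           # "GTG", read as Met (M) at the start position
tail_has_stop = ends_with_stop("CCTGA")  # the record's sequence ends in TGA
```

Applying `gc_fraction` to the full ungrouped gene sequence gives a value that can be compared with the GC content field of the record.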