Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3767 |
Symbol | |
ID | 9247636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4525648 |
End bp | 4526637 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | LPPG domain protein containing protein |
Protein accession | YP_003681671 |
Protein GI | 297562697 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATAG TCGTACCGGC TGGCGGTATC GGCGGCGCGC GGTTCCTCCG CGGCCTCAGG GCGGCGCTCG GAACCGTGGC GTCGGCCACC GAAGAGCACA CCACAGACAG CAGCGTCACC GTCATCGGCA ACACCGGTGA CGACATCACC CTGTTCGGGC TGAGAATCTG TCCGGACCTG GACACGGTGA TGTACACCCT CGGCGGGGGG ATCAACGAGG AGCAGGGCTG GGGCCGGACC GACGAGTCCT TCACCGTCAA GGAGGAGCTC GCCGCCTACG GCATGAAGCC CACCTGGTTC GGTCTGGGCG ACCGCGACAC CGCCACCCAC ATCGTGCGGA CGCAGATGCT CGCCGCCGGA TTCCCCCTCT CGGCGGTCAC CGAGGCGCTG TGCGACCGCT GGCGCCCGGG CGTGCGGCTG CTGCCGATGA CCGACGACCA GGTCGAGACG CACGTGGTCG TCGACGACGG CGGGGGGCGC CGGGCCATCC ACTTCCAGGA GTGGTGGATC CGGCACCGCG CGGCGCTGCC CGCCGAACAG ATCGTCAGCG TCGGCGCCGA GACGGCCAAG CCCGCACCCG GTGTGCTGGA GGCGATCGCC GATGCCGACG CCGTGATCCT GCCCCCGTCC AACCCGGTGG TCAGCGTGGG CTCCGTCCTG GGCGTCCCCG GGATCCGCGA GGCCATGGTG GCCAAGACCG TGGTGGGCGT CTCCCCCATC ATCGGCGGAG CGCCGGTGCG CGGCATGGCC GACGCCTGCC TGACCGCGAT CGGGGTGGAG ACCAGCGCGG GCGCGGTCGC CGCGCACCTG GGCGCCGACC TGCTGGACGG CTGGCTGGTG GACGAGGCCG ACGCGGGCAC CGAGGTGGAG GGGATCGAGG TGCGCTCGCG TCCCCTGTAC ATGAGCGACC CCGCCGCCAC CGAGGCGCTC GCACGGGCGG CCGTGGACCT GGCCGTGGAG CTGGCGCAGA GGCGGGGAGG CGGCGAGTGA
|
Protein sequence | MRIVVPAGGI GGARFLRGLR AALGTVASAT EEHTTDSSVT VIGNTGDDIT LFGLRICPDL DTVMYTLGGG INEEQGWGRT DESFTVKEEL AAYGMKPTWF GLGDRDTATH IVRTQMLAAG FPLSAVTEAL CDRWRPGVRL LPMTDDQVET HVVVDDGGGR RAIHFQEWWI RHRAALPAEQ IVSVGAETAK PAPGVLEAIA DADAVILPPS NPVVSVGSVL GVPGIREAMV AKTVVGVSPI IGGAPVRGMA DACLTAIGVE TSAGAVAAHL GADLLDGWLV DEADAGTEVE GIEVRSRPLY MSDPAATEAL ARAAVDLAVE LAQRRGGGE
|
| |