Gene Information |
Locus tag | Ndas_1049 |
Symbol | |
ID | 9244895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1294487 |
End bp | 1295800 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003678998 |
Protein GI | 297560024 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
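The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and assumes that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1294487, 1295800)
print(length_bp)                  # 1314
print(protein_length(length_bp))  # 437
```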
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.340866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTAC CCCCGGGCCG AGTGCGCCGA CACGGCCGCC GCGGCACCGC CCTGTCCGCG ACCGCCGCGG CCGCGGCGAC GGTCCTGCTC GCCTCCGCCT GCTCGGGCTC CGACGACGGC ACCGTCGAGC TGCGCTTCTC CTGGTGGGGC TCCAACGAGC GCCAGGCCAC CATGCTCCAG GTCATCGAGA ACTTCGAGGC GGACAACCCC GACATCCGGA TCACGGCGGA GACCACCGAC TGGTCCGCCT ACTGGGACCG CCTGGCCACC ACCACCGCGG CCAACGACTC CCCGGACGTC CTCATGCAGG AGGAGCGCTA CCTGCGCGAG TACGCCGACC GCGGCGCCCT GCTCGACCTG GGCGAGGCCG AGGGCCTGGA CCTGTCGCTG ATCGACCCGC TGGTCGCCGA GAGCGGCCAG CTGGACGGGC AGACCTTCGG CGCGGCCAGC GGCGTCAACG CCTACTCCAT CCACGCCGAC CCCGAGGCCT TCGCCGCCGC GGGGGTGGAG ATGCCCGACG ACGACACCTG GACCTGGGCG GACTACGTCG AGATCGCCGG GCAGATCAGC GAGGGCACCG GCGGCGAGAT CGCCGGCGCC CAGAGCATGA GCTACAACGA GGCCGGTTTC CAGGTCTTCG CCCGCCAGCA CGGGGAGGCG CTCTACAACG AGGACGGCAG CCTCGGCTTC TCCCAGGAGA CCCTGGAGGC CTGGTACGAG ATCACCCAGG ACCTGCTGGA GAACGGCGGC CAGCCCAGCG CGGCCCGGAG CGTGGAGATC CAGGCGGGCG GCATCGACCA GTCGGTCGTG GCCACCGGCG AGGGCGCCAT GGCGCACTTC TGGAGCAACC AGCTCGGCAA CGTGGTCGAG GCCTCCGGGC GCGAGATCCA GCTCCTGCGC TACCCCGGGG AGACCGAGTT CGACCGGACC GGCCTGTTCT TCAAACCGGC CATGTTCTAC TCGATCTCCG CGGGCTCCGA GCACCCCGCG GAGGCGGCCC GCTTCGTCGA CTACATGCTC AACGACCCGG CGGCGTCCGA GCTGCTCCTG GCCGACCTGG GCCTGCCCGC CAACACCGAG GTCCGCGAGG CCATCCTCGA CGACCTGCCC GAGTCCGACG CCCGGATGGC CGAGTTCATG GGCGAGATCG AGGGAACGAT CGTGGACGGC AACCCGCCCG CGCCGATCGG CGCCGGTCAG GTCGTGGACA TCAGCAGCCG CGTCAGCGAC GGGCTCGCCT TCGGCGACCT CACCCCGGCC GAGGCCGCCG AACAGTTCAT GACCGAGGTC GAGGCGGCCA TCGAGACCTC CTGA
|
Protein sequence | MRVPPGRVRR HGRRGTALSA TAAAAATVLL ASACSGSDDG TVELRFSWWG SNERQATMLQ VIENFEADNP DIRITAETTD WSAYWDRLAT TTAANDSPDV LMQEERYLRE YADRGALLDL GEAEGLDLSL IDPLVAESGQ LDGQTFGAAS GVNAYSIHAD PEAFAAAGVE MPDDDTWTWA DYVEIAGQIS EGTGGEIAGA QSMSYNEAGF QVFARQHGEA LYNEDGSLGF SQETLEAWYE ITQDLLENGG QPSAARSVEI QAGGIDQSVV ATGEGAMAHF WSNQLGNVVE ASGREIQLLR YPGETEFDRT GLFFKPAMFY SISAGSEHPA EAARFVDYML NDPAASELLL ADLGLPANTE VREAILDDLP ESDARMAEFM GEIEGTIVDG NPPAPIGAGQ VVDISSRVSD GLAFGDLTPA EAAEQFMTEV EAAIETS
|
| |
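The gene and protein sequences above can be cross-checked programmatically. The following is a minimal sketch, not the database's own tooling: it strips the 10-character block spacing, computes GC content, and translates codons using the amino-acid assignments that NCBI translation table 11 shares with the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI's compact ordering (T, C, A, G at each position).
# Tables 1 and 11 share these assignments; they differ only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGCGTAC CCCCGGGCCG AGTGCGCCGA"
print(translate(first_blocks))  # MRVPPGRVRR
```

Applying `translate` to the full gene sequence and comparing the result with the listed protein sequence is a quick integrity check for a copied record.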