Gene Information |
Locus tag | Ndas_4930 |
Symbol | |
ID | 9248817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 65582 |
End bp | 66841 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003682819 |
Protein GI | 297563846 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
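The start, end, and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are ours, not stated in the record; the variable names are illustrative.

```python
# Values copied from the Gene Information record.
start_bp = 65582
end_bp = 66841

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the terminal stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1260 419
```

Under these assumptions the computed values agree with the "Gene Length" (1260 bp) and "Protein Length" (419 aa) fields.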
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.265943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAC ACCTGTACCG ACACCCGCAC CGCCCGCTAC GAACGGCGTC GGCGGCCGTC TGCGCCGCGG CGCTCGCCCT GACCGGAGTC GGCTGCGCGG CCGAACGCGA CCCGAGCGTG TACACCATCA TGGACTCCAG CACCGACGAG CCCTACCACA CCTGGGACCA GGAGGCGATG GACGCCTGCG GGGAACAGGT GGGCGTGGAG GTGCGCCACA TCAGCGTGCC CGCCGACCAG CTCGTGCCCA AGGCCCTGCG GATGGCCTCC TCCGACTCGC TCCCCGACGT GCTCAACCTC GACGGCTCGG ACCTGCCGCA GTTCGCGGCG GCCGGGGGCC TGGTACCGCT GGAGGAACTC GGCATACCCA CGGACGGCCT GTCCGAGGGC GCGCTGTCCA TCGGCAGCTA CGAGGGCGTG TACTACGGCG CCGCCCGGTC GGTGAACTCC CTCGGCCTCT TCTACAACGC GGCCGCCCTG GAGGAGGCGG GCATCGACCC GCCCCGGACC TGGGCGGAGC TGGAGGAGGC CGCCGCCGAG CTGACCGGGG GCGGGCGCTA CGGCCTGGCG ATCAGCGCCC TGGCCACCGA GGACGGCGTC TACCAGTTCC TGCCGTGGCT GTGGTCCAAC GGCGGCGACG AGAGGGAGCT GGCCTCACCG GAGTCGGTGG AGGCACTGGA GTACGTCACC TCGCTGGTCG AGGCGGGATC GGTCTCCCCC TCGGTGGTCA ACTGGACCCA GGCCGACGTC AACGACCAGT TCATCGCGGG CAACGCGGCC ATGATGGTGA ACGGCCCCTG GCAGCTGCCG GTCCTCCAGG AGCACCCCGA CCTGGAGTGG GCCGTGGCGG AGATCCCCGT GCCCGAGGCC GGGGACACCT CGGTGGCTCC GATCGGCGGG ACCACCTTCA CCGTGCCGGT CAACGCCGAG GACCCCGACC GCGAGCGCGT GGCCGCCGAA CTCGTGGCCT GCCTGACCAC GGCCGAGGCG CAGCTGGACT GGTCCACCAA GGGCAGCAAC GTGCCCGTGG ACACCGGGGC GGCCGAGCAG TACCGCGACC TGGTCCCGGA ACTGGCCCCG TTCGTGGACC AGGTGCGCAC GGCGCGCAGC CGGACCGAGC ACGCCGGCAC CGAGTGGAAC GCCTACTCCC AGGCCATCGG CACCGCGCTC CAGGCGGCGC TCACCGGCGA GGCGAGCCCG CGGGAGGCCA TGGAACGCGC CCAGGCGCGG GTCGAGGCGG AGCTGGAGGC CCGGTCATGA
|
Protein sequence | MREHLYRHPH RPLRTASAAV CAAALALTGV GCAAERDPSV YTIMDSSTDE PYHTWDQEAM DACGEQVGVE VRHISVPADQ LVPKALRMAS SDSLPDVLNL DGSDLPQFAA AGGLVPLEEL GIPTDGLSEG ALSIGSYEGV YYGAARSVNS LGLFYNAAAL EEAGIDPPRT WAELEEAAAE LTGGGRYGLA ISALATEDGV YQFLPWLWSN GGDERELASP ESVEALEYVT SLVEAGSVSP SVVNWTQADV NDQFIAGNAA MMVNGPWQLP VLQEHPDLEW AVAEIPVPEA GDTSVAPIGG TTFTVPVNAE DPDRERVAAE LVACLTTAEA QLDWSTKGSN VPVDTGAAEQ YRDLVPELAP FVDQVRTARS RTEHAGTEWN AYSQAIGTAL QAALTGEASP REAMERAQAR VEAELEARS
|
| |
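The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (even though the strand is "-"), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGCGCGAAC ACCTGTACCG ACACCCGCAC"
print(translate(prefix))  # prints: MREHLYRHPH
```

The output matches the first block of the listed protein sequence. Passing the full "Gene sequence" value to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the "GC content" field.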