Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2207 |
Symbol | |
ID | 9246057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2637588 |
End bp | 2638595 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | periplasmic binding protein |
Protein accession | YP_003680135 |
Protein GI | 297561161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000185763 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCGAGT TGTCGCGTCC TGTGCGTGCC CTGGCCACAC TAACCGCCCT GGCGGCACTG ACCCTGTCCG GCTGCGCCGC CCCGCCCTCC TCCCCCCGAA CCGGCGACGG GCCCACGCGC ACCGTACAGG CCGCCAACGG AGAGGTGGAG ATCCCCGCCG CACCCCAGCG CGTCGCCGTC CTGTGGCGCC CCACCCTGGC CGCCGCGACC CTGCTGGGAC ACGACGTGGC CGCCACCATG GGCACCCCCG GCGCACCCGA ACAGGGACTG GCCCCCTTCC TGCCCCCGGG CGCCGACGGC GACGCCCTGA CCCTGGTGAC CAACTCCCCC GCCGAGGACG ACGTCCAGCT CGAAGCACTG GCCAACGCCG CCCCGGACCT GATCATCGGC GTCCACACCC GGTCCGGCGC CCAGGCGCAG ATGCTGTCGG ACCTGGAGGC CATAGCGCCC ACGGTCCTGC TGGAGTGGGA GGGCACCGGA TCCTGGCGCG GCCACCTGCA CCAGGTCGCC GAGGTACTCG ACGCCCCCGA ACAGGCCGAG CGTGCCGTGG CCGAGTACGA GACGGCCCTG GAGGAAGCGC GCGAGCAGAT CACCGGGGCC GGGGTGGATC CGGCCGCCAC CGAGGTCTCG CTCGTACGGC TGCAGAGCCC CAGCGAGATA CGCCTGGAGA CGCCCGCCTC CTTCCCCGGA CAGATCGTCC GGGACCTGGG CCTGGCCCGC CCCCGGGGCC AGCACCAGGC CCAGGGCGCC ACCGACTTCA TCGCCCTGGG CTATGAGCAC CTGGAACGCG CGGACGGCGA CACGGTGTTC GTCCTGGCCG GATCGGGCTA CCCCGACGCG CCCCGCACCT TCTCCGAGGG GGTGTGGTCC AACCTGCGGG CGGTGCGGGA CGCACGGGTG TACCGCATGG ACCACGACGT GTGGGGCGCG GCCAACCACC ACGCCGCGCA CCGCATCGTC CAGGACGTGA CCGCGGCGCT GACCGGACAG GCCGAACCCG CGGTGTGA
|
Protein sequence | MPELSRPVRA LATLTALAAL TLSGCAAPPS SPRTGDGPTR TVQAANGEVE IPAAPQRVAV LWRPTLAAAT LLGHDVAATM GTPGAPEQGL APFLPPGADG DALTLVTNSP AEDDVQLEAL ANAAPDLIIG VHTRSGAQAQ MLSDLEAIAP TVLLEWEGTG SWRGHLHQVA EVLDAPEQAE RAVAEYETAL EEAREQITGA GVDPAATEVS LVRLQSPSEI RLETPASFPG QIVRDLGLAR PRGQHQAQGA TDFIALGYEH LERADGDTVF VLAGSGYPDA PRTFSEGVWS NLRAVRDARV YRMDHDVWGA ANHHAAHRIV QDVTAALTGQ AEPAV
|
| |