Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2437 |
Symbol | |
ID | 9246287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2890666 |
End bp | 2891712 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | periplasmic binding protein |
Protein accession | YP_003680363 |
Protein GI | 297561389 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0367724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.545256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCA CCGGGGCACG CGCGGTCCGG TACCGCCTGC CCGCCTGCGC CGCCCTCGCC CTGCTCCTGG CCACGACCGC CTGCGGCGGG GGCTCCGCCG CCCCCTCCCC CGAGGCTGCC GGGGAGACGA TCCGCAACTG CGGCGTCGAC GTGGCCGCGG ACAGCCCGCC GGAACGGGTG TTCGCCGCCT ACCAGCCCGC CATCGAGACC GCCCACGCGC TGGGCCTCAG CGACCGCCTG GTGGGGACGG CGTTCCTGGA CGCCGCCGTC CTGGAGGAGT ACGCCGACGC CCAGGCCGGG CAGGAGTACT ACCCCAACCT GCCCAGCCGG GAGGAACTGC TCAGCCACGG GCCCGACTTC GTGCTCGCCG GGTTCAACGA CGTGTTCACC GACGAGAACC TCGGTACGCG GGCCTCCCTG CGCGAGCTGG GCATCGGCAG CTGGATCCCG GCTCCGCTGT GCCCTAGCGG GGACGGCCGC ACCGACGCGA CCATCGACCC GGCCTCGGTG ACGATGGACA ACGTCTACGC CGACCTGCGG AACCTGGGCG CCCTGTTCGA CGCGCGCGAG CGCGCCGAAG AGGTCATCGC GGACATGGAG GGCACGATAG GCGGGGTCAC CGAGGCCCTG GAGGGCGGCG TGCCTGAGGA GGACCGGCCC TCGGTGATGG TCGGCCGCCC CAGCGACCAG GGGTTCCGGG TGGCCGGCGG GCAGGACTTC TCCACCGAGA TCATCCGGCT GGCCGGAGGC GTCAACGCCT TCGCCGACCT GGACGGCGGT CGCAACCACG ACGTGGCCGT CGAGGACGTG ATCGCGCGCG ACCCCGACTT CATCCTGGTC GACGTGTGCT GCGACGCGCG GATGACCGCC GCCGACGCCG CCCCGGACGT GGAGCGGATC ACGGCCGATC CCGTGCTGGC CAACCTGACC GCGGTCACCG AGGAGCAGGT GGAGGAGTTC ACCTTCGCCG ACCGTTCCGC GGGTGTGCGC AGCGCGGCCG CGGTGGAGAC GGTCGCCCGG ATCCTGCACC CCGGCCTGTT CGGGTAG
|
Protein sequence | MKPTGARAVR YRLPACAALA LLLATTACGG GSAAPSPEAA GETIRNCGVD VAADSPPERV FAAYQPAIET AHALGLSDRL VGTAFLDAAV LEEYADAQAG QEYYPNLPSR EELLSHGPDF VLAGFNDVFT DENLGTRASL RELGIGSWIP APLCPSGDGR TDATIDPASV TMDNVYADLR NLGALFDARE RAEEVIADME GTIGGVTEAL EGGVPEEDRP SVMVGRPSDQ GFRVAGGQDF STEIIRLAGG VNAFADLDGG RNHDVAVEDV IARDPDFILV DVCCDARMTA ADAAPDVERI TADPVLANLT AVTEEQVEEF TFADRSAGVR SAAAVETVAR ILHPGLFG
|
| |