Gene Information |
Locus tag | Ndas_3813 |
Symbol | |
ID | 9247684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4575790 |
End bp | 4576842 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003681716 |
Protein GI | 297562742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
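The coordinate and length fields in the table above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, not part of the original record. The function names are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the last codon of the CDS is a stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4575790, 4576842)
print(length, protein_length(length))  # prints: 1053 350
```

Under these assumptions the results match the Gene Length (1053 bp) and Protein Length (350 aa) fields.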
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.941713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00422068 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCGCCG TCCGCACGAC CGCCGCCCGC GTGTCCGCCG CCACCGCCGC GTTCGCCCTC GCCGGAGCCC TGACCTCCTG CGCCGCCTCC GACCCCGCCG ACACCGTCAC CGTCTACACC GCCGACGGCC TGCGCAACGA GGAGGGCACC GGCTGGCTCG ACCAGGTCTT CGCCGACTTC GAGGCCGAGA CCGGCGTCAC CGTCCAGTAC GTCGAGGGGG GCTCGGGCGA GATCGTCCAG CGCGCCGGGC GCGAGACCGC CAACCCGCGG GCCGACGTCA TCATCACCCT GCCGCCCTTC ATCCAGCAGG CCGACGGCAT GGGCCTGCTC CAGCAGTACG AGCCGCAGGG CTCCGAGCAC GTCGGGACCA AGGACCCCGA CGGGAGGTGG ACCACGATCG TCGACAACTA CTTCTGCCTC ACCTACAACA CCGAGGAGCT GGAGGAGCCG CCCGCCACCT GGGACGACCT GCTCGACCCC CGCTTCGAGG GCCGCCTCCA GTACTCGACG CCGGGCGTGG CGGGCGACGG CACGGCCGTG CTCATCCAGA CCATGCACGA CTTCGGCGGC CTGGAGCCCG CCATGGACTA CCTCGGACGG CTCCAGGCCA ACAACGTCGG CCCGTCCTCC TCCACCGGCG CGCTGGGCCC CAAGGTGGAC AAGGGCGAGC TGCTCGTCGC CAACGGCGAC GTCCAGATGA ACCTCGCCCA GGCCCGCACC ATGCCCAACC TCGGCATCTG GTTCCCCGCG CACGAGGAGG GGGAGCCGAG CACCTTCGCC CTGCCCTACA CCGCCGGGCT GGTCGAGGGC GCGCCCCAGG CCGACAACGG CCGCGCCCTC CTGGACTTCC TGCTCTCCGA GGCCGCCCAG GAGCAGGTCG TCCCCGTGGC CGGGGGCTTC CCGGCCCGCA CCGACGTCCC GGTGGAGGGC GGGGGCGCCG AGGAGCTGGA GGCGCTCATG GAGGGCGTGG AGGTCTTCGA GCCCGACTGG GACGACATCG ACGCCAACCT GCCCGAGTAC CTCGACGCCT GGCGCGAGGC CACCGGCAGC TGA
|
Protein sequence | MPAVRTTAAR VSAATAAFAL AGALTSCAAS DPADTVTVYT ADGLRNEEGT GWLDQVFADF EAETGVTVQY VEGGSGEIVQ RAGRETANPR ADVIITLPPF IQQADGMGLL QQYEPQGSEH VGTKDPDGRW TTIVDNYFCL TYNTEELEEP PATWDDLLDP RFEGRLQYST PGVAGDGTAV LIQTMHDFGG LEPAMDYLGR LQANNVGPSS STGALGPKVD KGELLVANGD VQMNLAQART MPNLGIWFPA HEEGEPSTFA LPYTAGLVEG APQADNGRAL LDFLLSEAAQ EQVVPVAGGF PARTDVPVEG GGAEELEALM EGVEVFEPDW DDIDANLPEY LDAWREATGS
|
| |
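The sequences above are shown in space-separated blocks of ten characters, so the spaces need to be removed before any calculation. The following is a minimal Python sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and the sketch assumes GC content means the percentage of G and C bases among all bases. The example uses only the first two blocks of the gene sequence, so its result is not expected to equal the 73% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCCCGCCG TCCGCACGAC")
print(len(fragment), gc_percent(fragment))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` would give the value to compare against the GC content field.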