Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2586 |
Symbol | |
ID | 9246437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3078872 |
End bp | 3080683 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003680510 |
Protein GI | 297561536 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.653336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00337365 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGACCACCG ACCACCACAC AGGCGCGGCG CCCTCGCGGC GCCGCGGACG CCGCCGGACG ACCGCGACCC TGGCCGCGGG CACGGCCGCG GCGCTGCTGC TGGCCGCCTG CGGGGGCGCG GACGGAGGCG GGGGCGAAGG CGGCCCGGTC GACGCCGAGT TCAACCAGGG CGAGACCGAG GTGGTCAACC CCTCCGACCA GACCGGCGGC ACCCTGCGCT ACGCCATCTC CTCGGACTTC GACTCACCCG ACCCCGGCAA CACCTACTAC GCGTTCGGCT GGAACTTCAG CCGCTACTAC GCGCGCACCC TGCTCACCTA CGTCGGGGCG CCCGGCGCGG AGGGCACCGA ACTCCAGCCC GACCTGGCCG CGGAGATGCC CAGGCCCAAC GAGGACCTCA CCGAGTGGAC GGTGCCCATC AAGCAGGGGC TGCGCTACGA GGACGGCTCC GAGATCACCG CGCCCGACAT CGAGTACGCC ATCGCGCGCG GCAACTTCGG CGCCCAGGCC CTGCCCAACG GCCCCAAGTA CTTCCAGGAC CTGCTGGCCG ACAGCGACGA CTACGAGGGC CCCTACGCCG ACGAGGACGA CCCGCTGGCC GGGTTCGACG GGATCGAGAC CCCCGACGAC CACACCCTGG TCTTCCACCT CAAGGACCCC TTCGCGGAGT TCCCGTACCT GCTCATCCAG CCGCAGACCG CGCCGGTCCC GCCCGAGGCC GACCGCGGTG AGCAGTACCA GAGCCGCGTC GTCTCCTCCG GGCCCTACAA GTTCGACGGC GAGTACCGGC CCGGCGTCTC CCTGAACCTG GTGCGCAACG ACCAGTGGGA CCCCGCCACC GACCCGACCC GCGAGGCGCT GCCGGACCGG GTCGAGGTGC AGCTGGGAGT GGACCAGAAC GAGATCGACC AGCGCCTGGC CAGCGGCGAC CTGGACGTGG ACCTGGCCGG GGCCGGGGTG GGACCCGCCA TGCGGGGCAC CCTGCTCACC GACGAGGCGC GCAAGAACAG CGTGGACAAC CCGCAGAGCA ACACGCTGCG CTACGTCAAC ATCAGCACCG TCCTGGAACC CCTGGACGAC CTGGCCTGCC GTGAGGCGGT CATGTACGCG GCCGACCGCG ACGCCCTCCA GCGGGCCTGG GGCGGCGACA CCGGCGGCGA CATCGCCACC CAGATCATGC CCGCGTCGCT GCCGGGGGCC GATCCCGGCA TCGACCTGTA CCCCTCACAG GACAACCAGG GCGACCTGGA CAAAGCCCGG CAGAAGCTGG AGGAGTGCGG CGAGCCCGAC GGGTTCTCCA CCTCCATCGG CGTCCGGGCC GACCGGCCCT CCGAGGTGAG CACCGCCGAG GCGCTCCAGC AGGCCCTGGC CAGGGTGGGC ATCGAGACGC GGATCAAGCA GTACCCCTCG GACACCTACA CCAACACCCA GGCCGGGTCG CCGTCCTTCG TGGAGGACAA CGACCTGGGC CTGACCGTGT ACGGGTGGGC CCCGGACTGG GCGAGCGGCT ACGGCTTCAT GAGCAAGATC CTGGACGGCG ACGCCATCCA GGACGCGGGC AACGCCAACA TCTCGGAGCT GGACGACGAG CGGATCAACG GCTGGTTCGA CGAGGTCATC ACCGTGCGGG ACCCCGAGGA GCGCGCCTCG ATCTACACCC GGATCGACCG GCGGGCGATG GAGCAGGCGG CGATCCTGCC CGCGGTGTTC GAGCGCACGG TGCTCTACCG GCCGCCGAAC CTGACCAACG TGTACTACCA CTCGGGCTAC TCCATGTACG ACTACATGGC GCTCGGCACC ACCCGGGAGT GA
|
Protein sequence | MTTDHHTGAA PSRRRGRRRT TATLAAGTAA ALLLAACGGA DGGGGEGGPV DAEFNQGETE VVNPSDQTGG TLRYAISSDF DSPDPGNTYY AFGWNFSRYY ARTLLTYVGA PGAEGTELQP DLAAEMPRPN EDLTEWTVPI KQGLRYEDGS EITAPDIEYA IARGNFGAQA LPNGPKYFQD LLADSDDYEG PYADEDDPLA GFDGIETPDD HTLVFHLKDP FAEFPYLLIQ PQTAPVPPEA DRGEQYQSRV VSSGPYKFDG EYRPGVSLNL VRNDQWDPAT DPTREALPDR VEVQLGVDQN EIDQRLASGD LDVDLAGAGV GPAMRGTLLT DEARKNSVDN PQSNTLRYVN ISTVLEPLDD LACREAVMYA ADRDALQRAW GGDTGGDIAT QIMPASLPGA DPGIDLYPSQ DNQGDLDKAR QKLEECGEPD GFSTSIGVRA DRPSEVSTAE ALQQALARVG IETRIKQYPS DTYTNTQAGS PSFVEDNDLG LTVYGWAPDW ASGYGFMSKI LDGDAIQDAG NANISELDDE RINGWFDEVI TVRDPEERAS IYTRIDRRAM EQAAILPAVF ERTVLYRPPN LTNVYYHSGY SMYDYMALGT TRE
|
| |