Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0487 |
Symbol | |
ID | 9244328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 585245 |
End bp | 587023 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003678440 |
Protein GI | 297559466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.183361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGACC TGAGCCGCAG GCGCCTGCTC CGGACGATCG GCCTCGGCGC GGCGGGCGTC GCCGGCGCCG GAGTGCTGTC CTCCTGCGCG GGCGGCGACA ACGCCGAGAG CGGCGCCACG CAGTTCACCG GCGTCTTCGA CTTCGACCTG GCCGCCCAGA CGCGCAACGT CGCCGTGGAG GACGGCGCGC TGCTGATGAA CTCCGTGTAC GCCGACCTCT TCCTGCCCGC GGGCGCGTTC TACAACTGGG AGACCCACGA GTGGGACTAC CTGCTCCTGG AGAACAGCGC CTGGGAGGGC GACGACCTCG TCGTGACCCT GCGCCCGGGC CTGAAGTGGA GCGACGGCAC CGACCTGACC GCCGAGGACC TGCACCAGAA CTACGCCATC CGGGTCCTGG AGGCGCCCGC GTGGTCGGTG GGCTTCCCGC AGATCACCGA GTTGGAGCGG CTGGACGACC TCAGCGTCCG CGCGCGCTTC GGCAACCCCT TCCCCGGCAT CGAGCTCCAG GTCGTCAAGC ACCGGATCTT CTCCAAGTCG ACCTACGGCG ACTTCGGGGA GCGCGCGATC ACGATGGTGG CCGACGGCGT CCGCCAGGGC GACGACGAGC ACACCGAGTT CAACGCCGAG TTCATCGAGT TCAACCCCGA GGAGATCATC ACCAGCGGGC CCTACACCTT CGACCGGGCC CAGATGTCCG ACGCGCGGAT CACCCTGGTG CGCGAGAGCA CCGGCTACCG GGGCGAGGAG GTGAACTTCG AGGAGGTGGT CGTCCACAAG GGTGACAACC GCCAGGCCTC CCTGCTCATC CAGCAGGGCG AGGTCGACTA CTCGACCCTG GCCACCTCCG CCGCCGACCA GCAGGCGTTC CGCGGCGTCG AGGGGTTCCG GTGGATCGAG CACCCCGGCT ACGACGGCTG CGGACTGATG TTCAACTACG CGGCCAAGCC CGAGCTCAAG GACGTTCGGG TGCGCAAGGC GCTCAAGCAC CTGCTCGACA GCGACCAGAT CGGCCAGGTG GCCCGGGGCG AGGCCTACGA CCGGGTGCGG TACTACTCCG GGCTGGTCGA CCTCCAGACC GAGCAGATCT TCACCCCCGA GGAGCTCGCG GAGTTCGCCG CCTACGACCA CGACCCGGAC CGGGCCACCG AACTGCTGGA GGAGGCGGGC TGGACCAAGG AGGGCGGGGT CTGGCACACC GCCGAGGGTC AGGAGGCCAG CTACGAGATC ATCGGCGTCG CGGGCTGGGG CGACTTCGAG CTGACCGCCA CCCAGGTGGA GGAGGCCTGG AACGCCTTCG GTATCAAGAC GACCGCGCGC AACGTGCCCG CGGACAACCC GTGGGGCATC TGGGCCGCAG GTGACTTCGA GGTGGCCGTG CGCCACTGGG GCAACCCGGA GATCCCGCAG TACTGGGGCG CCTTCCAGAT GAACTTCCTG GTGGAGAACG CCCGCACCGG CGAGACCCCG GGGCAGGACT TCGACCTGAA GGTGGACAGC CCCAGCCGGG GCGAGGTGGA CCTGGAGGCG CTGGTCGAGG TCGCCAAGAC CGCGCAGACC GAGGAGGAGC AGACCGAGGC CCTCAAGACG ATGGCGATCG TCTTCAACGA GCTGCTGCCG CGCATCCCGA TCTGGACCTA CAAGTACCTG GCCCCGGCCC TGGAGGGCGC GCGGGTGGAG TCCTTCCCCG AGGACCACCC CGCCGCCCAG AACCAGATCT ACCGGGACAA CCACATCATC CTGTCGCTGA TGCAGGGCGG CCTGGAGGCT GCCGGGTAG
|
Protein sequence | MRDLSRRRLL RTIGLGAAGV AGAGVLSSCA GGDNAESGAT QFTGVFDFDL AAQTRNVAVE DGALLMNSVY ADLFLPAGAF YNWETHEWDY LLLENSAWEG DDLVVTLRPG LKWSDGTDLT AEDLHQNYAI RVLEAPAWSV GFPQITELER LDDLSVRARF GNPFPGIELQ VVKHRIFSKS TYGDFGERAI TMVADGVRQG DDEHTEFNAE FIEFNPEEII TSGPYTFDRA QMSDARITLV RESTGYRGEE VNFEEVVVHK GDNRQASLLI QQGEVDYSTL ATSAADQQAF RGVEGFRWIE HPGYDGCGLM FNYAAKPELK DVRVRKALKH LLDSDQIGQV ARGEAYDRVR YYSGLVDLQT EQIFTPEELA EFAAYDHDPD RATELLEEAG WTKEGGVWHT AEGQEASYEI IGVAGWGDFE LTATQVEEAW NAFGIKTTAR NVPADNPWGI WAAGDFEVAV RHWGNPEIPQ YWGAFQMNFL VENARTGETP GQDFDLKVDS PSRGEVDLEA LVEVAKTAQT EEEQTEALKT MAIVFNELLP RIPIWTYKYL APALEGARVE SFPEDHPAAQ NQIYRDNHII LSLMQGGLEA AG
|
| |