Gene Information |
Locus tag | Ndas_0011 |
Symbol | |
ID | 9243838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 14379 |
End bp | 15800 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003677970 |
Protein GI | 297558996 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0936464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0312936 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGGCCC GCCCGAGGCC GCCGCGGCCG GTCGTGTGCG CGCTCCTGTC GCTCCTGCTG CTGGGCTCCG CCGCCTGCGA CGCGCGGGGC GAGCGGGACG AACGGGTCCT GCGCGTCGCC CTCCCCGTTG ACGTCACCGC CACGGAGGAC ACGGGCGGGG GCGGGGTGTA CACGCGGCTC ATCGAGCGCT GGGAGGCGGA GAACGAGGGG TGGGAGGTCG ACGTCCGGTG GCTCTCGCCC CGGGCGGACG AGCAGCGCTC GCAGATGGTC GTGGCGATGC AGGCCGACCC GGCGGCCTAC GACGTCCTCC TCCTCGACAA CCAGTGGGTG CCGGAGTTCC ACGAACGCGG CTGGCTGGCG GAGGTGGACA CCGGCGGAGA CGGCCCGCTG GCGTGGAACG GCTTCCTGGG CCGCGCCCGG GACGCGGTGT GCTACGAGGG CCGGGCGCGG GCGGTGCCGT TCCACATGGA CGTGGGCCTG CTCTACTACC GCGCCGACCT GGTCGGCCCC GAGGAGGTCA CCGCCCGGAT CGAGGAGGAC GGCTGGCGCG GGCTGCTCGG CCTCGCGGAC GAGGTACGCG ACGAGCACGG CCTGGAGCAC GGGTACACGG GCCAGTTCGG CGACTACGAG GGCCTGACCG TCAACGCCCT GGAGTTCGTC CTCGGCGGGC ATCCCGGTCT GGAGGCGGAC AGCCGGGGGA CCTTCCACGG GTCGGACGGG GCGGACGGCG GCGGGGCGGA GGACGGCTGT GTCACGTCCG CCGGTGCGAG CGGCGAGTTC CAGGGCCTGG AGGTGCTGGA GGCGGGGCTG GAGCCCGAGG CCGGGGGTGT CATCCCGCGC GCTGCCCTGG AGGAGACCGA GACCGAGAGC CTGAACCGGT TCCGCGGCGG CGAGGTGGTG TTCATGCGCC ACTGGCCCCG GGTCGTGCCC CAGCTCCAGG CCGACTCCGA GAGCGCCGAG ACGCTCCGGG AGGGGCTGCG CTGGGTCGGC TGGGCGGGGG AGGCCGGGGG CGGCGACCCC GCGTTCGGGG TGCTCCGGCT GCCCGCGGCC GTGCTCGGCG GCAACAGCCT GGCGGTCACC GGGGAGTCGC CGCACCGGGA CGAGGCCTGG AGCCTGGTCG CGGCCATGAC CGGGCAGGAG GTCCAGGCGG AGTTCCGGGA GGCCGGTCTG CTCCCCGCCC GCGGCAGCGG CTACCGGCTG AGCGACGCCG AGGACCGGGA CGTGCGGTAC TGGGAGGAGC TGCGCGACGC CGTCGGCGAG GGACTGCTGC GGCCGCGCAC CCCCTACTAC CCGTACGTGA GCGAGGTGCT GCGGGACCAC GTGGACACGC AGATCCGCCC GGGCTCGACC GACCCGCACC CCGAGGACAT GGTGTGCGAG CTCAACGCCG CGCTCGCCGG GAAGGTCAGC ACGTGCGGCT GA
|
Protein sequence | MRARPRPPRP VVCALLSLLL LGSAACDARG ERDERVLRVA LPVDVTATED TGGGGVYTRL IERWEAENEG WEVDVRWLSP RADEQRSQMV VAMQADPAAY DVLLLDNQWV PEFHERGWLA EVDTGGDGPL AWNGFLGRAR DAVCYEGRAR AVPFHMDVGL LYYRADLVGP EEVTARIEED GWRGLLGLAD EVRDEHGLEH GYTGQFGDYE GLTVNALEFV LGGHPGLEAD SRGTFHGSDG ADGGGAEDGC VTSAGASGEF QGLEVLEAGL EPEAGGVIPR AALEETETES LNRFRGGEVV FMRHWPRVVP QLQADSESAE TLREGLRWVG WAGEAGGGDP AFGVLRLPAA VLGGNSLAVT GESPHRDEAW SLVAAMTGQE VQAEFREAGL LPARGSGYRL SDAEDRDVRY WEELRDAVGE GLLRPRTPYY PYVSEVLRDH VDTQIRPGST DPHPEDMVCE LNAALAGKVS TCG
|
| |
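The coordinate and length fields in this record can be cross-checked with simple arithmetic. The Python sketch below is a minimal illustration, not part of the source database. It assumes 1-based inclusive coordinates and assumes the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values taken from the Gene Information table (Start bp, End bp).
    length = gene_length(14379, 15800)
    print(length)                  # 1422, matches "Gene Length"
    print(protein_length(length))  # 473, matches "Protein Length"
```

Passing the "Gene sequence" field to `gc_percent` gives a value to compare with the reported GC content. The sequence's spaces do not need stripping first, because the function skips them.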