Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5255 |
Symbol | |
ID | 9249153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 417340 |
End bp | 419295 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003683141 |
Protein GI | 297564168 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.313455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAC GAACGCCAGG CGGCGCGAGG CGGTGGCTGG GCGCCGCCGC GGGAGCGGCG GCGGTCGTGG TGACCGCCGG GTGCACCTTC TTCTCCCTCG ATCCCGAGGT CGAGGAGGGC GAACGGGTGG AGGGCACCGG CGAGGCGCCG ATGCTCACCG CCCTCGTGGA GTCGGGCGAC CTGCCCCCGC TGGAGGAGCG GCTGCCCGAC GAGCCGCTGG TGGTGGAGCC CCACGACCGC GTCGGCGTGT ACGGCGGCGA GTGGAACAGC GCCATCCTCG GCGTGGGCGA CTGGCCCTGG CTCGGCAGGA CCGTGGGCTA CGAGAACCTC ACCCGCTGGG ACCCGGAGTG GCAGGAGGTG ATCCCCAACC TCGCCGAGTC CTGGGAGTAC AACGAGGACG CCACCGAGCT GACCTTCACC CTGCGCGCGG GCCTGCGCTG GTCCGACGGC GAGCCGTTCA CCTCCGACGA CGTCGTGTTC GCCTTCAACG ACATCTTCAA CAACGAGGCG CTGACGCCGG TCGCGGCCGC CGATCCCGGC ACCGCCGAGA AGGTGGACGA GCGGACCTTC ACCATCACCT TCGACGAGCC CGACGCCCTG TGGGCCGGGT ACGACCTCCT CCAGTACCAG GTGGTGACCA AGCCCAAGCA CTACCTGGAG CGGTTCCACA TCGAGTACAA CCCCGACGCC GACGAGCTGG CCGAGGAGGA GGGGTACGCC GACTGGGTCG AGATGTTCGA GGCCGAGGCG GGCGTGATCG ACAGCTCCCG GTACTGGCAG AACCCCGACA TCCCCACCAT GTACCCGTGG CGGGTCGTGG AGCCGCTGGC CGACTCCGGG CGGATGGTGC TGGAGCGCAA CCCCTACTAC TGGAAGGTGG ACACCGAGGG CAACCAGCTC CCCTACATCG ACCGGGTCGT CTTCGACATC CTCCCGGACG AGGAGGTCAT GCTGGTCAGG GCGCTCAACG GCGAGTTCGA CATGCACTCG CGGCACTTCA ACACCCTGGA GAACAGGCCC ACCCTGGCCG AGGCGCGCGA GTCGGGCGGC TACGACTTCT TCGAGCTGCG GCCCGCCGAG ATGAACACCG CGATGATCTC CCTCAACCTC ACCCACGAGG ACGAGGAGCT GCGCGAGACC TTCAACGACC GCGACTTCCG GGTGGCGCTC TCACACGCCG TCAACCGCCA GGACATCATC GACGTCGTCT ACCGCGAACA GGGCGAGCCC TGGCAGGGCG CGCCCCGCGA GGACAGCCCC TTCCACAACG AGGAGCTGGC CAAGCAGTAC ACCGAGTACG ACCCGGACCT GGCCAACGGG ATCCTCGACG AGGCGGGCTA CGACGAACGC GACTCCGACG GCTTCCGCAC GAGCCCGCGC GGCGAGACGG TGCGCTTCAC GCTGTCGGTG CCCACGGACT TTCGCCCCGA CATCGTGGAC TCGATGGAGA TGGTCGTCGG CTTCTGGCAG GAGCTGGACA TCGACGTGGA GCTCAACACC GAGGACCGCT CGCTGTGGCA GACCCGCCGG GAGAACAACG AGCACGACGC CAACGTGTGG TCGGGCGACA ACGGCATGAT GGACGCGATG TACGACCCCC GCTGGTACGC GCCCACCCAG AGCGGGGAGT CCAACTTCGC CATCCCGTGG GCCCAGTGGT ACGTCTCCGA CGGCGAGGAC CCGCGCTCCC AGGAGCCGCC CGCCGACGTG CGCGAACACC TGGAGCTGTA CGACGCCGTC CAGGCCGAGC CCGACCCCGG GGCCCGCGAG GAGCTGATGC GCGAGTTCCT GTCGGTCTCC CAGGAGCGGT TCTACGCGAT GGGCGTCAGC CTGAGCCCGA CCGGCTACGG GATCGTCGCG GACGACTTCC ACAACGTGCC CGGGTCGATG CCCTCCTCCG GCAACTACAA CGACCCCGGG CCGACCAACC CCGAGCAGTA CTTCATCGAG GAGTGA
|
Protein sequence | MRRRTPGGAR RWLGAAAGAA AVVVTAGCTF FSLDPEVEEG ERVEGTGEAP MLTALVESGD LPPLEERLPD EPLVVEPHDR VGVYGGEWNS AILGVGDWPW LGRTVGYENL TRWDPEWQEV IPNLAESWEY NEDATELTFT LRAGLRWSDG EPFTSDDVVF AFNDIFNNEA LTPVAAADPG TAEKVDERTF TITFDEPDAL WAGYDLLQYQ VVTKPKHYLE RFHIEYNPDA DELAEEEGYA DWVEMFEAEA GVIDSSRYWQ NPDIPTMYPW RVVEPLADSG RMVLERNPYY WKVDTEGNQL PYIDRVVFDI LPDEEVMLVR ALNGEFDMHS RHFNTLENRP TLAEARESGG YDFFELRPAE MNTAMISLNL THEDEELRET FNDRDFRVAL SHAVNRQDII DVVYREQGEP WQGAPREDSP FHNEELAKQY TEYDPDLANG ILDEAGYDER DSDGFRTSPR GETVRFTLSV PTDFRPDIVD SMEMVVGFWQ ELDIDVELNT EDRSLWQTRR ENNEHDANVW SGDNGMMDAM YDPRWYAPTQ SGESNFAIPW AQWYVSDGED PRSQEPPADV REHLELYDAV QAEPDPGARE ELMREFLSVS QERFYAMGVS LSPTGYGIVA DDFHNVPGSM PSSGNYNDPG PTNPEQYFIE E
|
| |