Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0536 |
Symbol | |
ID | 9244377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 657636 |
End bp | 659333 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003678489 |
Protein GI | 297559515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATCG GGAAGGTGCG CTACCTCGCC CCGGTCGCGG CGCTGGTTCT CGTGGCCAGC GCCTGCGGCG GAGGCGACGA CCGGGAAGAC GCCCAGGAGC AGGTGGCTTC CCTCGCCGCC GAGGACCTGA ACCCCGCGCC CCGCGAGGAG TTGCGAGAGG GCGGCAGCTT CGTGTGGGCC ATCAGCGCCT ACGACCAGCA GCACAACATG TGGCACACCC AGGGCAACAT GGCCAACGTG CGCCGCGTGG CCGCCGCGGT GCTGCCCAAC CCGACCCGGT ACGACGTTGA GGGGCTCCCC TCCCCCAACG AGGACTTCGT CCTCGACTTC GGGGTCAGCG ATGACGGGAT CGAGGTCTTC TACGAGCTCA ACCCCGACGC GGTGTGGTCC GACGGCAAGC CCATCACGTC CGAGGACTAC GAGGCCCAGG TCGAGACCGT CAGCGGTTCG CGCGAGGGCG ACTGGGAGCT CGGTGGCACC GACGGGTACG ACCAGATCGC CGAGTTCGTC CCCGGAGACG ATGAGTACTC CTTCACCCTG AGGTTCGACG CGCCCTTCGC TGAGTGGCCG TCCCTGTTCT CCCCGCTGTA CCCCAAGGAG TACATGGAGG ACGAGGAGCT CTTCAAGGAG GGCTACGTCA AGGACTACCC CGTCACCGCG GGGCCCTTCG GGAACGTGGA GTTCGACGAC GTCACCGAAC GCATCACGAT CACGAGGAAC GAGGACTGGT GGGGCACCGA GCCGCTGCTG GACGAGATCG TCTTCGACGC CATGGGCACC GATGCCATGG CTGGCGCCTT CAACAACGGT GAGATCGACG GCTTCTACCT CGGTTACGAC GCGGCCGGCT ACGAGTTGCT CAGAGACCGC GAGGGCGCCT ACTTCACCAG GGCCGTCAAC CACGCCTACC GGTTCGCCTC GCTCAACGGC GCGAGCCCGA ACCTGGAGGA CGTTCGCGTC CGTCACGCGA TCTCGCTGGG CCTGGACACC GACGCCCTCG CCGAGATCGC CCTGGGCGCG GTCGACTGGC CGATCACCGG TGAGACCAAC CGCCTGCTCC GCTCCAGCCA GAACGGCTAC CAGGACAACA GCGAGGGATA CGGGGAGTAC GACCCCGAGC GCGCGGGCGA GCTGCTCGAC GAGGCGGGGT GGATCCTGGA GGAAGGCGCC GAGTTCCGCA CGAACGCCGA GGGCGAGACG CTGTCCTTGG ACTGGGTGGC CTCGGACGAC ATGGCCATCG CCCAGGACGA GGCTGAGATC GGCCGCGACA TGCTCGCTGA GATCGGTGTC GAGGTCAAGG TCCAACAGGT GCCGAACAAC GCCCTGTTCT CCGAGTACGT CATCCCCGGA AACTACGAGG TCGCCACGTA CGTTCTCGCC GGTTCCCACC CCTACGCGGG TGACGCCCAG GAGAACTACG GTATGCCGGA CCAGGACGGG GAGTGGGGTA ACAACCTCAC GCGCATCTCG ACCGAGGAGA TCGACCGGAA GTTCGGTGAG ATGCGTTCCG AGACGGACCC GGACCGCTAT GCCGAGCTCG CCAACGAGAT CGACCGCCTG TTGTGGGAGG AGGTCGCGAC CATTCCCTTC TTCGAGCGTC CCGGCCTCTA CGTCATGAAT GAGGACCTGC ACAACTGGGG GGAGTTCGGT CTGGCCTCCG AATACGTCTA CGAGGACATC GGCTGGGCCG CCGAGTAG
|
Protein sequence | MRIGKVRYLA PVAALVLVAS ACGGGDDRED AQEQVASLAA EDLNPAPREE LREGGSFVWA ISAYDQQHNM WHTQGNMANV RRVAAAVLPN PTRYDVEGLP SPNEDFVLDF GVSDDGIEVF YELNPDAVWS DGKPITSEDY EAQVETVSGS REGDWELGGT DGYDQIAEFV PGDDEYSFTL RFDAPFAEWP SLFSPLYPKE YMEDEELFKE GYVKDYPVTA GPFGNVEFDD VTERITITRN EDWWGTEPLL DEIVFDAMGT DAMAGAFNNG EIDGFYLGYD AAGYELLRDR EGAYFTRAVN HAYRFASLNG ASPNLEDVRV RHAISLGLDT DALAEIALGA VDWPITGETN RLLRSSQNGY QDNSEGYGEY DPERAGELLD EAGWILEEGA EFRTNAEGET LSLDWVASDD MAIAQDEAEI GRDMLAEIGV EVKVQQVPNN ALFSEYVIPG NYEVATYVLA GSHPYAGDAQ ENYGMPDQDG EWGNNLTRIS TEEIDRKFGE MRSETDPDRY AELANEIDRL LWEEVATIPF FERPGLYVMN EDLHNWGEFG LASEYVYEDI GWAAE
|
| |