Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4535 |
Symbol | |
ID | 9248415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5378107 |
End bp | 5379810 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003682428 |
Protein GI | 297563454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.494788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAAAC GAAGGAAGAC GCTCGCGCTC GCCGCGGCGG GCACCTCCGC TTTGATGGTG CTGACCGCCT GCAGCGGCGG CGGTGGTGGC GAAGGCGAGC AGGAGAAGGA GGTCACATGG GTGATCAACA GCCTTCCCGC CGCCTGGGGA GCCATCAGTA GCGCCGGAGG CAGCGTCTAC GTCATCCAGG CACTCTCCGG CGTCGTGCCC TTCACCGGGC AGTACCAGCC CGACGCGACG TACGAGTACG ACATGGACGT CCTCGCCGAG GAGCCCACCC TCATCAACGA CAATCCCGAC GAGGGTCCGT TCCAGTTCAG CTTCACCCTC GCCGAGGACG CCGTGTGGAA CGACGGCACG CCGATGACCG GCGAGGACCT GCGGGTCACC ATGATGATGT CGGCGTCCCC CACCGAGGGC TACTGCGACA CCTGTGACTC CCGCGGCACC ACCGGCGCCG ACATGGTCGA GGAGGTCGAG GTCGACGGCA AGACCGCGAC CTTCACCCTC AAGGAGGGCC TGTCCAACCC CGAGTGGATG GGCATGTTCG ACGCGCACAG CGTTGGCGGC GGCTTCTACC CGGCGCACCT GGCCGAGGAG AACGGCTGGG ACGTCGACGA CCCCGCGCAG CTCGGCGAGT ACTACGCCTG GCTGCACGAG ACGCGCCCCG AGTGGTCCGG CGGCCCCTAC CAGATCGTGG ACGGCGACCT GGAGAACCAG GTCGTCAAGG AGCCCAACCC CGAGTGGTTC GGTGAGACGC AGCCCGCGCT CGACCGCATC ATCATGCCGT ACAACACCGA CGAGGGCACC TTCATCCCCG CCTTCCAGAA CGGCGAGATC GACGGCGCCA ACCCCGCGCA GTACAGCGAG GACATCATCA CCCAGCTCCA GGGGATGGAG ACCGCCACGC TCACCATCGG CGAGGGCAAC ATCTGGGAGC ACATCGACAT CAACACCGAG AACGAGTGGC TCTCGGACGT CGAGCTCCGC AGGGCCGTGT TCACCGCGAT CAACCGCGAC GAGATCGCCA GCCGCAACTT CGAGGCCGGA TACCCCGAGT ACGAGCTGAA GAACAACCAC ATCTTCGGCA GCGACAGCGA GTACTTCGAG GACCTCGTCT CCGAGTCCGG GCAGGGCAGC GGCGACGTCG AGGCCGCCAC CGCGATCCTG GAGGAGGCCG GTTACGAGCT CGACGGGGAC ACCCTCATGC TCGACGGCGA GCAGGTCGGC CCGTTCCGCC TGCGCAGCAC CGACACCGTC ATCCGCAACA ACTCCGTGCA GCTGATCCAG GCCCAGCTCG CCGAGATCGG CATCGAGACC ACCATCGAGA TGACCGACGA CCTGGGCACG ATGCTGGCCG AGCAGGACTA CGACATCGTC CAGTTCGGCT GGAGCGGCAG CCCGTACTTC GCCTCCAGCC CCGAGCAGTT CTGGCACTCC GAGAGCACCA GCAACTTCGG CGGCTACTCC AACGACGAGG TGGACGAGCA CGCCGAGGCC ACCGCGACGG CCGCCAACCT GGACGAGGCG GCCGAGCACG CCAACGCCGC CGTGGCCGCC GTGGTCCCGG ACGCCTACGT CCTGCCGATC GTGGCCGAGC CCAACTACTT CTTCGTGAAC GACAGGCTCG CCAACGTCGA GGACAACCTC CAGTCCAGCT ACCGCGCCAC CTACAACATC GGTGAGTGGG ACCTCGCCGA GTAG
|
Protein sequence | MHKRRKTLAL AAAGTSALMV LTACSGGGGG EGEQEKEVTW VINSLPAAWG AISSAGGSVY VIQALSGVVP FTGQYQPDAT YEYDMDVLAE EPTLINDNPD EGPFQFSFTL AEDAVWNDGT PMTGEDLRVT MMMSASPTEG YCDTCDSRGT TGADMVEEVE VDGKTATFTL KEGLSNPEWM GMFDAHSVGG GFYPAHLAEE NGWDVDDPAQ LGEYYAWLHE TRPEWSGGPY QIVDGDLENQ VVKEPNPEWF GETQPALDRI IMPYNTDEGT FIPAFQNGEI DGANPAQYSE DIITQLQGME TATLTIGEGN IWEHIDINTE NEWLSDVELR RAVFTAINRD EIASRNFEAG YPEYELKNNH IFGSDSEYFE DLVSESGQGS GDVEAATAIL EEAGYELDGD TLMLDGEQVG PFRLRSTDTV IRNNSVQLIQ AQLAEIGIET TIEMTDDLGT MLAEQDYDIV QFGWSGSPYF ASSPEQFWHS ESTSNFGGYS NDEVDEHAEA TATAANLDEA AEHANAAVAA VVPDAYVLPI VAEPNYFFVN DRLANVEDNL QSSYRATYNI GEWDLAE
|
| |