Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1748 |
Symbol | |
ID | 8447350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1911412 |
End bp | 1915455 |
Gene Length | 4044 bp |
Protein Length | 1347 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645040874 |
Product | Ricin B lectin |
Protein accession | YP_003201127 |
Protein GI | 258651971 |
COG category | [S] Function unknown |
COG ID | [COG3827] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.207372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0116173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAC GAACCACCGG TGATGGCCCC ACCGCCCTGC GCGCCCTCGC CCGCCGCGGC CTGCTCAGCC TGGTCGCGGC CTCCACCGCC GCCACCATCG CACTGGTGCC GGTGGCCCCG GCGACGGCCA CGCCCGTCAT CGTGGTCCCG GCGGCTGCGG TCGCGTCGGC GCCGATCGCC GATCCGGGGG CCGCCGCGCC CTCGGCGGCC CCAACCACCG ACCCGGCCCC CACGCCCGCC CCGACCCCGG CCTCGACGCC CGCGCCGGCC TCGACGCCCG CGCCGGCCCC GACACCCGCG CCGGCGCCGG CCGCGGCGGT GGCTCCGGCT CCCGCCGCCG CCACGGCCAC GACGACCGAC CCGGCCCCGA CCGGCAGCCC GGCTCCGGCT CCGGCCGCGG GGTCGCCGTC CGCCTCCACC CCCGCCTCCA CCGCACCCTC GACAGTGGTT CCCAGCAGCA CCTTCTCGCT GTCCCCGGCG GCCCGGTCGT TCCTGTCCGC CCTGAACCTG AACCTGGGAT CCGGGCCGGT CACCGGCACC CTGGACGGCT CGGTGCTCAC CGTCGCCGTC GGCGCCCCCG CCGTGGCGTT CCCGCTGCCC ACCCCCGGCC AGACGGTCTC CTTCACCGGC GCCACCTTGA CCATCGACGA GTCCACCCGG ACGCTGACCC TCACCGCCGC GGTCACGACC GGCAACGGTC TCGGCGGTTC GCTGTCGGTG AGCATCGCCA ACGCCGATGC CACCGATCTG ACCGGAGCCG ACCTGACCGC CACCCTCGAC ATCACCGGGA TCTCCGTGCT GGGGACCACG GTGGAGGTCT CCGGCTCGCT GTCCGCCACC GGCGGCCGGC TCGCCGCCTC CCTGACCGGC AGGCTGAGCA GCGACGGCGT CGTCGCCGAC GGGGTGCTGA CCGTCAAGTC CGGGAGCACC ATCACCCTGG CCACCGACAC CGGCCTGAGC GTCAACGGCA CCGCCGTCCT CGGTTCCGGC CCGACGGCTT TCACCGTGAT CGTGTCCGGT GCGATCAAGG ACGGCCGGAA CTGGTCGCTG AGCGTGGACA ACACCGCCGA GACGCCACAG TTCAGCCCGG TCGACGGCCT GAGCATCAGT CCGGCGTTCG CCGGAACGAT CACCGACGCC AACGGGACCG TCACCTTCGA TGTCGCCGGG GACGACACCG GGAGCTGGCA AACCGGCCCG GCCACGTTGT CCCTGACGCA CGTGGAGGTG TCCAACGCCG CTCCCCGGGA CGGATTGGCC TGTCCCGCGG TCGATCCCGG CCAGGTCTGG TTCGACATCC AGGGCGGCGT GGCCGATCCG GCGGCCGGGA TCGCCGGTTC CGCGCAGGCC TGCGTGGTAC CGGCGGCCAA GGCGTTCCAG ATCACCGCGA GTGCGCCGAG CATCACCCTG CCCAACGCGG CCCACTTCTC GCTGGACCAG CCATCGGTGT CGATCACCGG CAGCGGCGTC GGGACGGCGC AGGCCAAGGT CGCGGTGTCG GCCGCCGCCA CCCTGACGGT GGCGCCGGAC GCCGACCACA CGGTCCACAC TCCGGTCACC CTGAGTTTCG CCGACGATGG CAGCTTCACC GCTTCGGCCG GGATCGATCT AGGCGCGCTC GGCGTCGGCT CGGGCCAGGG CACCCTGGTG CTGGCCAGCA AGCAGATCGC CCAGTTCGAC CCGACGACCG TGGGGGCGAC CGGTCAAAAG TTCGATCTGC CGGCCGGCGT CACGCTGCTG TTCGCCTACC AGCCCTCGGG TGCGGTGAGC GCGGCGCTCA AGAACCTGCA GTTGCCCGCC CCGAAATCCA TCGCCGCCCG GGCCACCCTG TCCGACAACG GGTTCGAGGC GACCGCTCAG CTGCAGTTCG GCGACCGGGA CCAGGGCGCC AAGCTGTTCG CGCAGAACAC CCCCGGTGGC GCCGCGGCCT ACATCAACTC CCTGGGTCTG GATTTCCAAC TGGGTTCGGC CAGCGGCACG GTGACCGTGT CCGGGTCGGC CTTCCTCGTG TTGTCCAAGC TGTATGCCAG CGGCACGGCC TCCCGGGTCC AGGTCACGCT CGGCGGCAGC CTGGGGGTCA GCGCGCAGGG GGCGGTGTCG GTCTCCTTGC AGTTCGACAT CAAGGGTCTG GGCGGGCCGT GGACCGACGC GTTCGGCATC CCCGGTCTGT CGGTCAGTGA GGTCGCCGGC CGGATCGGGG TGAAGGACAG TCCGGAAACC GCCGGCATCC CCCTACCCAC GCTGGCCTTC AAGGTCGACA ACGTGCAGCT GCCCAAGGCC TGGAACGATG CCATCGGCAT CCAGGCCGGC GCCACCACCT CGTTGAATCT GGTGCTGGAC GTGAACAACC CGGTGCTCGG CTTCTCGATC GCCGGCCAGA CCCCGGATGC GGTCGCGCTC AAGCCCTTCA CCATCATCAA GTCGGTGGCC GGGAAGTCGA TCCCGTCGTC CTTGCCGGAT TCGGTGCAGG TCAAGACCGC GCAGCTGCTG TTCGCCCCGC TGGGCGGCAC CGACGCCGCG GGCAAGGCGA TCAACCCCGG GGCGACCCTG GTCTTCGACA GCGCCGTGGC CGGTGTACCC GTGCACGTGG ATGGCAACGT CGCCGTGCTG CCTTACCCGA GCCTGACCGC CGACGCGAGC GTCGGCAACT TCGCGGTCGG GCCGGTCACC CTGGCCAACA CCGACCTGAA GATCACGCTG TCCGCGGATC CGTCGAACCC GCAGGCGGAC TTCAGCTTCC ACGGCGGATT CACCGACAAG ACCAGCGGCA TCTCGTTCCT GGCCGGCATC GACGAGGGCG CCTCCGCCTC CCTGGCCAGC GCCGCGGTGT CGCTGCACAT CGCCGGTGGC CAGCCGCAGT ACCTGCAGGC GGCGGCGGAC CTGACCGGTT CGGTCTCGAT CTCGCCCAGG GACGGCAGCG TCTCCTTTGC GGCCTCGGGC AATGCCGCGG TCGCGGTGAA CGGGCTGAAC CTGGCGTCCG TGCCGTTCAG CTACTCGACC ACCAGCGGCG CGCTCTGGCA GCAGCTGCAG GGCAGCGCGG GCCAGGTGGC CCAGGCGTTC AAGAGCGCCT ACGGATGGAC CGATGCCCAG GCCGCGGCCG CGCTGAACAC CCTGCGGGCC ACGCCGGCCC AGATTGCCGG GGCCCTGCAA TCGGCCTACG GCGACGGCTC TGACGCGGTG ATGAAGGTCC TGCTCAAGAC CGGATTCAGC GTCGACACCA GCATCGCGAC GGTGAAGTCG ATCCTCGGGG CGGCGGACAA CCAGATTGCC AGCACCCTTG CTCAGCTGGG CTACCAGCAA ACCCAGATCG CGACCCTGCT GAACCGCTTC TACGGCGACG CCGACGCCCG GATCGCTTCC GTCCTGCTCG GTCTGGGCAA CACCGCGACC AGCGTCGCCG GGACCCTGCA CACCGTGTTC GGTGACACCG ACCGGCAGGT CGCCGTGGCC TTCCAGCAGA TCGGGGTGCC CGCCCAGACG ATCGAGGCCG CACTGACCAA TGCCTTCGGC GACGGGCAGG CGGCGATCTA CAACCTGATG ACCTCGATCG GCTCCGCCGG CACCGGCACC CTGGACGCGC TGGCCGGGGT GTTCAACTCC GGGGCCTACT CGCCGTCGGC GCACCCGTGG TGGTCGGTGC CGCTGCTGTG GGACGTGTCC AACGCCAGCA CCGCCGAGGG TGCGCCGGTC CTGCAGTGGA GCTGGAACGG CGGGCACAAC CAGCAGTGGT ACGTGCTGCC GACCGACGGC GGGTTCGCCG AGCTGGTCAA CCGCAACAGC GGCAAGTGCC TGGCCGCGCC GGGCTACAAC GCAGGCCAGC AACTCATCCA GGTGGCCTGC ACCGGCAACC CGGGGCAGCA GTGGTACCTG GGCGTGTACC CCGGGCAGAG CCTGACCGGC CAGACCAAGA CCGTCTGGAA CCGCGCGACC GGGCTGTACG CGGACGTCAG TGGGGCCAGC ACCGCGGCCG GTGCGGCCAT CGACCAGTGG TACTACAACG GCAACTGGAA CCAGCAGTGG TACTTCGGTC CCGCGGTCGG ATGA
|
Protein sequence | MSRRTTGDGP TALRALARRG LLSLVAASTA ATIALVPVAP ATATPVIVVP AAAVASAPIA DPGAAAPSAA PTTDPAPTPA PTPASTPAPA STPAPAPTPA PAPAAAVAPA PAAATATTTD PAPTGSPAPA PAAGSPSAST PASTAPSTVV PSSTFSLSPA ARSFLSALNL NLGSGPVTGT LDGSVLTVAV GAPAVAFPLP TPGQTVSFTG ATLTIDESTR TLTLTAAVTT GNGLGGSLSV SIANADATDL TGADLTATLD ITGISVLGTT VEVSGSLSAT GGRLAASLTG RLSSDGVVAD GVLTVKSGST ITLATDTGLS VNGTAVLGSG PTAFTVIVSG AIKDGRNWSL SVDNTAETPQ FSPVDGLSIS PAFAGTITDA NGTVTFDVAG DDTGSWQTGP ATLSLTHVEV SNAAPRDGLA CPAVDPGQVW FDIQGGVADP AAGIAGSAQA CVVPAAKAFQ ITASAPSITL PNAAHFSLDQ PSVSITGSGV GTAQAKVAVS AAATLTVAPD ADHTVHTPVT LSFADDGSFT ASAGIDLGAL GVGSGQGTLV LASKQIAQFD PTTVGATGQK FDLPAGVTLL FAYQPSGAVS AALKNLQLPA PKSIAARATL SDNGFEATAQ LQFGDRDQGA KLFAQNTPGG AAAYINSLGL DFQLGSASGT VTVSGSAFLV LSKLYASGTA SRVQVTLGGS LGVSAQGAVS VSLQFDIKGL GGPWTDAFGI PGLSVSEVAG RIGVKDSPET AGIPLPTLAF KVDNVQLPKA WNDAIGIQAG ATTSLNLVLD VNNPVLGFSI AGQTPDAVAL KPFTIIKSVA GKSIPSSLPD SVQVKTAQLL FAPLGGTDAA GKAINPGATL VFDSAVAGVP VHVDGNVAVL PYPSLTADAS VGNFAVGPVT LANTDLKITL SADPSNPQAD FSFHGGFTDK TSGISFLAGI DEGASASLAS AAVSLHIAGG QPQYLQAAAD LTGSVSISPR DGSVSFAASG NAAVAVNGLN LASVPFSYST TSGALWQQLQ GSAGQVAQAF KSAYGWTDAQ AAAALNTLRA TPAQIAGALQ SAYGDGSDAV MKVLLKTGFS VDTSIATVKS ILGAADNQIA STLAQLGYQQ TQIATLLNRF YGDADARIAS VLLGLGNTAT SVAGTLHTVF GDTDRQVAVA FQQIGVPAQT IEAALTNAFG DGQAAIYNLM TSIGSAGTGT LDALAGVFNS GAYSPSAHPW WSVPLLWDVS NASTAEGAPV LQWSWNGGHN QQWYVLPTDG GFAELVNRNS GKCLAAPGYN AGQQLIQVAC TGNPGQQWYL GVYPGQSLTG QTKTVWNRAT GLYADVSGAS TAAGAAIDQW YYNGNWNQQW YFGPAVG
|
| |