Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1771 |
Symbol | |
ID | 8447373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1940433 |
End bp | 1942676 |
Gene Length | 2244 bp |
Protein Length | 747 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645040897 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_003201150 |
Protein GI | 258651994 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.145856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.079989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAC CCGCCACGAC TTCCACCGTC AATGAGGCCG CGAGGCCGGC CGACCTCGGA TTGAACCACC CCGGTTCGGC GGTCCGGCAC CTGCGAGCCG CCGGGGTCAG CGTGGTGATC GACTGCCGCG GGCCGCGTCT ACCGCGCATC GTGCACTGGG GCGCCGACCT GGGCGATCTC GACCGGACCA CGCTGGACAA CCTGGTTCGA GCCGATGTGC AGGCCGTGGT CAGCAATGTT CCCGACCAAC CGATCAGCCC TGGACTGCTC GCCGAACACT CGACCGGCTG GTTGGGACTG CCCGGCCTGC TCGGACACCG TGCCGGCCAT CAGTGGTCGA CCCTGTTCGA CGTGACGACC GTCAGCTCGA GCAACGACCG GGACGGAACC CGACGCGTCG TCGTCGACGC AGCGGATCCG GACGCCGGCC TGCGCCTCGT GCTGACCCTG GAGCTGTTGC CGGCCGGGGT GCTTCGCACC CGAGCCACCC TGACGTCGAT CGCCGACCAG GCCGGCGCGC CCTACACCGT CGACGGGCTG GCCCTGTCGC TTCCGGTCCC GACCCGTGCC ACCGAACTGC TCGATTTCAC CGGCCGGCAC CTGCGGGAAC GGCATCCGCA GCGCCGCCGG TTCGATGTCG GCAGTTGGGT CCGCGACAAC CGCCGCGGCC GCACCGGAGC CGACGCCACG ATGCTGTTGG CCGCTGGCAC GCCCGGTTTC GGCTTCGGCC ACGGTGAGGT CTGGTCGGTG CACACCGCCT GGAGCGGCAA TCACCGCACG ATCGCCGAGC GCAGTCCGAA CGGACACGCC GTGCTGTCCG GCGGGGAACT GTTGCTGCCT GGCGAGATCA CCCTGGGGCC GAGCGAGTCG TACACCACGC CATGGATCTA CGGTTCCTAC GGCGGTGCCG GCCTGAACGC CGTGGCCGAC CGCTTCCACG CCTTCCTGCG GGGGCGGCCG CAGCATCCCC GCAGCCCGCG TCCGGTCATC CTGAACACCT GGGAATCGGT GTACTTCGAC ATGGACTTGC CGACGCTGAT CGCGCTGGCC GAGGCCGGCG CCGAGGTGGG TGTCGAACGG TACGTGCTGG ACGATGGCTG GTTCACCGGC CGCCGCGACG ACACCGCCGG GCTGGGCGAC TGGCAGGTCG ACCGCGACGT CTGGCCCAAT GGCCTCAAGC CCTTGGTGGA TCGGGTCACC AAGCTCGGCA TGCAGTTCGG CATCTGGATT GAACCCGAGA TGATCAATCC CGACTCCAAT CTCGCCCGCG CGCACCCGGA ATGGATGCTG TCCACCGGGC ACCGGCTGCC CATCGAATCA CGGCACCAGC AGGTGCTGGA TCTGGCCAAT CCGGGCGCGT TCGACTACAT CCTGGGCAGC CTGGACGACC TGCTCAAGAA GCACGACATC AGTTACCTCA AGTGGGACCA CAACCGGGAT CTGATCGACG CCGGGCACAG TCCGGACGGT CAGCCGGCCG TGCACGACCA GACCCTGGCC GTGTACCGGC TGATTGACGA ACTGCGCCGC CGCCACCCCG GGGTCGAGAT CGAGTCATGC TCCTCCGGCG GCGCCCGAGT CGATCTGGAG GTGCTGCAGC GCACCGATCG GGTTTGGACC AGCGACTGCA TCGACGCCCT GGAACGGCAG ACGATCCAGC GCTACACCGG GCTGCTGGTG CCGCCGGAGA TGCTCGGCGC CCACATCGGC ACCGGCCAGG CGCACACCAC CGGACGCCGG CACAACCTGT CCTTCCGGGC CGGCACCGCC CTGTTCGGCC ACATGGGAAT CGAGGCCAAC CTGACCAGCA TGTCGGCCGC CGAGCGAGCT GAACTGGGCG AATGGGTTGC CCTGCACAAG AAGCTGCGGC CGCTCCTGCA CACCGGCCGG GTGGTCCGTT TCGACGACGT CGACCCCTCG CTGATGGTGC AGGGTGTCTA CGCGGCCGAC CTTTCCCAAG CCGTGATCAG CATCGCCGCG ATCGCGACCG CCGACAGCGC CCCGATCGGC CGGCTCGTCA TTCCCGGCCT GGATCCGGAC GCGCCCTATC ACCTGGAGCT GCTGCCGCCC GGCGATGTCA TCCAGGGCGA GGACGCCGAG CGCCGCAACG GCAACAACAA GCACCTGCCA CCGTGGCTCG TCACCGGGAC CGACCTGACC GGCGCGGCGC TGACCTACGC CGGGGTGCAG CTGCCCGATC TGCTTCCCGA GCAGCTGCTG CTGCTGCGCG TCACCCGGGT ATGA
|
Protein sequence | MTSPATTSTV NEAARPADLG LNHPGSAVRH LRAAGVSVVI DCRGPRLPRI VHWGADLGDL DRTTLDNLVR ADVQAVVSNV PDQPISPGLL AEHSTGWLGL PGLLGHRAGH QWSTLFDVTT VSSSNDRDGT RRVVVDAADP DAGLRLVLTL ELLPAGVLRT RATLTSIADQ AGAPYTVDGL ALSLPVPTRA TELLDFTGRH LRERHPQRRR FDVGSWVRDN RRGRTGADAT MLLAAGTPGF GFGHGEVWSV HTAWSGNHRT IAERSPNGHA VLSGGELLLP GEITLGPSES YTTPWIYGSY GGAGLNAVAD RFHAFLRGRP QHPRSPRPVI LNTWESVYFD MDLPTLIALA EAGAEVGVER YVLDDGWFTG RRDDTAGLGD WQVDRDVWPN GLKPLVDRVT KLGMQFGIWI EPEMINPDSN LARAHPEWML STGHRLPIES RHQQVLDLAN PGAFDYILGS LDDLLKKHDI SYLKWDHNRD LIDAGHSPDG QPAVHDQTLA VYRLIDELRR RHPGVEIESC SSGGARVDLE VLQRTDRVWT SDCIDALERQ TIQRYTGLLV PPEMLGAHIG TGQAHTTGRR HNLSFRAGTA LFGHMGIEAN LTSMSAAERA ELGEWVALHK KLRPLLHTGR VVRFDDVDPS LMVQGVYAAD LSQAVISIAA IATADSAPIG RLVIPGLDPD APYHLELLPP GDVIQGEDAE RRNGNNKHLP PWLVTGTDLT GAALTYAGVQ LPDLLPEQLL LLRVTRV
|
| |