Gene Information |
Locus tag | Namu_3374 |
Symbol | |
ID | 8448989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3712750 |
End bp | 3714234 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042451 |
Product | glycosidase PH1107-related protein |
Protein accession | YP_003202691 |
Protein GI | 258653535 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
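The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Namu_3374.
length_bp = gene_length(3712750, 3714234)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the listed Gene Length (1485 bp) and Protein Length (494 aa). The gene sequence begins with GTG, an alternative start codon under translation table 11, which is consistent with the protein sequence beginning with M.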
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0000228446 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00648646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAACAGG TTCTCGCGCA TCGGCTGTCC GTCCGGCTCG AGCCGGATCC GGCGCGGGTG GTGGCCCGCC TGTTCCTGCC CGGCGAGCAG ACCCAGCACG AACGCTCGCG GGTCGGGGGC ATCGCCGACC GGGTGCTGGC CCTTCCCGAG GCGGACGTGC AGGTGCTGGC CGAGCAGGTG CTGCGGGACT TCTCCGCGCG GCACCGTGAC CTGCCGGCCA TCCTGGACGG GCACGCCGTG ATCATCGCCT CCCGGATCGG GGACACCCCG GTCTCGGCGG CGCGCATGCT GGTGCTGGGG GCCAGCGTCA CCGCCGAGTA CGCCACCGAG TCCGCCGCGC TGTGCAACCC GAGCGCCGTG CCGCACCCCG ATCAGTCCGG CCTGTTGGAC GGGCAGCTGC GGGTGGCGGT GAGCCTGCGG GCCATCGGGG AGGGACACAT CTCCGCGATC GCCTTCGGCA CCGCCGTCAT CGGGCCCGGT GCGCAGTGGG AGTTCCAGGA CCGGGAACGA CCGCTGGTCA CCGGCCGCAG CTCGCCGGCC CAGTGGCGCA ACAGCCAGTT GGCCGCGGTG CTGGCCGACC ACGGCGTCGT GGACACCCTG GGCGCGACCC TGCTGCACGA GCTGCCCGAC CTGTTCGACG TCGTCGATCT GGAGCGGGTG CTGGCCCATG CCCCCGGCGA CCTGCTGGCC CGCCTAGGTG GACCGGCCAC CATCGACCTG GTCCGCCGGG TGGTGTCCTC GGCCTACCGG GTCGAGTTCG ACGCCGACAC TGCGCTGGCC CAACGGATCC TGCAGCCCAA TGCGGCCGAG GAGAGCAACG GACTGGAGGA CGCGCGGTTC ACCCGCTTCG TCGACCCGGA CGGGGTGGTG GAGTACCGGG CGACCTACAC CGCCTACGAC GGCCACCAGA TCGCCCCGCG GCTGCTGATC AGCTCGGATC TGCGCGAGTT CAACGCCTAC CGGCTGGCTG GGTCGGCCGC CCGGAACAAG GGCATGGCCC TGTTCCCGCG GCTGGTCGGC GGGCGGCACC TGGCGTTGTG CCGCACCGAC GGCGAGAACA TCAGCCTGGC CTACTCCACG GACGGATTCC GCTGGTCCGA GCCGACCCTG CTGTACGGGC CGAGCCGGGC CTGGGAAGTG GTGCAGGTGG GCAACTGCGG CCCGCCGGTC GAGACCGAGC GCGGCTGGCT GGTGCTCACC CACGGGGTGG GTCCGATGCG CACCTACGCG ATCGGCGCCA TCCTGCTCGA CCTGGACGAC CCGTCCCGGG TGATCGGCTC GCTGCGCCAC CCCCTGCTGG AGCCGATTGA CGGCGAGCGG GACGGGTACG TGCCCAACGT GGTCTACTCC TGCGGCCCGG TCCGGCACGA CGGCCGGCTG TGGGTACCGT TCGGCATCGA CGACGCGCGG ATCGGCGTCG CCTGGCTCGA TCTGGACGAG CTGCTCGACG AACTCCTTGA CGGTGGGGTC ATCTCCGCGC TCTGA
|
Protein sequence | MKQVLAHRLS VRLEPDPARV VARLFLPGEQ TQHERSRVGG IADRVLALPE ADVQVLAEQV LRDFSARHRD LPAILDGHAV IIASRIGDTP VSAARMLVLG ASVTAEYATE SAALCNPSAV PHPDQSGLLD GQLRVAVSLR AIGEGHISAI AFGTAVIGPG AQWEFQDRER PLVTGRSSPA QWRNSQLAAV LADHGVVDTL GATLLHELPD LFDVVDLERV LAHAPGDLLA RLGGPATIDL VRRVVSSAYR VEFDADTALA QRILQPNAAE ESNGLEDARF TRFVDPDGVV EYRATYTAYD GHQIAPRLLI SSDLREFNAY RLAGSAARNK GMALFPRLVG GRHLALCRTD GENISLAYST DGFRWSEPTL LYGPSRAWEV VQVGNCGPPV ETERGWLVLT HGVGPMRTYA IGAILLDLDD PSRVIGSLRH PLLEPIDGER DGYVPNVVYS CGPVRHDGRL WVPFGIDDAR IGVAWLDLDE LLDELLDGGV ISAL
|
| |