Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3074 |
Symbol | |
ID | 8448688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3390474 |
End bp | 3391382 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645042156 |
Product | tryptophan synthase, alpha subunit |
Protein accession | YP_003202397 |
Protein GI | 258653241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0159] Tryptophan synthase alpha chain |
TIGRFAM ID | [TIGR00262] tryptophan synthase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0000185478 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000119928 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCAGCC AGCCGACCAG CCAGCCGCCC AGCCAGCCGA CAGTCCTGGC CGAGCTGTTC GCCGGCCTGC GGGCGCAGAA CCGGGCCGCC CTGGTGGGCT ACCTGCCGGC CGGCTACCCG GACCTGGCCG CGTCCAAGGA CCTGTACGCC GCGATCATCG AGGGGGGCTG CGACCTGGTC GAGGTCGGCC TGCCGTTCTC CGACCCGGTC CTGGACGGCC CGGTCATCCA GCACGCCGCC CAGCAGGCCC TGGCCGGCGG CTTCCGGGTC CGCGACACCT TCGAGATCGT TGAGTCGATC ACCGCCCGCG GTGGCCGCGC CGTCGTGATG ACCTACTTCA ACCCGGTGCT GGCCTACGGG GTCGACGCGT TCGCCCGGGA CCTGGCCGCG GCCGGGGGAG CGGGCGTGAT CACCCCCGAC CTGATCGTCG ACGAGGCCGG CCCCTGGCTG GACGCGGTCC ATGCGCACGG CATCGACCCG ATCTTCCTGG TCGCCCCGTC GTCGTCGGCG GAGCGGATCG CGCTCACCGC GGCCTCCGGC GGGGGCTTCG TCTACGCCGC CTCGGTGATG GGCGTGACCG GGGCCCGGGA CCAGGTGTCC AGCGCGGCGC CCGATCTGGT CGCCCGCTGC CGCACCGTGA CCGACCTGCC GATCGGCGTC GGGCTGGGCG TACGCACCGG CGAGCAGGCC CGGCAGATCG CCGAGTACGC CGACGCGGTG ATCGTCGGCA GCGCGTTCCT GGACGCCTAC GCCCGCGGTG GCCGGGACGC CGCCGCCGCG CTGGCCGGCG AGTTCGCCGC CGGCATCCGG GCCGCTCGCG CCGATGGAAC CGCCCGCGCC GACGGAACCG CCCGCGCTGA TCGAAGTGCC GATTCTGCGC CCGTTTCGTC CGGTTCCGTG GCGCAATGA
|
Protein sequence | MTSQPTSQPP SQPTVLAELF AGLRAQNRAA LVGYLPAGYP DLAASKDLYA AIIEGGCDLV EVGLPFSDPV LDGPVIQHAA QQALAGGFRV RDTFEIVESI TARGGRAVVM TYFNPVLAYG VDAFARDLAA AGGAGVITPD LIVDEAGPWL DAVHAHGIDP IFLVAPSSSA ERIALTAASG GGFVYAASVM GVTGARDQVS SAAPDLVARC RTVTDLPIGV GLGVRTGEQA RQIAEYADAV IVGSAFLDAY ARGGRDAAAA LAGEFAAGIR AARADGTARA DGTARADRSA DSAPVSSGSV AQ
|
| |