Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3949 |
Symbol | |
ID | 4598084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4161334 |
End bp | 4162689 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639778554 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_925133 |
Protein GI | 119718168 |
COG category | [R] General function prediction only |
COG ID | [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) |
TIGRFAM ID | [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.334602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACG ACCAGGTCAT GTTCACGCTG GACGAGTCCG AGCAGGTCAC CCACTGGTAC AACATCGTGG CGGACCTGCC GACGCCGCCC CCGCCGCCGC TCCACCCCGG CACCCACGAG CCGGTCGGAC CCGACGACCT GGCCGCACTG TTCCCGATGG AGCTGATCCT CCAGGAGGTC AGCGGCGAGC GGTACGTCGA GATCCCGGAG CCGGTGCGCG AGGTCTATCG CCAGTACCGG CCCAGCCCGC TCTACCGCGC CCGCCGCTGG GAGCAGAAGC TCGGCACCAC GGCGCGGATC TACTACAAGT ACGAGGGCGT CTCGCCCGCC GGGTCGCACA AGACCAACAC CTCGGTGCCG CAGGTCTACT ACAACTCCCT GCACGGGGTG CGCCGGCTCA CGACCGAGAC GGGCGCCGGC CAGTGGGGCA CCGCGCTGGC CTACGCGTGC TCCCTGTTCG ACGTGGAGTG CGAGGTCTGG CAGGTCGGCG CGTCGTACGA CTCCAAGCCG CAGCGCCGGA CGCTGATCGA GGTGTTCGGC GGCTCCGTGC ACCGCTCCCC GAGCCGGCTC ACCGAGTCCG GCAAGGCGTT CCCCGACGGA CACCCGGGGT CACTGGGCAT CGCGATCTCC GAGGCGGTCG AGGTCGCCGC CCAGGACGAG ACGACCAAGT ACGCCCTCGG CTCGGTGCTC AACCACGTGC TGCTGCACCA GACCGTGATC GGCGAGGAGA CGCTGCGCCA GCTGGCCAAG GCCGGTGAGT CCGGCGCCGA CCTCGTGGTC GGCTGCGCCG GCGGCGGGTC GAACTTCGCC GGCCTCGCCT TCCCGTTCCT GCGCGAGAAG CTGGCCGGCA CGCAGGCTCC CCGGATCCTG GCGGTCGAGC CCACGTCCTG CCCGACCCTG ACCCGGGGCG AGTACCGCTA CGACTTCGGC GACACCGCCG GGCTGACACC CTTGATGAAG ATGTACACGC TCGGCCACGA CTTCGTGCCG TCGCCGATCC ACGCGGGCGG GCTGCGCTAC CACGGCATGG CGCCGCTGGT CTCGCACGCC GTGCACGAGG GCCTGATCGA GGCCACCGCG CTGCACCAGA GCGAGTGCTT CGAGGCTGGC CTGGAGTTCG CCCGCACCCA GGGCATCGTC GCGGCGCCCG AGTCCTCGCA CGCGCTGGCC CAGGCCCGCC GCGAGGCGCT CGCCGCGACC GAGTCCGGGG CGGAGCCGGT GATCGTCGTC GGGCTCTCCG GGCACGGCCT GCTCGAGCTC GGTGCCTACG AGTCGTTCCT CTCCGGCCAC CTCGAGGACG ACCCGCTGTC CGACGCGGAC CTCACCGCGG CGCTGGCGGG CATCCCGCAG GTCTGA
|
Protein sequence | MNDDQVMFTL DESEQVTHWY NIVADLPTPP PPPLHPGTHE PVGPDDLAAL FPMELILQEV SGERYVEIPE PVREVYRQYR PSPLYRARRW EQKLGTTARI YYKYEGVSPA GSHKTNTSVP QVYYNSLHGV RRLTTETGAG QWGTALAYAC SLFDVECEVW QVGASYDSKP QRRTLIEVFG GSVHRSPSRL TESGKAFPDG HPGSLGIAIS EAVEVAAQDE TTKYALGSVL NHVLLHQTVI GEETLRQLAK AGESGADLVV GCAGGGSNFA GLAFPFLREK LAGTQAPRIL AVEPTSCPTL TRGEYRYDFG DTAGLTPLMK MYTLGHDFVP SPIHAGGLRY HGMAPLVSHA VHEGLIEATA LHQSECFEAG LEFARTQGIV AAPESSHALA QARREALAAT ESGAEPVIVV GLSGHGLLEL GAYESFLSGH LEDDPLSDAD LTAALAGIPQ V
|
| |