Gene Information |
Locus tag | Noca_4175 |
Symbol | |
ID | 4596689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4410986 |
End bp | 4412767 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639778781 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_925359 |
Protein GI | 119718394 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.296619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGA CCGTGGCCGA CCAGCTGCTC GCCCGCCTGC GGGAGTGGGG GGTGGCGCAG GTCTTCGGCT ATCCCGGCGA CGGGATCAAC GGGATCCTCG GCGCGTTCTC CCGCGCCGAC GACCAGCCGC GCTTCATCCA GTCCCGCCAC GAGGAGATGA GCGCGTTCCA GGCGGTGGGC TACGCGAAGT TCTCCGGCCG CCCCGGCGTC TGCATGGCGA CCTCCGGGCC CGGCGCGATC CACCTGCTCA ACGGCCTCTA CGACGCGAAG CTAGACCACG TCCCCGTGGT GGCGATCGTC GGGCAGACCA ACCGCACCGC GATGGGCGGC AGCTACCAGC AGGAGGTCGA CCTGATCAGC CTGTTCAAGG ACGTCGCCGG CGACTACGTG CAGATGGTGA CCGTCCCCGA GCAGCTGCCG AACGTGCTGG ACCGGGCGAT CCGGGTCGCG ACCGCCCGCC GCGCGCCGAC CGCGATCATC GTGCCCAACG ACGTCCAGGA GCTGGAGTAC GCCGCCCCGC AGCACGCGTT CAAGATGGTG CCCTCCAGCC TCACCCCGAC CCAGCGGCCC ACGGTCACCC CGGACCCCGC GGCGCTGCGC CAGGCCGCCG ACGTCTTGAA CTCCGGCCGC AAGGTCGCGC TGCTGGTCGG CCAGGGCGCC CGCGGCGCGG ACGCCGAGAT CGCCGAGGTG GTGGACCTGC TGGGCGCCGG GGTCGCGAAG GCGCTGCTCG GCAAGGACGT GCTCAGCGAC GAGCTGCCCT GGGTGACCGG CTCGATCGGC CTGCTCGGCA CCCGCCCGAG CTACGAGCTG ATGATGGGCT GCGACACGCT GCTGACCGTC GGCTCGAGCT TCCCCTACAC GCAGTTCATG CCGGAGCTGG ACCAGGCCCG TGCCGTGCAG ATCGACCTCG ACGGCACGAT GATCGGGATG CGCTACCCCT ACGAGGTCAA CCTCGTCGGC GACGCGCAGG CCACGCTGCG CGCGCTGATC CCGCTGCTGG AGAAGCAGCA GGACCGCTCC TGGCACGACG AGATCTGCGC GAACGTCACC GACTGGTGGG AGGTGATGGA CGCCGAGGCC CACGTCGCGG CCGACCCGGT CAACCCGATG CGGATCTTCA ACGAGTTCTC GAAGGTCGCG CCGACCGACG CGATCATCTC CTCCGACAGC GGCTCGGCCG CGAACTGGTA CGCCCGGCAC GTCAAGATGC GCGGCCGGAT GCGCGGCTCG CTGTCGGGCA CGCTCGCGAC GATGGGCCCG GCGGTGCCGT ACGCGATCGG CGCCAAGTTC GCCCACCCCG ACCGGCCCGC GATCGCCTTC GAGGGCGACG GCGCGATGCA GATGAACGGG CTGGCCGAGC TGCTCACGAT CGCCCGCTAC TGGCCGGAGT GGGCCGACCC GCGGCTGGTG GTCGCCGTAC TCCACAACAA CGACCTCAAC CAGGTCACCT GGGAGCTGCG CGCGATGGGC GGGACGCCCA CCTTCGTGGA GTCCCAGGCG CTGCCGGACG TCTCGTACGC CGACTTCGCG CGCTCGTGCG GCCTGGGCGC GACGACCGTG ACCGACCCCG ACCAGCTCGC CGACGCGTGG CAGGTCGCGC TGTCCTCGGA CCGCCCGCAC CTGCTCGACG TGCACTGCGA CCCCGACGTG CCGCCGATCC CGCCGCACGC CACCCTCGAG CAGATGACCG CGATGGCCAA GGCGCTGATC AAGGGCGACA CCAGCCGCTG GGGCGTGATG AAGGAGGGCA TCAGGACCAA GGCCCAGGAG CTGCTGCATT GA
|
Protein sequence | MTTTVADQLL ARLREWGVAQ VFGYPGDGIN GILGAFSRAD DQPRFIQSRH EEMSAFQAVG YAKFSGRPGV CMATSGPGAI HLLNGLYDAK LDHVPVVAIV GQTNRTAMGG SYQQEVDLIS LFKDVAGDYV QMVTVPEQLP NVLDRAIRVA TARRAPTAII VPNDVQELEY AAPQHAFKMV PSSLTPTQRP TVTPDPAALR QAADVLNSGR KVALLVGQGA RGADAEIAEV VDLLGAGVAK ALLGKDVLSD ELPWVTGSIG LLGTRPSYEL MMGCDTLLTV GSSFPYTQFM PELDQARAVQ IDLDGTMIGM RYPYEVNLVG DAQATLRALI PLLEKQQDRS WHDEICANVT DWWEVMDAEA HVAADPVNPM RIFNEFSKVA PTDAIISSDS GSAANWYARH VKMRGRMRGS LSGTLATMGP AVPYAIGAKF AHPDRPAIAF EGDGAMQMNG LAELLTIARY WPEWADPRLV VAVLHNNDLN QVTWELRAMG GTPTFVESQA LPDVSYADFA RSCGLGATTV TDPDQLADAW QVALSSDRPH LLDVHCDPDV PPIPPHATLE QMTAMAKALI KGDTSRWGVM KEGIRTKAQE LLH
|
| |
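The coordinate, length, and GC fields in this record are related by simple arithmetic, and the following minimal Python sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA); both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical helpers and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-letter display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Values taken from the record above.
print(gene_length(4410986, 4412767))  # 1782, matches "Gene Length"
print(protein_length(1782))           # 593, matches "Protein Length"
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the rounded GC content listed in the record.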