Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5572 |
Symbol | |
ID | 8016463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 154202 |
End bp | 156004 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644827739 |
Product | thiamine pyrophosphate protein domain protein TPP-binding |
Protein accession | YP_002978939 |
Protein GI | 241518311 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.307179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAATG CCGCTGACAT ATTGATAGAC ACACTGATTG AATGGGACGT GAAGGTCATC TTCGGCCTTC CTGGAGACGG TATTAACGGG GTCATGGAGG CGTTGCGCAA GCGCCAGGAC CAGATCCGCT TCATCCAGGT CCGGCATGAG GAATCGGCCG CTTTCATGGC GAGTGCCTAC GCTAAATTCA CCGGAAATCT CGGCGTTTGC CTTGCAACGT CCGGACCAGG CGGCACGAAT CTGCTGACGG GACTTTATGA TGCCAAGCTC GATCAGATGC CGGTGCTTGC CATCACCGGG ACGCAGTATC ATGACCTGAT CGAAACCTTT ACCCAGCAGG ACGTCGACTT GACCCGCGTG TTCGACAATG TCGCGCTCTA CAATGCGCAT GTCAGTGACG CGTCGCACAT GGAGAACGTC GCCAGCCTCG CCTGCCGCTC GGCGCTCTCG CGCCGGGGCG TCGCACATCT CTCCATCGCC AACGACGTCC AGGAGATGGA CGGGAAATCG CGATCGAAAC GGAACCGCCC GAAACACGTA CCGAACCGCT ATTTCCACGG CCGCCAAGTG CCAGAGGAGA TCGAGCTCGA TCGTGCGGCC CGGATACTGA ACGATGCCCG CAAGGTCGCC ATCCTCGCCG GCCGGGGTGT TGTCGGCGCG GCGGAGGAAT TGCGCGAGAT CGCCGACCTG CTCGCCGCGC CGGTGGCCAA AGCCCTGCTC GGCAAGACGG CTCTCGCCGA CGACGATCCC TTGACGACCG GCGGGATTGG AATTCTCGGC ACCGCTCCCT CGCAGGAGAT CATGGAACAG TGCGACGCGG TGCTGATTGT CGGTTCCTCC TTTCCCTACA TCGAATATTA CCCGCGGCCC GAAGCCGCTC GCGGCGTGCA GATCGACAGC GATCCTCAGC GAATTGGCCT ACGATTTCCC GTCGATGCCG GTCTGGTGGG GGACGCGCGG GAAACGCTAC GGCTCCTTAG GCCTAGGCTG ACCAGGAAAA CAGATCGGTC GTTCCTGGAA AAGGCGCAAA CGGCGATGTC CGAATGGCGC CGAAAGATGG AGACGATGGA GACGGAGCGG AGCGCTCCGT TAAAGCCACA GGCGGTGGTT CGAGCTTTCG GCAGGCGCAT CGCGGCCGAT GGCGTGCTGG TGGCCGATTC CGGCCAGAAC ACCGAGCTGG CGGCGCGCCA CATCGACCTC CGCACGACAA ACCAGTTCGC AGTCTCCGGC GCGCTCGCCT CGATGGCTTC CGGCCTGCCT TATGCGATAG CGGCGGGCGC CGCCGATCCG AAGCGGCCAA TCTACGCGGT CATTGGCGAT GGCGGATTCG GCATGCAGCT CGGCGAGTTC TCGACCGCCG TTCGCATGGC TCTCCCGCTG AAATTGCTGG TGATCTGCAA CGGCATGCTC AACCAGATCG CCTGGGAGCA GATGATGTTC CTTGGCAACC CGCAATTTGC CTGCGAACTG GCGCCCATCG ACTTCGCCAA GGCGGCGGAG GCAATGGGAG GCCGCGGCTT CACAATTCGG CGTTTCGATC AGATAGAGCC GACTTTGACG GAGGCTTTCT CCGTCGCCGG TCCGGTGGTC ATTCAAGCGA TGGTTGACCA ATACGAGCCG ATGATGCCTC CGAAGATGCC CAAAGACTAT GCCAAGAGCT TTCGCCAGGC TCTTCCGGAG ACGCCGGGTC ATCGGGCGAT CGAGGCGAAT GTGTCGCGCT CGCCGCTTCG CGAGATGATG GACGCCGGAG AGCAGGAAAA GGAACTGTCC GAAAACACGA CCGATTTGCC GGGCATACCT TGA
|
Protein sequence | MPNAADILID TLIEWDVKVI FGLPGDGING VMEALRKRQD QIRFIQVRHE ESAAFMASAY AKFTGNLGVC LATSGPGGTN LLTGLYDAKL DQMPVLAITG TQYHDLIETF TQQDVDLTRV FDNVALYNAH VSDASHMENV ASLACRSALS RRGVAHLSIA NDVQEMDGKS RSKRNRPKHV PNRYFHGRQV PEEIELDRAA RILNDARKVA ILAGRGVVGA AEELREIADL LAAPVAKALL GKTALADDDP LTTGGIGILG TAPSQEIMEQ CDAVLIVGSS FPYIEYYPRP EAARGVQIDS DPQRIGLRFP VDAGLVGDAR ETLRLLRPRL TRKTDRSFLE KAQTAMSEWR RKMETMETER SAPLKPQAVV RAFGRRIAAD GVLVADSGQN TELAARHIDL RTTNQFAVSG ALASMASGLP YAIAAGAADP KRPIYAVIGD GGFGMQLGEF STAVRMALPL KLLVICNGML NQIAWEQMMF LGNPQFACEL APIDFAKAAE AMGGRGFTIR RFDQIEPTLT EAFSVAGPVV IQAMVDQYEP MMPPKMPKDY AKSFRQALPE TPGHRAIEAN VSRSPLREMM DAGEQEKELS ENTTDLPGIP
|
| |