Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5385 |
Symbol | |
ID | 6978479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1021802 |
End bp | 1023430 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394487 |
Product | hypothetical protein |
Protein accession | YP_002279305 |
Protein GI | 209547387 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.366398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.452356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGA AGGAAACGAC CGGCCAGGCG ATGACCCGTT CGCTCGTCGC CCATGGGATC GACACGGCGT TCGGCATCCC GGGCGCCCAC ATGTACGATT TCAACGATGC CCTCTACGGC GCCCGTGACC AGGTTCGGTT CATTCACACA AGGCACGAAC AGGGCGCGGG ATATATGGCC TATGGCTATG CCAAATCCAC CGGCCGGATC GGCGCCTATA CCGTGGTCCC CGGCCCGGGT GTTCTCAATT CCGGGGCAGC GCTTTGCACC GCTTACGGCG CCAACGCGCC GGTGCTTTGC ATCACCGGCA ACATCATGTC CCACCTGATT GGCCAGGGCA GAGGGCAGTT GCACGAACTG CCCGATCAAC TCGCGACAAT GCGCGGGCTT ACCAAGACGG CGGAACGCAT CAATCATCCG TCCGAGGCCG GCCCTGTCAT GGCCGAGGTT GTTAATAAAA TGCTCTCCGG CCGCCAGGGT CCGGGAGCCG TCGAAGCGCC GTGGGACGTG TTTGGCCAAT CCGGCCCCGA AGTCGATCTT CCCTTAGGCA AGAGAGCGCC TCACCCGGCC GTCAACGCCG ATCAGATCGC TGCCGCAGCA GCGCTGATAG CGGGCGCCAG CAATCCGATG ATCATGGTCG GCGGCGGTGC GGTGGATGCC GGCGCCGAGA TCGCCGCCCT TGCGGAACTG CTGCAATCGC CCGTCACCTC CCATCGTTCC GGCAAGGGCA TCGTCGCCGA CGACCACCCG AACTATCTGA ACTTCGTCGC CGCCTACGAA TATTGGAAGA AGACCGACGT TCTGATCGGC ATCGGCAGCC GGCTCGAACT GCAGTTCATG CGTTGGAAAT GGCTTCCGAA GGATCTCAAG ATCATCCGCA TCGATATCGA TCCGACCGAA ATGGTCCGCC TCAAGCCCGG TGTCGGCATC GTCGCCGATG CTTCGGCGGG AACGCAGGCT CTGATCGATG CACTGGCAGG CGCCCGCCGT GAGGATCGCA CCCGCGAATT TGCCGACCTC AACAGAGACG CCAGATCCCG GTTCTCGGAA GTACAGCCGC AGCTTGCCTA TCTCGATGCC ATTCGTCAGG CTCTGCCGAG AGATGGCTTC TTCGTCGAGG AAGTCAGCCA GATGGGCTTT ACCGCCCGTT TTGCCTTCCC CGTCTACGGC CCGCGTCAGT ACGTGACATG CGGTTATCAG GACAATCTGG GTTTTGGCTT CAACACCGCC TTGGGCGTGA AGGTAGCCAA TCCCGATAAA GCGGTTGTTT CCGTTTCCGG CGATGGCGGC TTCATGTTCG GCGTTCAGGA GCTTGCCACC GCCGTCCAGC ACAGGATCGC TGTCGTCGCC ATCGTCTTCA ACAATTCGGC CTATGGCAAT GTCCTGCGCG ACCAGAAGCA GGCGTATCAT GGCCGCTATC TTGGCTCGGA CCTGACCAAT CCGGATTTTG TCGCGCTTGC CGAAAGCTTC GGTATCCGCG CATTCAAGGT GATAAGCCCC GTCGAACTCA AAGAGACGAT CGAGAAAGCC CTCTCTCTCG ACGAACCCGT GCTCATCGAG GTTCCGATCG AAAAGGGATC CGAGGCAAGT CCCTGGCCTT TCATTCATCC GGCGCCGCAC GCCGAATGA
|
Protein sequence | MSKKETTGQA MTRSLVAHGI DTAFGIPGAH MYDFNDALYG ARDQVRFIHT RHEQGAGYMA YGYAKSTGRI GAYTVVPGPG VLNSGAALCT AYGANAPVLC ITGNIMSHLI GQGRGQLHEL PDQLATMRGL TKTAERINHP SEAGPVMAEV VNKMLSGRQG PGAVEAPWDV FGQSGPEVDL PLGKRAPHPA VNADQIAAAA ALIAGASNPM IMVGGGAVDA GAEIAALAEL LQSPVTSHRS GKGIVADDHP NYLNFVAAYE YWKKTDVLIG IGSRLELQFM RWKWLPKDLK IIRIDIDPTE MVRLKPGVGI VADASAGTQA LIDALAGARR EDRTREFADL NRDARSRFSE VQPQLAYLDA IRQALPRDGF FVEEVSQMGF TARFAFPVYG PRQYVTCGYQ DNLGFGFNTA LGVKVANPDK AVVSVSGDGG FMFGVQELAT AVQHRIAVVA IVFNNSAYGN VLRDQKQAYH GRYLGSDLTN PDFVALAESF GIRAFKVISP VELKETIEKA LSLDEPVLIE VPIEKGSEAS PWPFIHPAPH AE
|
| |