Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3873 |
Symbol | |
ID | 8014696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3943617 |
End bp | 3944726 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826443 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_002977655 |
Protein GI | 241206559 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0584274 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTG AGATGAGCAA GCCCGTTCCG CGTCCCGGTA TTCTCGATAT CGCAGCCTAT GTGCCGGGCA AGGAACATGC GCCGGGTGTT GCCCGCGTCT ACAAGCTTTC GTCCAACGAA ACGCCGCTCG GCGCCAGCCC GAAGGCAATC GAAGCCTTCA AGACGGTTGC CGACAATCTG GGGCGTTATC CTGACGGGCA GGCGATCGAA CTGCGTGAGG CGATTGCCGC CGTGCACGGC CTCAATCCGG CAAACATTCT CTGCGGCAAC GGTTCCGACG AACTGCTCGG CTTGCTCTGC CATGTCTATC TCGGTGCCGG CGACGAGGGC ATCATCACCG AGCACGGCTT CCTCGTCTAC AAGATCCAGA TCCTGGGCGC CGGCGCCACG CCTGTTGTCG TCAAGGAGAA AGACTATACC GTCGATGTCG ATGCGATCCT TGCCGCGGTG ACCGAGAAGA CGAAGATCGT CTTCATCGCC AATCCCGGCA ATCCAACCGG CACCTATGTT TCCGTCAGCG AGATCCGCCG CCTTCAGGCC GGACTGCCGA AACATGTCGT CCTCGTGCTC GATGCCGCCT ATGCCGAATA TGTGCGCCGC AACGATTATG AAGCCGGCAT CGAGGTCGTC TCCTCCAATG CCAACGTGGT GATGACCCGC ACCTTCTCGA AGGCTTATGG CCTTGCGGCG CTGCGCGTCG GCTGGATGTA TGCGCCCGCC GAGATCGTCG ATGCGCTGAA TCGCGTGCGC GCGCCGTTCA ACTTGAACGC GCCGGCAATC GCCGCCGCTG CCGCTGCCAT CCGCGACCAG GCCTTCATCC AGCAGGCCGT CTCCTTCAAT CAGATGTGGG TCGAGACGCT CACCCAGGCA CTCGAAGCGA TCGGGTTGAA GGTGACGCCG TCCGTCGCCA ATTTCGTCCT CATTCATTTC CCCGAGATCG ACGGCAAGCG CGCCGCGGAT GCCGATGATT TGTTGACGAG CCGCGGCTAC ATCCTGCGCG CCGTGCGCGG CTATGGTTTC GCCAATGCGC TGCGCATGAG CATCGGCCCC GAAGAGGCCA ACCGCGGCGT GATTGCCGCG CTCACCGAAT TCATGGGTCA TCAGGCATGA
|
Protein sequence | MSVEMSKPVP RPGILDIAAY VPGKEHAPGV ARVYKLSSNE TPLGASPKAI EAFKTVADNL GRYPDGQAIE LREAIAAVHG LNPANILCGN GSDELLGLLC HVYLGAGDEG IITEHGFLVY KIQILGAGAT PVVVKEKDYT VDVDAILAAV TEKTKIVFIA NPGNPTGTYV SVSEIRRLQA GLPKHVVLVL DAAYAEYVRR NDYEAGIEVV SSNANVVMTR TFSKAYGLAA LRVGWMYAPA EIVDALNRVR APFNLNAPAI AAAAAAIRDQ AFIQQAVSFN QMWVETLTQA LEAIGLKVTP SVANFVLIHF PEIDGKRAAD ADDLLTSRGY ILRAVRGYGF ANALRMSIGP EEANRGVIAA LTEFMGHQA
|
| |