Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6104 |
Symbol | |
ID | 8016061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 140574 |
End bp | 142379 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644827410 |
Product | allophanate hydrolase |
Protein accession | YP_002978610 |
Protein GI | 241258726 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases |
TIGRFAM ID | [TIGR02713] allophanate hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0470749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.373295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCCGA CCATCCTCGA TCTCTCAAGC CTTCGCGCCG CCTATCAATC CGGCCTGACG CCGCTCGACG CCATCGAAGA GGTGATCGCG CGGCGTGCCG CCTCGAAAGA TCCGGCAATC TTCATCACCC CGGTGCCGGA TGACGAGCTG CGCGCGGCCG CAAAAGTGCT GATGGCACGC GCGCCCGAGG CAAACAGCCT GCCGCTTTGG GGCGTGCCCT TCGCCGTGAA AGACAATATC GATGCCGCCG GCCTGCCGAC GACGGCCGCC TGCCCGGCAT ACGAATATCG GCCGGAAGCG GACGCTACCG TCGTTGCGCG GCTGAAGGCA GCCGGCGCCA TCATCATCGG CAAGACCAAT CTCGACCAGT TCGCGACCGG CCTCAACGGC ACGCGCTCGC CCTATGGCGC GCCGCGTTCG GTCTTCGATG CGGCCTATAT CTCCGGCGGA TCGAGTTCCG GATCATCAGT CACGGTGGCC TCCGGCCTCG CGGCCTTTGC GCTCGGCACC GATACGGCAG GCTCCGGCCG CGTGCCTGCC GCCTTCAACA ATCTGGTCGG CATCAAGCCG ACGCCGGGCC TTGTGCCGAA TACCGGCGCG GTTCCGGCCT GCAAGAGCGT CGACTGCATC ACGATTTTTG CGGCGACCGT CGGCGACGGT GTTGCGATCC GCAAGGTCGC CGAAGGCTTC GATGGCGCCG ATGCTTTCTC GCGTCACGCC AAGCCGGCGA ACCTGCCGGT ATCGGGCTTG CGCATCGGCG TTCTCACCGA TGCCGAGCGG GAATTCTTCG GTGACAAGGA GGTGGAGGCG CTCTACGACC AGGCGATCGA GCGGGCCAAA GCGCTCGGAG CGACCATCGT GCCTTTCGAT TACGCGCCAT TCCGCGAGGC CGCCGCCCTC CTCTATGACG GGCCGTGGGT CGCCGAGCGT CTGGCGGCGG TCGAGACCTT CCTCGCCACC AACGCCGCCG ATTTCGACCC GACGGTCAGA GGGATTATCG AAGGCGCCAA GGGCAAGACC GCGGTCGAGG CCTTTAACGG CCGATACCGG CTGGAGGAGC TGCGCCGCAA GACCGAAGCC GAATGGGAAA AGGCGGACGT GCTTCTGCTG CCGACCGCAC CGACCACCTA CACGGTCGCC GACATGCTGG CCAATCCCGT CGTGCTCAAT GGCCGCCTCG GTCGCTACAC CAATTTCGTC AACCTGCTCG ATTGCGCAGC GATCGCCGTT CCGGCCGGTT TCGGCAAAGA CGGCCTGCCG GGCGGCGTCA CCGTCATTGC ACCTGCCTTC ACCGACGATG CCCTGGCCCC ACTTGCCGAT GCGCTGCACC GCGCAGCAGC TTCCGGCATG GGCATCGACC GGCAGGCGGC AATACCGGAA GCGAGCCGTG TCGTGCCTGG CGATGACGGT TTCATCGAAA TCGCGGTCGT CGGTGCGCAT CTGACCGGCA TGCCGCTCAA TCACGAACTG GCAGGCTCGG GCGGGCGTCT GGTCAAGACC TGCCGCACAT CAGGCGATTA TCGCCTCTTC GTTCTGCCCA ATACCATGCC GCCGAAACCG GGGCTGCTGC GCGAACCCGG CCATAGGGGG CAGGGGCTGG AGGTCGAGGT CTGGGCACTG CCGGCCGATG CTTTCGGCAG GTTCGTCCAG AAGATTCCGG CACCCCTCGG CATCGGTAAG CTGACGCTCG AAGACGGTTC CAGCGTCTCC GGCTTCGTCT GCGAGGCCCA TGGGGTGAAA GGTGCTGAGG AAATCACCGC ACTTGGCGGC TGGCGCAACT ATATCAGCGC CAAGCTGGCG AGCTGA
|
Protein sequence | MLPTILDLSS LRAAYQSGLT PLDAIEEVIA RRAASKDPAI FITPVPDDEL RAAAKVLMAR APEANSLPLW GVPFAVKDNI DAAGLPTTAA CPAYEYRPEA DATVVARLKA AGAIIIGKTN LDQFATGLNG TRSPYGAPRS VFDAAYISGG SSSGSSVTVA SGLAAFALGT DTAGSGRVPA AFNNLVGIKP TPGLVPNTGA VPACKSVDCI TIFAATVGDG VAIRKVAEGF DGADAFSRHA KPANLPVSGL RIGVLTDAER EFFGDKEVEA LYDQAIERAK ALGATIVPFD YAPFREAAAL LYDGPWVAER LAAVETFLAT NAADFDPTVR GIIEGAKGKT AVEAFNGRYR LEELRRKTEA EWEKADVLLL PTAPTTYTVA DMLANPVVLN GRLGRYTNFV NLLDCAAIAV PAGFGKDGLP GGVTVIAPAF TDDALAPLAD ALHRAAASGM GIDRQAAIPE ASRVVPGDDG FIEIAVVGAH LTGMPLNHEL AGSGGRLVKT CRTSGDYRLF VLPNTMPPKP GLLREPGHRG QGLEVEVWAL PADAFGRFVQ KIPAPLGIGK LTLEDGSSVS GFVCEAHGVK GAEEITALGG WRNYISAKLA S
|
| |