Gene Rleg_6104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6104 
Symbol 
ID8016061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp140574 
End bp142379 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content66% 
IMG OID644827410 
Productallophanate hydrolase 
Protein accessionYP_002978610 
Protein GI241258726 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02713] allophanate hydrolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0470749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.373295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCGA CCATCCTCGA TCTCTCAAGC CTTCGCGCCG CCTATCAATC CGGCCTGACG 
CCGCTCGACG CCATCGAAGA GGTGATCGCG CGGCGTGCCG CCTCGAAAGA TCCGGCAATC
TTCATCACCC CGGTGCCGGA TGACGAGCTG CGCGCGGCCG CAAAAGTGCT GATGGCACGC
GCGCCCGAGG CAAACAGCCT GCCGCTTTGG GGCGTGCCCT TCGCCGTGAA AGACAATATC
GATGCCGCCG GCCTGCCGAC GACGGCCGCC TGCCCGGCAT ACGAATATCG GCCGGAAGCG
GACGCTACCG TCGTTGCGCG GCTGAAGGCA GCCGGCGCCA TCATCATCGG CAAGACCAAT
CTCGACCAGT TCGCGACCGG CCTCAACGGC ACGCGCTCGC CCTATGGCGC GCCGCGTTCG
GTCTTCGATG CGGCCTATAT CTCCGGCGGA TCGAGTTCCG GATCATCAGT CACGGTGGCC
TCCGGCCTCG CGGCCTTTGC GCTCGGCACC GATACGGCAG GCTCCGGCCG CGTGCCTGCC
GCCTTCAACA ATCTGGTCGG CATCAAGCCG ACGCCGGGCC TTGTGCCGAA TACCGGCGCG
GTTCCGGCCT GCAAGAGCGT CGACTGCATC ACGATTTTTG CGGCGACCGT CGGCGACGGT
GTTGCGATCC GCAAGGTCGC CGAAGGCTTC GATGGCGCCG ATGCTTTCTC GCGTCACGCC
AAGCCGGCGA ACCTGCCGGT ATCGGGCTTG CGCATCGGCG TTCTCACCGA TGCCGAGCGG
GAATTCTTCG GTGACAAGGA GGTGGAGGCG CTCTACGACC AGGCGATCGA GCGGGCCAAA
GCGCTCGGAG CGACCATCGT GCCTTTCGAT TACGCGCCAT TCCGCGAGGC CGCCGCCCTC
CTCTATGACG GGCCGTGGGT CGCCGAGCGT CTGGCGGCGG TCGAGACCTT CCTCGCCACC
AACGCCGCCG ATTTCGACCC GACGGTCAGA GGGATTATCG AAGGCGCCAA GGGCAAGACC
GCGGTCGAGG CCTTTAACGG CCGATACCGG CTGGAGGAGC TGCGCCGCAA GACCGAAGCC
GAATGGGAAA AGGCGGACGT GCTTCTGCTG CCGACCGCAC CGACCACCTA CACGGTCGCC
GACATGCTGG CCAATCCCGT CGTGCTCAAT GGCCGCCTCG GTCGCTACAC CAATTTCGTC
AACCTGCTCG ATTGCGCAGC GATCGCCGTT CCGGCCGGTT TCGGCAAAGA CGGCCTGCCG
GGCGGCGTCA CCGTCATTGC ACCTGCCTTC ACCGACGATG CCCTGGCCCC ACTTGCCGAT
GCGCTGCACC GCGCAGCAGC TTCCGGCATG GGCATCGACC GGCAGGCGGC AATACCGGAA
GCGAGCCGTG TCGTGCCTGG CGATGACGGT TTCATCGAAA TCGCGGTCGT CGGTGCGCAT
CTGACCGGCA TGCCGCTCAA TCACGAACTG GCAGGCTCGG GCGGGCGTCT GGTCAAGACC
TGCCGCACAT CAGGCGATTA TCGCCTCTTC GTTCTGCCCA ATACCATGCC GCCGAAACCG
GGGCTGCTGC GCGAACCCGG CCATAGGGGG CAGGGGCTGG AGGTCGAGGT CTGGGCACTG
CCGGCCGATG CTTTCGGCAG GTTCGTCCAG AAGATTCCGG CACCCCTCGG CATCGGTAAG
CTGACGCTCG AAGACGGTTC CAGCGTCTCC GGCTTCGTCT GCGAGGCCCA TGGGGTGAAA
GGTGCTGAGG AAATCACCGC ACTTGGCGGC TGGCGCAACT ATATCAGCGC CAAGCTGGCG
AGCTGA
 
Protein sequence
MLPTILDLSS LRAAYQSGLT PLDAIEEVIA RRAASKDPAI FITPVPDDEL RAAAKVLMAR 
APEANSLPLW GVPFAVKDNI DAAGLPTTAA CPAYEYRPEA DATVVARLKA AGAIIIGKTN
LDQFATGLNG TRSPYGAPRS VFDAAYISGG SSSGSSVTVA SGLAAFALGT DTAGSGRVPA
AFNNLVGIKP TPGLVPNTGA VPACKSVDCI TIFAATVGDG VAIRKVAEGF DGADAFSRHA
KPANLPVSGL RIGVLTDAER EFFGDKEVEA LYDQAIERAK ALGATIVPFD YAPFREAAAL
LYDGPWVAER LAAVETFLAT NAADFDPTVR GIIEGAKGKT AVEAFNGRYR LEELRRKTEA
EWEKADVLLL PTAPTTYTVA DMLANPVVLN GRLGRYTNFV NLLDCAAIAV PAGFGKDGLP
GGVTVIAPAF TDDALAPLAD ALHRAAASGM GIDRQAAIPE ASRVVPGDDG FIEIAVVGAH
LTGMPLNHEL AGSGGRLVKT CRTSGDYRLF VLPNTMPPKP GLLREPGHRG QGLEVEVWAL
PADAFGRFVQ KIPAPLGIGK LTLEDGSSVS GFVCEAHGVK GAEEITALGG WRNYISAKLA
S