Gene Rleg_1977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1977 
Symbol 
ID8013014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1977704 
End bp1978666 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content64% 
IMG OID644824564 
Producturea amidolyase related protein 
Protein accessionYP_002975796 
Protein GI241204700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0301643 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAG CCGTCCTTGC AATCAACTTC GCGGGTCCTC ACGTCGCGGT CCAGGACGGA 
GGACGCCACG GATTGATGCG TTACGGCGTT CCGGCCTCTG GTCCGATGGA TAGGATCTCG
TTTGCCGCCG CTAACGTCGC CGTCGGCAAT CCTGCCGGTC AGCCCGCAAT CGAAGTCTCC
ATGGGCGGCC TGGTTCTCGA CTGCCTGTCG GGAGACGTTA CGTTTGCCGT CGCCGGAGGA
GGCTTCATCG TAGAGCATGC CGGCGACAAG CGCGGCGCAT GGATGGTCGC CACGCTGAGG
GCCGGGGAGC GGCTGGCAAT CCGTCCGGGA CACTGGGGAA GCTGGACATA TCTGGCCTTC
GTCGGTCATA TCGAGGCGAA GACCTGGCTA GGCAGCATGT CGACGCACAG TCTCTCCGGT
CTCGGCGGTG GGCGCCTTAC GGCCGGTCAG ATGGTCACCG TCGCCGATCC GGAAGTGCGG
GATGATCGAC ATGGTCCGAT CACATGTCCA GTCATCGCAA GACCGCGATC CGAGCTGCGT
GTGGTGATCG GTCCACAGGA TCGGTTCTTC TCGAAAGAAA CCCTGTCGAA TTTCCTTTCG
TCGCCTTTCC GCCTGAGTGA CGCCTACGAC CGCATGGGCG TACGCCTACA AGGTCCGTCG
CTTGCGCCAA GCGTTGCGCT GGACATGCCG TCGGAAGCGA TCGTGCGGGG CTCGGTGCAG
GTGGCCGGAG ACGGTGTTCC CACCATTCTG CTGGCCGACC ACCAGACGAC CGGAGGGTAT
CCCAAGATCG CCACGGTGGT GGATTCGGAT CTGGACGCCT TCGTCCAGCT ACGCCCCCGC
GACCATGTCG GCTTCCTGGC CGTGACGCCG CAGCAGGCGA TCGAGCACAT CCGGCTTCGG
GCTGCGACTA TGTCCCGTTA CCTCGCGGCG GTCTGCGACG GACCATGGAA CGTCCGAACA
TAG
 
Protein sequence
MSQAVLAINF AGPHVAVQDG GRHGLMRYGV PASGPMDRIS FAAANVAVGN PAGQPAIEVS 
MGGLVLDCLS GDVTFAVAGG GFIVEHAGDK RGAWMVATLR AGERLAIRPG HWGSWTYLAF
VGHIEAKTWL GSMSTHSLSG LGGGRLTAGQ MVTVADPEVR DDRHGPITCP VIARPRSELR
VVIGPQDRFF SKETLSNFLS SPFRLSDAYD RMGVRLQGPS LAPSVALDMP SEAIVRGSVQ
VAGDGVPTIL LADHQTTGGY PKIATVVDSD LDAFVQLRPR DHVGFLAVTP QQAIEHIRLR
AATMSRYLAA VCDGPWNVRT