Gene Rleg_6921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6921 
Symbol 
ID8022949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp372781 
End bp374127 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID644833782 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_002984916 
Protein GI241666832 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.772055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.983522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC TTCACGCAGA CACGGCGCTG ACGCCGCAGG GCTGGCAGAA GGATGTGCGG 
CTGACGCTTG AGGCCGGGCG CATCGCGCGG GTCGAAATCG GCACTTCTCC CGAGCCCGGC
GACGAGTGTC ATGCTCTCCT CGTTCCCGCC ATGGCAAACC TGCATAGCCA CGCCTTCCAG
CGGGCGATGG CCGGCCTTGC CGAAGTGCGC GGCCCGGCCA ATGACAGCTT CTGGAGCTGG
CGCACGGTCA TGTACAAGTT TGCCCTGGCG ATGACGCCTG ATCATGTCGA GGCGGTCGCG
GCCAAGCTTT ATGCGGAAAT GCTGGAGGCC GGTTTTTCCC GCGTCGGCGA GTTCCACTAT
CTCCACCACG ACAGGGATGG GGGCACTTAC GCCAATATCG CCGAGCTTGC CGAACGCATT
GGCGCGGCAA GTCAAGAGAC CGGTATCGGC CTGACACTGC TGCCGGTCTT CTATGCCCAT
TCCGGCTTCG GCGGCGCTGC CCCGATCGAC GGCCAGCGGC GTTTCATCAA TTCGCTCGAA
AGCTTCGAAA GGCTGATGGA GGGATGCCGG GCAGTCACCG GCCGGCTCGA CGGCGCCGAG
CTTGGGCTGG CGCCGCACAG CCTGCGGGCG GCGACACCGG AGGAGCTCAC AAGGCTGGTG
CCGATGGCCG GCGACGGTCC CATCCATATC CATGTCGCCG AGCAGGTGAA GGAGGTCGAG
GACTGCATTG CCTGGTCCGG CGCGCGGCCT GTGCAATGGC TGCTCGATCA TGCGCCTATG
GATGAGCGCT GGTGCCTGAT CCATGCGACG CACATGACCG AGGACGAAAC GCGGCGGATG
GCGAAAAGCG GCGCGATCGC CGGCCTCTGC CCGATTACCG AAGCCAATCT CGGCGACGGC
GCTTTCGCCG CGCCGCTCTT TCTCGAAGAG GGCGGGCGTT ACGGCATCGG TTCGGATTCC
AACGTGCTGA TCTCCGTGCC GGAGGAACTG CGCCAGCTCG AATATTCGCA GCGCCTGGCG
CTTCGCGCCC GCAACGTCGT CGCGGCACCC GGCGGATCGA CGGCGCTTTC GCTGTTCACT
CATGCGCTTG CCGGCGGCGG TGCTGCGCTG AAAGCACCGG CGGGTCTTGC TGAGGGCCAT
CACGCCGATA TCGTTTCGCT CGATACGACG GCTGTTCCTT ATCTCGCAGG CGACCAGATC
CTCGATCACT GGCTGTTTGC AGGCGGCATT TCCGTCGATT GCGTCTGGGC GCATGGCCGC
AAGCAGGTGG AGGGCGGTCG CCATCTCAAG CGTGATGCCA TCGACCGGCG CTTCCTCGCT
GCGATGGGTG AACTGCTGGC CGACTGA
 
Protein sequence
MTTLHADTAL TPQGWQKDVR LTLEAGRIAR VEIGTSPEPG DECHALLVPA MANLHSHAFQ 
RAMAGLAEVR GPANDSFWSW RTVMYKFALA MTPDHVEAVA AKLYAEMLEA GFSRVGEFHY
LHHDRDGGTY ANIAELAERI GAASQETGIG LTLLPVFYAH SGFGGAAPID GQRRFINSLE
SFERLMEGCR AVTGRLDGAE LGLAPHSLRA ATPEELTRLV PMAGDGPIHI HVAEQVKEVE
DCIAWSGARP VQWLLDHAPM DERWCLIHAT HMTEDETRRM AKSGAIAGLC PITEANLGDG
AFAAPLFLEE GGRYGIGSDS NVLISVPEEL RQLEYSQRLA LRARNVVAAP GGSTALSLFT
HALAGGGAAL KAPAGLAEGH HADIVSLDTT AVPYLAGDQI LDHWLFAGGI SVDCVWAHGR
KQVEGGRHLK RDAIDRRFLA AMGELLAD