Gene Rleg2_5009 details

Gene Information

Locus tag: Rleg2_5009
Symbol:
ID: 6978103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 656957
End bp: 657979
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 54%
IMG OID: 643394155
Product: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Protein accession: YP_002278973
Protein GI: 209547055
COG category: [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
TIGRFAM ID:
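
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


length = gene_length_bp(656957, 657979)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.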


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAGCA TTAACTTCAA GGCCGCTGCT GCTCATGTTG CCCCGGTTTA TCTGGAACCC 
GTGGCGAGCG CTGAGAAAGC GTGCTCGGTA ATCGCAGAGG CTGCCCGAAA CGGGGCATCC
CTTGTAGTTT TTTCCGAGAG CTTTCTTCCC GGGTTTCCCG TCTGGGCAGC GCTTTATCCA
CCCATCCAAT CGCATGGACA TTTCAAGCGC TTCCTGAACG CTTCCGTATA TATGGATGGG
CCAGAAATTG ATCGTGTCCG GAAAGCTGCA TCAAAAAGCG GTGTTTTCGT ATCCATCGGG
TTCTCCGAGC GCAATCCAGC GAGTGTCGGA GGTCTGTGGA ACAGCAATGT CTTGATTTCC
GATACGGGCG AAATCCTGAT CCATCATCGA AAGCTTGTGG CAACTTTCTT CGAAAAACTG
GTTTGGGATC CAGGCGATGG CGCGGGTCTG GTCGTGGCAG AGACACGAAT CGGACGTATT
GGAGGCCTGA TCTGCGGCGA AAACACGAAT CCGCTTGCGC GCTATAGCCT GATGACGCAG
TCAGAGCAGG TTCACATAAG TAGTTACCCG CCGATCTGGC CAACTCGTGT TCCGACGGAG
AGCGAGAACT ACGATAACCG AGCGGCCAAC CGGATCCGTG CCTCGTCCCA TTGCTTCGAG
GCCAAGTGCT TCGGCATCGT CGTCGCAGGT CGCCTTGACG AAGCAGCGTG CAAAGCCATT
GCCCTGGATG ACACAGCTAT TTCAGCAATT ATAGATGCCA GTCCGCAGGC CAGCAGTTTT
TTCCTTGGGC CGACCGGGGC GCCAATAGGT GATGAAATGA TTGATGAAGG AATCGGCTAC
GCCATTATCG ATCTTGATGA TTGCGTTGAA CCTAAGCGGT TTCACGACGT CGTTGCTGGT
TACAACCGCT TTGATATATT CGACGTCGTC GTTAACCGGA CACGCCGCCA ACCGATCAGG
TTTCTGCAAG CTCGCTCCGA GGAAGCTCTG GTCGAGCCCG GGGCAATGGC TTTGCAGGAG
TAA
 
Protein sequence
MGSINFKAAA AHVAPVYLEP VASAEKACSV IAEAARNGAS LVVFSESFLP GFPVWAALYP 
PIQSHGHFKR FLNASVYMDG PEIDRVRKAA SKSGVFVSIG FSERNPASVG GLWNSNVLIS
DTGEILIHHR KLVATFFEKL VWDPGDGAGL VVAETRIGRI GGLICGENTN PLARYSLMTQ
SEQVHISSYP PIWPTRVPTE SENYDNRAAN RIRASSHCFE AKCFGIVVAG RLDEAACKAI
ALDDTAISAI IDASPQASSF FLGPTGAPIG DEMIDEGIGY AIIDLDDCVE PKRFHDVVAG
YNRFDIFDVV VNRTRRQPIR FLQARSEEAL VEPGAMALQE
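
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. It is a minimal illustration, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first row of the gene sequence is used here to keep the example short; the full sequence would be pasted in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as printed in the record.
first_row = "ATGGGAAGCA TTAACTTCAA GGCCGCTGCT GCTCATGTTG CCCCGGTTTA TCTGGAACCC"
seq = clean_sequence(first_row)

print(len(seq))           # number of bases in this row
print(seq[:3])            # first codon of the CDS
print(gc_fraction(seq))   # GC fraction of this row only, not of the whole gene
```

Applied to the complete gene sequence, the same two helpers could be used to confirm the 1023 bp length and to compare the result against the reported GC content.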