Gene Rleg2_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1761 
Symbol 
ID6980498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1802704 
End bp1803729 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content60% 
IMG OID643396484 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002281274 
Protein GI209549357 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.893641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0475771 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGA CAGTTCTTTC CACGCTTCTC TTTGCCGGAA CGGCGCTGGC TGCCGGTTCG 
GTCCAGGCCG CCGGCGAGCT CAACCTCATC TGCTCGGCCG ATGTCGTCAT CTGCGAGCAG
ATGCAGGGCG ACTTTGAAAA GGCTCACGAC ATCAAGGTGA ATATGGTGCG CCTGTCATCG
GGCGAGACCT ATGCCAAGAT ACGCGCCGAG GCTCGCAACC CGAAGACCGA CATCTGGTGG
GCCGGCACGG GGGATCCGCA TGTCCAGGCA GCATCGGAAA ACCTGACGCT GGAATACAAG
TCGCCAATGC TCGACCAGTT GCAGGATTGG GCCAAAAAGC AGGCGGAGAG CACCGGTTTC
AAGACGGTCG GCGTCTACGC CGGTGCGCTC GGCTGGGGTT ACAACACCGA GATCTTCAAG
ACCAAGGGTT ACAAGGAACC CCGCTGCTGG GCCGATCTCC TGGCGCCGGA ATTGAAGGGC
GAAATCCAGA TCGCCAATCC GAACTCCTCG GGCACTGCCT ATACGGCACT CGCTTCGCTG
GTGCAGATCA TGGGCGAAGA CCAGGCCATC GACTATCTGA AAAAACTAAA CGCCAACGTC
TCGCAATACA CCAAATCCGG ATCGGCTCCT GTCAAGGCGG CGGCGCGGGG CGAAACGGCC
CTCGGCATCG TCTTCATGCA TGACGCCGTC GCGCAGACCG CCGAAGGTTT TCCGGTCAAG
TCGGTCGCGC CATGCGAGGG CACCGGCTAT GAAATCGGCT CCATGTCGAT CATTAAAGGC
GCCCGCAATC TCGACAATGC CAAGATCTGG TACGACTGGG CGCTGCGACC CGAAGTGCAG
TCGCGCATGA AGGATGCCAA GTCCTTCCAG CTGCCGTCGA ACAAATCGGC GGAGGTGCCG
AAGGAAGCGC CGAAGTTCGA GGACATCAAG CTGATCGACT ACGACTTCAA GACCTATGGC
GATCCGGCAA AGCGCAAGGC CCTGCTCGAG CGCTGGGATC GGGAAGTCGG CGCCATCGCC
AACTGA
 
Protein sequence
MRLTVLSTLL FAGTALAAGS VQAAGELNLI CSADVVICEQ MQGDFEKAHD IKVNMVRLSS 
GETYAKIRAE ARNPKTDIWW AGTGDPHVQA ASENLTLEYK SPMLDQLQDW AKKQAESTGF
KTVGVYAGAL GWGYNTEIFK TKGYKEPRCW ADLLAPELKG EIQIANPNSS GTAYTALASL
VQIMGEDQAI DYLKKLNANV SQYTKSGSAP VKAAARGETA LGIVFMHDAV AQTAEGFPVK
SVAPCEGTGY EIGSMSIIKG ARNLDNAKIW YDWALRPEVQ SRMKDAKSFQ LPSNKSAEVP
KEAPKFEDIK LIDYDFKTYG DPAKRKALLE RWDREVGAIA N