Gene Rleg_3637 details


Gene Information

Locus tag: Rleg_3637
Symbol:
ID: 8014486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3673711
End bp: 3674757
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 63%
IMG OID: 644826202
Product: extracellular solute-binding protein family 1
Protein accession: YP_002977421
Protein GI: 241206325
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
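
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes the start and end positions are inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TGA). The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3673711
end_bp = 3674757
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1047 bp is 349 codons, of which 348 encode amino acids.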


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.276394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGCTGG CGTTCTGGCC GGGCTTCGCG CTCGCCGATC AGGCCTTCTA TCCGGCGAAG 
TCGGGCAATG CCGATGCGCC GGTGCTGACG GTTTATTCCT CGCTCGACGA GCCCCTGGCG
CAGCCGATGA TCCGGGGTTT CCTGGACGCC AATCCCGATA TCGCAGTCAA ATATGAGGAC
ATGCTGACCG GCGACATCTA CGACCGGATC GTCAGGGAGA CGGATGCCGG CAAGAAGACG
GCGGATTTCG CCTTCTCCTC GGCGATGGAC CTGCAGGTGA AGCTTTCCAA TGACGGATAT
GCTCAGGTCA GCAACCTGCC GATGAGCGGT GCATGGCCGA AATGGGCGAA CTGGCGCAAC
ACCGCCTATG CGCTCACCTT CGAGCCGGCG GTGTTCGTCT ATCACAAGCC GAGCTTTGCG
CATGAGCCGG TGCCGAGCTC GCGGGCTGAA TTCGTCGATT ATCTGAAACG CAAGGGCAAC
GACGTCTATG GGCGGATTGG CACCTACGAT ATCGAGCGCT CGGGCGTCGG CTTTCTTTTC
ATGGCGCGCG ACCAGGAGCA GTTCGGCGAC ATCTGGTCGG TGATCGGGGC GATGGGGGCT
GCCGGCGTCA AGCTTTATTC GACGAGTTCG GCGATCCTCG AACGCGTTGC CGACGGGCGC
TTCGTGCTCG GCTACAATAT TCTCGGCTCC TATGCGGCCG ACTGGGCGTC GCGCTATCCC
GATGTCGGCA TCGTGCTGCC GAAGGATTAT ACCGTGGTGA TGTCGCGGAT CGGGCTGGTG
CCGCAGGCCG CCGCCGATCC GGAACTCGGT CGGCGTTACC TTACCTTCTT CATGTCGAGG
GAAGGGCAGA CGATCCTGGC GCGCGAGCTG CAGATCCCGG CGGTCAGCCC CGAGGTGGCA
GGCGAGAATA CCGCCAATAC GCTGCAGGAA CTGCTCGGCG CCCAACTGCG GCCGGTGCCG
GTCAGCCCCG GATTGATGGT CTATCTCGAC CAGGTGAAGC GGGCGCGGCT GATCGCGCAT
TGGAACGAGG TTCTGCGGAT GCAGTGA
 
Protein sequence
MLLAFWPGFA LADQAFYPAK SGNADAPVLT VYSSLDEPLA QPMIRGFLDA NPDIAVKYED 
MLTGDIYDRI VRETDAGKKT ADFAFSSAMD LQVKLSNDGY AQVSNLPMSG AWPKWANWRN
TAYALTFEPA VFVYHKPSFA HEPVPSSRAE FVDYLKRKGN DVYGRIGTYD IERSGVGFLF
MARDQEQFGD IWSVIGAMGA AGVKLYSTSS AILERVADGR FVLGYNILGS YAADWASRYP
DVGIVLPKDY TVVMSRIGLV PQAAADPELG RRYLTFFMSR EGQTILAREL QIPAVSPEVA
GENTANTLQE LLGAQLRPVP VSPGLMVYLD QVKRARLIAH WNEVLRMQ
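
The gene sequence begins with TTG, yet the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of several alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline that produced this record. The function name `translate_cds` and the hand-built codon table are illustrative assumptions. It translates only the first and last sequence blocks shown above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the standard
# amino-acid assignments and differs in its set of permitted start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from this record.
first_block = "TTGCTGCTGG CGTTCTGGCC GGGCTTCGCG"
last_block = "TGGAACGAGG TTCTGCGGAT GCAGTGA"

print(translate_cds(first_block))  # MLLAFWPGFA
print(translate_cds(last_block))   # WNEVLRMQ
```

Both results match the corresponding ends of the protein sequence above. The start-codon rule is applied only at position 0, so it is meaningful only when the input begins at the true start of the CDS.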