Gene Rleg_6061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6061 
Symbol 
ID8016323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp95850 
End bp96878 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content60% 
IMG OID644827369 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002978569 
Protein GI241258685 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA AGACCCTTTC CCTCATGCTC TTCGCCGGCA CGGCGCTCGG TGCGCTACCG 
GCGCAGGCGG CCGGCGAACT CAACCTCATC TGCTCGGCGG ACGTCGTGAT CTGCGAGCAG
ATGAAGGGCG ATTTCGAGAA GTCGCATAGC GACATCAAGG TGAACATGGT TCGCCTCTCG
TCGGGCGAGA CCTATGCCAA GGTGCGCGCC GAGTCTCGTA ACCCGAAGAC CGACATCTGG
TGGGCTGGCA CAGGCGATCC GCATCTCCAG GCGGCATCGG AAAATTTGAC GCTGGAATAC
AAGTCGTCGA AACTCGACGA ACTCAACGAC TGGGCGAAGA AGCAGGCAGA AAGCTCCGGT
TACAAGACCG TCGGCGTTTA TGCCGGCGCG CTCGGCTGGG GCTACAACAC GGAAATCTTC
AAGACCAAGG GCTACAAGGA GCCAGTCTGC TGGGCCGACC TTTTGGCACC GGAACTGAAG
GGTGAAATTC AAATCGCGAA CCCGAATTCT TCCGGCACCG CTTACACGGC GCTCGCCTCT
CTCGTGCAGA TCATGGGCGA GGACCAGGCT TTCGACTACC TGAAGAAGCT GAACGGCAAC
ATATCGCAAT ATACCAAGTC CGGATCGGCA CCCGTCAAGG CCGCAGCACG CGGCGAGACG
GCGCTCGGCA TCGTCTTCGT GCACGATGCG GTGGCGCAGA CGGCTGAAGG CTTCCCGGTC
AAGTCGATCA CGCCTTGCGA AGGTACCGGC TACGAGATCG GCTCCATGTC GATCATCAAG
GGCGCCCGCA ACCTCGAAAA TGCGAAGGTC TGGTACGACT GGGCGCTGAC GGCGGAAGTC
CAGTCGCGCA TGAAGGATGC CAAGTCTTTC CAGCTGCCTT CCAACAAGAG CGCCGTAATC
CCGAAGGAGG CGCCGCGCTT CGAGGACATC AAGCTGATCG ACTACGACTT CAAGACCTAT
GGCGACCCAG CAAAGCGCAA GGCACTGCTG GAACGCTGGG ATCGGGAAAT CGGCGCCGCC
GCCAACTGA
 
Protein sequence
MKMKTLSLML FAGTALGALP AQAAGELNLI CSADVVICEQ MKGDFEKSHS DIKVNMVRLS 
SGETYAKVRA ESRNPKTDIW WAGTGDPHLQ AASENLTLEY KSSKLDELND WAKKQAESSG
YKTVGVYAGA LGWGYNTEIF KTKGYKEPVC WADLLAPELK GEIQIANPNS SGTAYTALAS
LVQIMGEDQA FDYLKKLNGN ISQYTKSGSA PVKAAARGET ALGIVFVHDA VAQTAEGFPV
KSITPCEGTG YEIGSMSIIK GARNLENAKV WYDWALTAEV QSRMKDAKSF QLPSNKSAVI
PKEAPRFEDI KLIDYDFKTY GDPAKRKALL ERWDREIGAA AN