Gene Rleg2_5843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5843 
Symbol 
ID6977232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp255379 
End bp256404 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID643393298 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002278116 
Protein GI209546226 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.302205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAC AACGGCTCAC CCGGCTCCTC GCGGCCGCGG TCATGGCAGG CAGCTTCGCG 
ATCGGGAGCA TTGCTCCTGC TTTCGCGGAT CAGACGCTGC TCAACGTTTC CTATGATCCG
ACTCGCGAAT TGTACAAGGA TTTCAACGCC GCCTTTGCCG CGAAGTGGAA AAAGGACAAC
GGCGAAGCCG TAACGATCCA GGCCTCCCAT GGCGGTTCCG GCGCCCAGGC CCGCTCGGTC
ATCGACGGCC TCGACGCCGA TGTCGTGACG CTGGCCCTCG AAGGCGATAT CGACGCCATT
GCCAAAGCCA CCGGCAAGAT CCCGGCCGAC TGGAAGACGA AATTCCCCAA CAATTCGACG
CCTTATACGT CGACGATCGT CTTCCTGGTC CGCAAGGGCA ACCCGAAGGG CATCAAGGAT
TGGGGCGACC TGGTCAAGGA CGACGTGCAG GTGATCACGC CGAACCCGAA GACCTCGGGC
GGCGCCCGCT GGAACTTCCT CGCCGCCTGG GCATGGGCCA AGCAGGCGAA TGGCGGCGAT
GAAGCCAAGG CGCAGGATTA TGTCGCGAAA CTGCTGCAGC ACGTTCCGGT TCTCGATACC
GGCGCGCGCG GCGCCACGAC CACCTTCGTC CAGCGCGGCC TCGGCGACGT GCTGCTTGCC
TGGGAAAACG AGGCCTATCT TTCGCTTGAA GAACTCGGCC CCGACCAGTT CGAGATCGTC
ACCCCGAGCT TCTCCATCCG CGCCGACCCG CCGGTCGCCG TCGTCGACGG CAATGTCGAC
AAGAAGGGCA CGCGCAAGGT CGCCGAAGCC TATCTCAACT ACCTCTATTC GGATGAAGGC
CAGAAGATCG CCGCCAAGCA TTATTACCGG CCGTTCAAGC CTGAAGCCGC CGATCCGGCC
GATATCGCCC GCTTCCCGAA GCTGACGCTC GCGACCATCG ACGACTTCGG CGGCTGGAAA
GAAGCCCAGC CGAAATTCTT CGGCGACGGC GGGGTATTTG ACCAGATCTA TAAGCCGGCC
CAATAA
 
Protein sequence
MQTQRLTRLL AAAVMAGSFA IGSIAPAFAD QTLLNVSYDP TRELYKDFNA AFAAKWKKDN 
GEAVTIQASH GGSGAQARSV IDGLDADVVT LALEGDIDAI AKATGKIPAD WKTKFPNNST
PYTSTIVFLV RKGNPKGIKD WGDLVKDDVQ VITPNPKTSG GARWNFLAAW AWAKQANGGD
EAKAQDYVAK LLQHVPVLDT GARGATTTFV QRGLGDVLLA WENEAYLSLE ELGPDQFEIV
TPSFSIRADP PVAVVDGNVD KKGTRKVAEA YLNYLYSDEG QKIAAKHYYR PFKPEAADPA
DIARFPKLTL ATIDDFGGWK EAQPKFFGDG GVFDQIYKPA Q