Gene Rleg_6755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6755 
Symbol 
ID8022685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp191774 
End bp192799 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID644833622 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002984756 
Protein GI241666672 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAC AACGGCTCAC CCGGCTCATC GCAGCTGCGG TCATGGCAGG CAGCTTCGCG 
ATCGGAAGCA TTGCTCCGGC ATTCGCAGAT CAGACGCTTC TTAACGTTTC CTACGATCCG
ACCCGCGAAT TGTATAAGGA TTTCAATGCC GCCTTTGCCG CCAAGTGGCA AAAGGACAAT
GGTGAAACCC TGACGATCCA GGCTTCGCAT GGCGGTTCCG GCGCCCAGGC CCGCTCGGTC
ATCGACGGTC TCGACGCCGA TGTCGTGACA CTTGCCCTCG AAGGCGATAT CGACGCCATC
GCCAAGGCGA CCGGCAAGAT CCCGGCCGAC TGGAAGACCA AGTTCCCCAA CAATTCGACG
CCTTATACGT CGACGATCGT CTTCCTCGTG CGCAAGGGCA ACCCGAAAGG CATCAAGGAT
TGGGGAGACC TGGTCAAGGA CGACGTGCAG GTGATCACCC CGAACCCGAA GACATCGGGC
GGCGCGCGCT GGAACTTCCT TGCCGCATGG GCATGGGCCA AGCAGTCAAA TGGCGGCGAC
GAAGCCAAGG CGCAGGAATA CGTTGCGAAA CTCCTGCAGC ACGTCCCGGT TCTCGACACC
GGCGCTCGCG GCGCCACGAC CACCTTTGTC CAGCGCGGCC TCGGCGATGT GCTGCTCGCC
TGGGAAAACG AAGCCTATCT TTCGCTCGAA GAGCTCGGTC CCGACCAGTT CGAGATCGTA
ACACCGACCT TCTCCATCCG CGCCGATCCG CCGGTCGCCG TCGTCGACGG CAATGTCGAC
AAGAAGGGCA CGCGCAAGGT CGCCGAAGCC TATCTCAACT ACCTCTATTC GGACGAAGGC
CAGAAGATCG CCGCCAAGCA CTACTATCGG CCGACCAAGC CGGAAGCCGC CGATCCGGCT
GACATCGCCC GCTTCCCGAA GCTGACGCTG GCGACCATCG ACGACTTCGG CGGCTGGAAG
GACGCACAAC CTAAATTCTT CGGCGACGGC GGCGTATTTG ACCAAATCTA CAAGCCGGCC
CAATAA
 
Protein sequence
MQTQRLTRLI AAAVMAGSFA IGSIAPAFAD QTLLNVSYDP TRELYKDFNA AFAAKWQKDN 
GETLTIQASH GGSGAQARSV IDGLDADVVT LALEGDIDAI AKATGKIPAD WKTKFPNNST
PYTSTIVFLV RKGNPKGIKD WGDLVKDDVQ VITPNPKTSG GARWNFLAAW AWAKQSNGGD
EAKAQEYVAK LLQHVPVLDT GARGATTTFV QRGLGDVLLA WENEAYLSLE ELGPDQFEIV
TPTFSIRADP PVAVVDGNVD KKGTRKVAEA YLNYLYSDEG QKIAAKHYYR PTKPEAADPA
DIARFPKLTL ATIDDFGGWK DAQPKFFGDG GVFDQIYKPA Q