Gene Rleg2_5738 details


Gene Information

Locus tag: Rleg2_5738
Symbol:
ID: 6977128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 138056
End bp: 139072
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 63%
IMG OID: 643393194
Product: putative sugar ABC transporter, substrate-binding protein
Protein accession: YP_002278012
Protein GI: 209546122
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
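
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (the convention the reported values suggest) and assumes the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(138056, 139072)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Both results match the Gene Length and Protein Length fields of the record.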


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCC GCAAGATGCT TCTGGCATCG GCCGCTATTG CTTGCGCCGC GATGCCCGTT 
TCTGCCTTTG CCGACACGTC GGCCAAGAAA ATCGCTCTTT CCAACAATTA TGCCGGCAAC
TCATGGCGCC AGGCCATGCT GACGAGCTGG GGCAAGGTGA CGGGCGAAGC CGTGAAGGCC
GGCACCGTTG CCGCAGCCGA CCCTTTCACC ACCGCCGAGA ACCAGGCGAC GGAGCAGGCC
GCGCAGATTC AGAACATGAT CCTGCAGGGC TATGACGCCA TCGTGCTGAA CGCCGCCTCG
CCGACGGCAC TGAACGGCGC GGTCAAAGAA GCCTGCGATG CCGGCATCAC CGTGGTGTCC
TTCGACGGCA TCGTGACCGA ACCTTGCGCC TGGCGCATTG CCGTCAACTT CAAGGAAATG
GGCCGCAGCG AAGTTGAGTA CCTGTCGAAG AAACTCCCTG AGGGCGGCAA CCTGCTCGAG
ATCCGCGGTC TTGCCGGTGT CTTCGTCGAT GACGAGATCT CGGCGGGCAT TCACGACGGC
GTCAAGCAGT ACCCGCAGTT CAAGGTCGTC GGCTCCGTTC ACGGCGATTG GGCGCAGGAC
GTGGCGCAGA AGGCGGTTGC CGGCATCCTG CCGAGCCTGC CCGACATCGT CGGCGTGGTG
ACGCAGGGCG GCGACGGTTA TGGCGCCGCG CAGGCGATTG CGGCGACCGA CCGGAAGATG
CCGACCATCA TCATGGGCAA CCGCGAAGAT GAACTGAAGT GGTGGAAGGA GCAGAAGGAC
GGCAAGGGCT ACGAGACCAT GTCCGTGTCG ATCGCGCCCG GCGTCTCAAC ACTCGCCTTC
TGGGTCGCTC AGCAGATCCT CGACGGCAAG GAGGTCAAGA AGGACCTCGT GGTGCCCTTC
CTGCGCATCG ACCAGGACAA TCTCGAAACG AACCTCGCCA ATACCCAGGC CGGCGGCGTC
GCCAACGTGG AATACACGCA GGCAGACGCC ATCAAGGTCA TCGAGTCGGC AAAGTAA
 
Protein sequence
MTIRKMLLAS AAIACAAMPV SAFADTSAKK IALSNNYAGN SWRQAMLTSW GKVTGEAVKA 
GTVAAADPFT TAENQATEQA AQIQNMILQG YDAIVLNAAS PTALNGAVKE ACDAGITVVS
FDGIVTEPCA WRIAVNFKEM GRSEVEYLSK KLPEGGNLLE IRGLAGVFVD DEISAGIHDG
VKQYPQFKVV GSVHGDWAQD VAQKAVAGIL PSLPDIVGVV TQGGDGYGAA QAIAATDRKM
PTIIMGNRED ELKWWKEQKD GKGYETMSVS IAPGVSTLAF WVAQQILDGK EVKKDLVVPF
LRIDQDNLET NLANTQAGGV ANVEYTQADA IKVIESAK
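
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The sketch below is a minimal illustration, not the pipeline that produced this record: it builds the codon table from the usual compact 64-letter string (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), strips the 10-base spacing used above, and stops at the first stop codon. It does not handle alternative start codons such as GTG or TTG, which is harmless here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACAATCC GCAAGATGCT TCTGGCATCG"
print(translate(first_blocks))  # MTIRKMLLAS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the reported GC content.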