Gene Rleg_4643 details

Gene Information

Locus tag: Rleg_4643
Symbol:
ID: 8007122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 1520
End bp: 2536
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 62%
IMG OID: 644821580
Product: sugar ABC transporter, substrate-binding protein
Protein accession: YP_002972840
Protein GI: 241113005
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
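
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is one way to do that, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
start_bp = 1520
end_bp = 2536
gene_length_bp = 1017
protein_length_aa = 338


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

With the values listed here, 2536 - 1520 + 1 = 1017 bp, and 1017 / 3 - 1 = 338 aa, so the four fields agree.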


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCC GTAAGATGCT TCTGGCATCG GCCGCTATTG CTTGCGCCGC GATGCCCGTT 
TCCGCCTTTG CCGAGACATC GGCCAAGAAA ATCGCCCTTT CCAACAACTA TGCCGGCAAT
TCGTGGCGCC AGGCCATGCT GACGAGTTGG GGCAAGGTGA CGGGCGAAGC CGTGAAGGCC
GGCACCGTTG CCGCAGCCGA CCCTTTCACC ACCGCCGAGA ACCAGGCGAC GGAACAGGCC
GCGCAGATCC AGAACATGAT CCTGCAGGGT TATGACGCCA TCGTGCTGAA CGCCGCCTCG
CCGACGGCGC TGAACGGTGC GGTCAAGGAA GCCTGCGACG CCGGGATCAC CGTCGTGTCC
TTCGATGGTA TCGTCACCGA ACCCTGCGCC TGGCGTATCG CCGTCAACTT CAAGGAAATG
GGCCGCAGTG AGGTCGAGTA CTTATCGAAG AAACTTCCGG ACGGCGGCAA CCTGCTCGAG
ATCCGCGGCC TTGCCGGTGT CTTCGTCGAC GACGAAATCT CGGCGGGCAT CCACGACGGT
GTCAAGCAGT TCCCGCAGTT CAAGATTGTT GGCTCCGTTC ACGGCGACTG GGCGCAGGAC
GTGGCGCAGA AGGCTGTTGC CGGCATCCTG CCGAGCCTGC CCGACATCGC CGGCGTCGTA
ACGCAGGGCG GTGACGGCTA TGGCGCCGCA CAGGCGATTG CCGCAACCGA CCGGAAGATG
CCGATCATTA TCATGGGCAA CCGCGAGGAC GAACTGAAGT GGTGGAAGGA GCAGAAGGAC
GCGAAGAGCT ACGAGACCAT GTCCGTATCC ATCGCGCCAG GCGTCTCCAC ACTCGCTTTC
TGGGTGGCCC AGCAGATCCT CGACGGTAAG GAAGTCAAGA AGGACCTCGT CGTGCCCTTC
CTGCGCATCG ACCAGGACAA TCTCGAAACC AACCTCGCCA ATACCCAGGC CGGCGGCGTC
GCCAACGTGG AATACACGCA GGCAGACGCA ATCAAGGTCA TCGAGTCCGC AAAGTAA
 
Protein sequence
MTIRKMLLAS AAIACAAMPV SAFAETSAKK IALSNNYAGN SWRQAMLTSW GKVTGEAVKA 
GTVAAADPFT TAENQATEQA AQIQNMILQG YDAIVLNAAS PTALNGAVKE ACDAGITVVS
FDGIVTEPCA WRIAVNFKEM GRSEVEYLSK KLPDGGNLLE IRGLAGVFVD DEISAGIHDG
VKQFPQFKIV GSVHGDWAQD VAQKAVAGIL PSLPDIAGVV TQGGDGYGAA QAIAATDRKM
PIIIMGNRED ELKWWKEQKD AKSYETMSVS IAPGVSTLAF WVAQQILDGK EVKKDLVVPF
LRIDQDNLET NLANTQAGGV ANVEYTQADA IKVIESAK
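
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sense-codon assignments of translation table 11, which match the standard genetic code, and it strips the spaces that group the sequence into blocks of ten. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first three blocks of the gene sequence are used here for brevity.

```python
# Codon table in the conventional T, C, A, G ordering (standard code;
# table 11 uses the same amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a sequence copied from the page."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
prefix = "ATGACAATCC GTAAGATGCT TCTGGCATCG"
print(translate(prefix))  # MTIRKMLLAS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be the way to check the complete protein and the listed GC content.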