Gene Rleg2_2823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2823 
Symbol 
ID6981567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2871648 
End bp2872604 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID643397535 
Productcholine ABC transporter, periplasmic binding protein 
Protein accessionYP_002282319 
Protein GI209550402 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0554207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAAGCACGTT CAAACTGGTT ACTGCAACTG CCGTCGCCGC CCTCTCCGTC 
GCGACCGCCG CCTTCGCCGC CGATCCCGAC AGCTGCTCCA CCGTCCACTT CTCGGATGTC
GGCTGGACTG ATATTACCGC CACCACCGCC ACCGCATCCG TCGTCTTGAA GAGCATCGGC
TATCAGACCG ACGTGAAGGT TCTCTCGGTG CCGGTCACCT ACACCTCGCT GAAGAACAAG
GACATCGACA TCTTCCTCGG CAACTGGATG CCGACGCAGG AAAAGGACGT CCGCCCTTAT
CTCGATGACA AGTCGGTCGA ATCCTTCGGC CCCAACCTCG TCGGCGCCAA GTACACACTC
GCCACCAACG CCAAGGGCGC CGAACTCGGC ATCAAGGACT TCAAGGACAT CGCCGCCCAC
AAGGACGATC TCGACGGCAA GATCTACGGC ATTGAGCCCG GCAATGACGG CAACCGCCTG
GTCATGGACC TGATCGAAAA GAACACCTTC GGCATGAAGG AGATGGAAGT CGTCGAATCC
TCCGAACAGG GCATGCTCGC TCAGGTCGCC CGTGCCGAGA AAGCAGGCAA GCCCGTCGTC
TTCCTCGGCT GGGAGCCCCA TCCGATGAAC ACCAACTTCA AGCTGACCTA CCTCACCGGC
GGTGACGACG TCTTCGGTCC AGACTTCGGC GGTGCCAAGG TCTATACCAA TGTCCGCGCC
GGCTATCTCG ACGAATGCCC GAATGTCGGC GCGATGCTGA AGAACCTGAA GTTCTCCCTC
GACATGGAGA ATAAGATCAT GGGCAAGATC CTCGACGACG GCAAGGAGCC GGAAGCTGCC
GCTTCCGAAT GGTTGAAGGC GAACCCTTCT GCGCTCGAGC CCTGGCTCGC AGGGGTCAAG
ACCCGCGACG GCAAGGGCGA CGCGCTGGCG GCCGCCAAGA CCGGTCTCGG CCTTTGA
 
Protein sequence
MKTTSTFKLV TATAVAALSV ATAAFAADPD SCSTVHFSDV GWTDITATTA TASVVLKSIG 
YQTDVKVLSV PVTYTSLKNK DIDIFLGNWM PTQEKDVRPY LDDKSVESFG PNLVGAKYTL
ATNAKGAELG IKDFKDIAAH KDDLDGKIYG IEPGNDGNRL VMDLIEKNTF GMKEMEVVES
SEQGMLAQVA RAEKAGKPVV FLGWEPHPMN TNFKLTYLTG GDDVFGPDFG GAKVYTNVRA
GYLDECPNVG AMLKNLKFSL DMENKIMGKI LDDGKEPEAA ASEWLKANPS ALEPWLAGVK
TRDGKGDALA AAKTGLGL