Gene Rleg_3092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3092 
Symbol 
ID8015760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3089784 
End bp3090740 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID644825659 
Productcholine ABC transporter, periplasmic binding protein 
Protein accessionYP_002976887 
Protein GI241205791 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAAGCACGTT CAAACTCGTT ACCGCAACTG CCGTCGCCGC CCTCTCCGTT 
GCGACCGCCG CCTTCGCCGC CGATCCCGAC AGCTGCTCCA CCGTTCATTT CTCGGACGTC
GGCTGGACGG ATATCACCGC CACCACCGCC ACCGCATCCA TCGTCCTGAA GAGCATCGGC
TATCAGACCG ACGTCAAGGT TCTCTCCGTG CCGGTCACCT ACACCTCGCT GAAGAACAAG
GACATCGACG TCTTCCTCGG CAACTGGATG GAGACGCAGG AGAAGGACGT TCGTCCCTAT
CTCGACGACA AGTCGGTCGA ATCCTTCGGC CCGAACCTGG TCGGCGCCAA GTACACGCTC
GCCACCAACG CCAAGGGTGC GGAACTCGGC ATCAAGGACT TCAAGGATAT CGCCGCCCAC
AAGGACGATC TCGACGGCAA GATCTACGGC ATCGAGCCCG GCAATGACGG CAACCGCCTC
GTCATGGACA TGATCGAGAA GAACGAGTTC GACTTGAAGG ATATGGAAGT CGTCGAATCC
TCCGAACAGG GCATGCTCGC CCAGGTCGCG CGTGCCGACA AGTCAGGCAA GCCCGTTGTT
TTCCTCGGCT GGGAGCCCCA CCCGATGAAC ACCAACTTCA AGCTGACCTA CCTCACAGGC
GGGGACAAAG TGTTCGGCCC CGACTTCGGC GGCGCCAAGG TCTTCACCAA CGTGCGTGCC
GGTTATCTCG ACGAATGCCC GAATGTCGGC ATGATGTTGA AGAACCTGAA ATTCTCCCTC
GACATGGAGA ACCAGATCAT GGGCAAGATC CTCAACGACG GCAAGGAGCC GGAGGCTGCA
GCTTCCGAAT GGCTGAAGGC GAACCCCGCA GCGCTCGAAC CCTGGCTCGC GGGCGTCAAA
ACCCGCGACG GCAGCGGCGA GGCTCTCGCG GCCGCCAAGA CCGGCCTCGG CCTCTGA
 
Protein sequence
MKTTSTFKLV TATAVAALSV ATAAFAADPD SCSTVHFSDV GWTDITATTA TASIVLKSIG 
YQTDVKVLSV PVTYTSLKNK DIDVFLGNWM ETQEKDVRPY LDDKSVESFG PNLVGAKYTL
ATNAKGAELG IKDFKDIAAH KDDLDGKIYG IEPGNDGNRL VMDMIEKNEF DLKDMEVVES
SEQGMLAQVA RADKSGKPVV FLGWEPHPMN TNFKLTYLTG GDKVFGPDFG GAKVFTNVRA
GYLDECPNVG MMLKNLKFSL DMENQIMGKI LNDGKEPEAA ASEWLKANPA ALEPWLAGVK
TRDGSGEALA AAKTGLGL