Gene Rleg2_4666 details

Gene Information

Locus tag: Rleg2_4666
Symbol:
ID: 6977760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 303357
End bp: 304427
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 63%
IMG OID: 643393840
Product: ABC transporter related
Protein accession: YP_002278658
Protein GI: 209546740
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
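
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 303357
end_bp = 304427
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so the span
# covered by the gene is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which does
# not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 1071 bp, which is 357 codons, one of which is the stop codon.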


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.47492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGCT TGAGCATCAG AAACGTCAAG AAATCCTTCG GCACGGTCGA TATCATTCAT 
GGCGTCGACG TCGAGATCGC CGATGGTGAA TTCACCATCC TGGTCGGCCC CTCCGGCTGC
GGCAAGTCGA CCTTGCTGCG CATGATCGCC GGACTTGAGG ATATCACCGG CGGCCAGATC
AGCATCGACG GCCGGGTGGT GAACAATCTG CAGCCGAAGG ATCGCGATAT CGCGATGGTC
TTCCAGAACT ACGCGCTGTA CCCGCAGATG ACCGTCTCCC AGAACATGGG CTTCGCGTTG
GAGCTTGCCG GCGCCAAGCG GCCGGAGATC GAAAAGAAAG TCGGCGAGGC CGCCGCCATT
CTCGGCCTGC AGCCGCTTCT CCACCGCAAG CCGGCCCAGC TTTCCGGCGG GCAGCGCCAG
CGCGTCGCCA TGGGCCGCGC CATCGTTCGC GATCCCAAAG TCTTCCTCTT CGACGAGCCG
CTTTCCAATC TCGATGCGAA ACTGCGGGTG AAGATGCGGG CGGAGATCAA GGCGCTGCAT
CAGCGGCTGA AGACGACCAT CGTCTACGTC ACCCATGACC AGATCGAAGC CATGACCATG
GCCGACAAGA TCGTCGTGCT GCATGGCGGC CGTGTCGAGC AGATCGGCAG TCCGCTCGAA
CTCTACGACA GGCCGCGCAA CATTTTCGTC GCCGGTTTCC TCGGCTCCCC CGCCATGAAC
TTTCTCGAGG GGACGATCGA TGAGGCGGGA AAGCCGGCAT TGGCGCTTTC CAGCGGGTCG
CGCGTGGCAC TCTCGCGGGC GCCGGCCAAT TCCGCCAACC GGCCGCTGAC CCTCGGCATC
CGCCCCGAAG ACATCGCCTT CGGCGGCGAG AACGGGGTCG ATGCCGTGGT CAAGGTGGTC
GAGCCGACGG GATCGGAAAC CCATGTCGCC GTGGAGGTGG ATGGCCGGGA GCTCACATGG
GTGGTGCGCG AACGTGTCGA GCTCGCCCCG GAACAGCCGG TAAAGCTTTC CTTCGAGACC
TCCAAGGTTC ATTTTTTCGA CCGGCAGACG CAGCAGCGTT TGAACGCCTG A
 
Protein sequence
MSGLSIRNVK KSFGTVDIIH GVDVEIADGE FTILVGPSGC GKSTLLRMIA GLEDITGGQI 
SIDGRVVNNL QPKDRDIAMV FQNYALYPQM TVSQNMGFAL ELAGAKRPEI EKKVGEAAAI
LGLQPLLHRK PAQLSGGQRQ RVAMGRAIVR DPKVFLFDEP LSNLDAKLRV KMRAEIKALH
QRLKTTIVYV THDQIEAMTM ADKIVVLHGG RVEQIGSPLE LYDRPRNIFV AGFLGSPAMN
FLEGTIDEAG KPALALSSGS RVALSRAPAN SANRPLTLGI RPEDIAFGGE NGVDAVVKVV
EPTGSETHVA VEVDGRELTW VVRERVELAP EQPVKLSFET SKVHFFDRQT QQRLNA
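
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative and not part of the original record. It builds the codon table by hand instead of relying on a bioinformatics library, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in how codons are translated). The function name `translate` and the two test strings, copied from the first and last lines of the gene sequence, are choices made for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record)
    is ignored, and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTCTGGCT TGAGCATCAG AAACGTCAAG AAATCCTTCG GCACGGTCGA TATCATTCAT"
last_line = "TCCAAGGTTC ATTTTTTCGA CCGGCAGACG CAGCAGCGTT TGAACGCCTG A"

print(translate(first_line))  # MSGLSIRNVKKSFGTVDIIH
print(translate(last_line))   # SKVHFFDRQTQQRLNA
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own. The two outputs match the first 20 and last 16 residues of the protein sequence above.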