Gene Rleg_4747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4747 
Symbol 
ID8006968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp115236 
End bp116306 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content60% 
IMG OID644821677 
ProductABC transporter related 
Protein accessionYP_002972937 
Protein GI241113102 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.155462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGCT TGAGCATCAG AAACGTCAAG AAATCCTTCG GCACGGTCGA CATCATTCAT 
GGTGTCGACG TCGAGATCGC CGACGGTGAA TTTACCATTC TCGTCGGCCC CTCCGGCTGC
GGCAAGTCGA CATTGCTGCG CATGATTGCC GGGCTCGAGG ATATCACCGC CGGCCAGATC
AGCATCGATG GTCGGGTCGT GAACAATCTG CAGCCGAAGG ATCGCGATAT CGCGATGGTC
TTCCAGAACT ATGCATTGTA CCCGCAGATG ACGGTCTCCC AGAACATGGG ATTTGCGCTC
GAGCTCGCCG GGGTCAAGCG GCCGGAAATC GAACAGAAGG TCGGTGAGGC TGCAGCAATT
CTCGGATTGC AGCCGCTTCT CGATCGAAAG CCGGCGCAGC TGTCGGGCGG ACAGCGGCAG
CGCGTCGCCA TGGGCCGCGC CATTGTTCGA GATCCGAAAG TCTTTCTCTT CGACGAGCCG
CTCTCCAATC TGGATGCGAA ACTGCGGGTG AAGATGAGGG CGGAGATCAA GGCTCTGCAC
CAGCGCCTGA AGACGACGAT CGTTTACGTC ACCCATGACC AGATCGAGGC CATGACCATG
GCTGACAAGA TCGTCGTGCT CCACGGCGGT CGGGTCGAAC AGATCGGCAG CCCGCTCGAA
CTCTACGACC GACCGCGCAA TATCTTTGTC GCCGGCTTCC TCGGTTCCCC CGCGATGAAT
TTTCTCGAGG GAACTCTTGA GGGCGCAGGC AACCCGGTAT TGTCGCTGCC GGGTGGGTCA
CGCGTAACGC TTTCGCGGGC TCCAGCCAAT GCCGCCAACA GACCGCTGAC GCTGGGCATT
CGCCCCGAAG ACATCACCTT CGGTGGCGAA AACGGTGTGG ATGCCGTGGT CAAGGTGGTC
GAACCCACGG GATCGGAAAC CCATGTCGCC GTGGAGCTCG AAGGCAGGGA ACTGACATGG
GTCGTTCGCG AACGTGTCGA GCTGGTGCCG GAACAGCCGG TGAAGCTTTC TTTCGAAACG
GCCAAGGTTC ACTTCTTCGA CCGGCAGACG CAGCAGCGCC TGAACGCCTG A
 
Protein sequence
MSGLSIRNVK KSFGTVDIIH GVDVEIADGE FTILVGPSGC GKSTLLRMIA GLEDITAGQI 
SIDGRVVNNL QPKDRDIAMV FQNYALYPQM TVSQNMGFAL ELAGVKRPEI EQKVGEAAAI
LGLQPLLDRK PAQLSGGQRQ RVAMGRAIVR DPKVFLFDEP LSNLDAKLRV KMRAEIKALH
QRLKTTIVYV THDQIEAMTM ADKIVVLHGG RVEQIGSPLE LYDRPRNIFV AGFLGSPAMN
FLEGTLEGAG NPVLSLPGGS RVTLSRAPAN AANRPLTLGI RPEDITFGGE NGVDAVVKVV
EPTGSETHVA VELEGRELTW VVRERVELVP EQPVKLSFET AKVHFFDRQT QQRLNA