Gene Rleg2_4780 details

Gene Information

Locus tag: Rleg2_4780
Symbol:
ID: 6977874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 414152
End bp: 415270
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 61%
IMG OID: 643393944
Product: ABC sugar transporter, periplasmic ligand binding protein
Protein accession: YP_002278762
Protein GI: 209546844
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
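
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 414152
END_BP = 415270


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.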


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.462835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATA GACGAATGTT TCTGTGCGGC GCTGCCGCCG TGTTGACGAT CGGCCTGGTG 
GGACCGGCTT TTGCCGATCC GGACTCGGCC TTGGCAAAAC TGCAGGAAAG CGTCCTGTCG
AAGGGACCGT CGGGCGAAAG CCCGTCACCG GCCTCAGGCA TCAGTTTGAG CGATGAGGAA
CTCGGCAAGA TCAAGGCGAT GAACGCCACG GCGGCCATTG TCATGCATTA TGGCGGCAAC
GACTGGTCGC GGGCGCAGAT CAACGGTCTG CAGACGCAGT TCAAGACGAT GGGCATCAAG
GTGATCGCGG TCACCGATGC CGGTTTCAAG CCGGAAAAGC AGGTGGCCGA CCTCGAAACG
ATCATGGCGC AGAAGCCCAA TGTCATCGTT TCAATCCCGA CCGACCCGGC AGCGACGGCA
AAGGCCTACA AGGCCGCAGC CGATGCCGGT GTCAAGCTGG TGTTCATGGA CAACGTCCCG
GCGGGCTTCA AGGCGGGAAG TGACTACGTT TCCGTCGTCT CCGCAGACAA TTACGGCAAC
GGCGTCGCCT CCGCCCATCT CATGGCAAAA TCGTTGAACG GCGAAGGCGA AATCGGCGTA
GTCTTCCACG CAGCCGACTT CTTCGTCACG AAGCAGCGCT ACGACGCCTT TAAGGCGACG
ATCGCCTCCG ACTATCCGAA GATCAAGATC GTCGCCGAAC AGGGGATCGG CGGCCCGGAC
TTTTCAGGTG ACGCAGAAAA GGCGGCTTCT GCAATTCTGA CCTCCAATCC CAACGTCAAG
GGCATCTGGG CCGTCTGGGA TGTACCGGCA GAAGGCGTGA TCGCCGCGGC GCGCAATGCC
GGCCGTGACG ATCTCGTTAT CACCACGATC GACCTCGGCG AGAATGTCGC GATCTCGATG
GCGCAGGGCA GTTTCGTCAA GGGCCTCGGA GCACAACGCC CGTTCGATGC CGGCGTTGTC
GAAGCGAAAC TCGCAGGCTA TGCCCTCGTC GGCAAGGAAG CACCCGCCTT CGTGGCGCTG
CCAGCCCTAC CAGTCACCCG CGACAACCTG CTCGATGCCT GGAAGACCGT CTACTCCACC
GAGGCGACGG CCAACATCAA GACCAGCCTC GGCCAATAA
 
Protein sequence
MTNRRMFLCG AAAVLTIGLV GPAFADPDSA LAKLQESVLS KGPSGESPSP ASGISLSDEE 
LGKIKAMNAT AAIVMHYGGN DWSRAQINGL QTQFKTMGIK VIAVTDAGFK PEKQVADLET
IMAQKPNVIV SIPTDPAATA KAYKAAADAG VKLVFMDNVP AGFKAGSDYV SVVSADNYGN
GVASAHLMAK SLNGEGEIGV VFHAADFFVT KQRYDAFKAT IASDYPKIKI VAEQGIGGPD
FSGDAEKAAS AILTSNPNVK GIWAVWDVPA EGVIAAARNA GRDDLVITTI DLGENVAISM
AQGSFVKGLG AQRPFDAGVV EAKLAGYALV GKEAPAFVAL PALPVTRDNL LDAWKTVYST
EATANIKTSL GQ
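
The sequences above are laid out in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction from the result. It is not part of the original record, and `clean_sequence` and `gc_fraction` are hypothetical helper names introduced for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCAATA GACGAATGTT TCTGTGCGGC GCTGCCGCCG TGTTGACGAT CGGCCTGGTG"
joined = clean_sequence(first_line)
print(len(joined))  # 60
print(round(gc_fraction(joined), 2))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.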