Gene Rleg2_4987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4987 
Symbol 
ID6978081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp631492 
End bp632607 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content59% 
IMG OID643394133 
ProductABC transporter related 
Protein accessionYP_002278951 
Protein GI209547033 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC TCACAATTGA TAAGATTACC AAGAACTTCG GCGGCCTCAC GGTCATCCCG 
GAACTGTCAC TCACGATCAA TGACGGCGAA TTCTGCGTGC TCGTAGGCCC CTCCGGGTGT
GGGAAGTCGA CCCTGCTGCG TATCATCGCT GGCCTGGAGC CCATTTCGTC GGGACGTCTC
CTCATCGATG GCGTCGACAT GAGCGGAGCG GAGCCTCCCG AACGAGGCGT GGCAATGGTC
TTCCAATCCT ACGCCCTATA TCCCCACATG GACGTCGGCC GGAATATCGG CTTCGGCCTT
GAAATTGCCC ATACACCGAA GGCCGACATC GCCGAGCGCG TTGGCAAAGC TGCTGACAAG
CTACGTCTGA CCAACTACCT ACGACGCAAA CCCCGCGAGC TTTCCGGTGG CCAGAGGCAG
CGTGTTGCCA TCGGCCGTGC GATGACGCGA AAGCCCAGGC TTTTCCTCCT TGATGAGCCG
CTCTCTAATC TCGACGCCGC CCTACGCGTA GGTATGCGCG TCGAGATCGC CCGTTTGAAG
GCAGAGCTCG CATCTACGAT GATCTACGTC ACCCACGACC AGGTCGAAGC CATGACTTTG
GCTGACCGTA TCGTGGTTAT GAACGCCGGA CGGGTCGAGC AGGTCGGGTC TCCCCTCGAC
CTGTACGAAG AGCCCAGCAA CCTGTTCGTT GCGGGCTTTA TCGGCTCTCC TGCAATGAAT
TTTCTGCAGG GAAAGATTGC TGCTGTGAGC GGCGATGTAG CGACGGTCGC CCTCGACATC
GGCCCGACCG TTCGGGTTCC TTTGCTGTAT GCAACGACAG TTGGCGACCA GGTCACCCTG
GGCATTCGGC CAGAACATAT TCCGCTCCGT CGAGACCCGC AGTCTGGGCA CACCTTCAAT
GCTTCGATGG TGGAGATGCT TGGTAGTGAT ACGTTCATCC ACGTCCGGCA GGGTGAAGAA
AGCGTGATCA TTCGCGACAG CCAAGGCCAT CGCCGTCGCA CAGGCGAACC GGTGACTATC
GAGCTGCCAA CGGGCGCTTG CTACCTTTTC GACACCAAGG GTCGCCGAAT TTCACAACGA
CTTCAGCCCC GCGCACTCTC GGGCACCGCG GCCTGA
 
Protein sequence
MATLTIDKIT KNFGGLTVIP ELSLTINDGE FCVLVGPSGC GKSTLLRIIA GLEPISSGRL 
LIDGVDMSGA EPPERGVAMV FQSYALYPHM DVGRNIGFGL EIAHTPKADI AERVGKAADK
LRLTNYLRRK PRELSGGQRQ RVAIGRAMTR KPRLFLLDEP LSNLDAALRV GMRVEIARLK
AELASTMIYV THDQVEAMTL ADRIVVMNAG RVEQVGSPLD LYEEPSNLFV AGFIGSPAMN
FLQGKIAAVS GDVATVALDI GPTVRVPLLY ATTVGDQVTL GIRPEHIPLR RDPQSGHTFN
ASMVEMLGSD TFIHVRQGEE SVIIRDSQGH RRRTGEPVTI ELPTGACYLF DTKGRRISQR
LQPRALSGTA A