Gene Rleg2_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2034 
Symbol 
ID6980773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2096195 
End bp2097175 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content61% 
IMG OID643396756 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_002281544 
Protein GI209549627 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0584323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCGC TCGACATCAA TGAACACAGG CTTTCGTCCG GAGCCTGGCT GAGCAAGCTC 
AAGGGAGCAA CCGGCCCGCT CGTCGGACTG CTCGCGCTGT GCGTCTTTCT GAGCCTGAGC
ACCGACACGT TTCTTTCGGT TCGAAACGGC CTCAACATCC TCGATCAGAT CACCGTTCTC
GGCATCATGG CGGTTGGAAT GACCTTCGTC ATCCTAATCG GCGGCATCGA TCTCTCGGTC
GGCTCGGCGC TTGCCCTGGC GATGATGGTC ATGGGCTGGA CCGCCAATGT CGCCGGCCTG
CCGCTGCCGG TCGCGATCGC TTTTGCTCTG GTCGCATCGG GAGTTTCGGG CCTGATCGTC
GGACTTCTGG TGACGCAGTT CAGGGTCCCG GCCTTTATTG CCACTCTTGC GATGATGTCC
GCCGCTCGCG GGGTCGCCAA CATGATCACC GACGGTCAGC AGATCGTCGG ATTCCCGGAC
TGGTTCATGA TGCTGGCAAT CGATCGTCAT TTCGGCGTGT TGACCGCCAC CGTGTTTCTC
ATGCTTGCGG TGGTTCTTGC GGCATGGCTT TTCCTGCACT TCCGCTCCGA AGGGCGCATG
CTCTATGCGG TCGGCGGAAA TCCGGAAGTC GCGCGCCTTG CGGGTATCAA CGTCCCGCTC
GTGACGATTG GCGTCTACGT CGTAAGTTCA GTCCTTGCCG GCCTCGCAGG CATCGTACTC
GCCGCCAGGC TGGATTCCGT CCAACCATCA AGCGGTCTGG GCTATGAGCT GGACACCATC
GCCGCGGTCG TCATCGGCGG CACGTCGCTC TCCGGCGGCG CCGGCGGGAT AGGAGGAACA
TTGATCGGTG TTCTTATCAT CGGCGTCCTT CGCAACGGGC TCAATCTTCT CAACGTCTCG
CCGTTCCTGC AGCAGGTGAT CATCGGCATC GTCATCGTGC TCGCGGTCGG CGCGGAGACT
ATTCGTCGGC GTCGCGCTTG A
 
Protein sequence
MVALDINEHR LSSGAWLSKL KGATGPLVGL LALCVFLSLS TDTFLSVRNG LNILDQITVL 
GIMAVGMTFV ILIGGIDLSV GSALALAMMV MGWTANVAGL PLPVAIAFAL VASGVSGLIV
GLLVTQFRVP AFIATLAMMS AARGVANMIT DGQQIVGFPD WFMMLAIDRH FGVLTATVFL
MLAVVLAAWL FLHFRSEGRM LYAVGGNPEV ARLAGINVPL VTIGVYVVSS VLAGLAGIVL
AARLDSVQPS SGLGYELDTI AAVVIGGTSL SGGAGGIGGT LIGVLIIGVL RNGLNLLNVS
PFLQQVIIGI VIVLAVGAET IRRRRA