Gene Rleg2_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2138 
Symbol 
ID6980877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2200398 
End bp2201393 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content59% 
IMG OID643396859 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_002281647 
Protein GI209549730 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0458223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.188194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGT TTCTGATTCC AGCCGTCTTT GCCGCGGCCC TTTCTATCGT TGCGCCGGCC 
GAAGCTGCAG AGTGCGGCAA GGTGTCGATC GCCGAGATGA AATGGGCCTC GGCCGGTATT
GCGGCAAATT TCGACAAGAT CATCCTGGAA AAGGGCTACG GCTGCTCGGT CACAATCGTC
GACGGCGACA CGCTGCCGAC CTTCGCCTCG ATGAACGAGA AAGGCACTCC GGACATCGCC
TCTGAATATT GGATCAATTC CGTCAGGGCC TTGCTCGATC AGGCCGTCAA CGCCGGACGG
CTGGTGCAGG GAGCCGAGAT CCTGGCCGAC GGTGCGGTCG AGGGCTGGTG GATCCCGAAA
TTCATCGCCG ACGCCAATCC CGACATCCGG TCGGTCGAAG ATGCGCTGAA ACATCCCGAA
CTCTTCCCCG CCGAGGACGA TGCGTCGAAG GGCGCGGTCT ACAATTGTCC CCCCGACTGG
AGCTGCCAGA TATCGACCAC TAATCTGTTC AAGGCGCTTG CCGCGGACAA GAAGGGCTTC
GAACTCGTCG AAACCGGCAG CCCCGAACGG CTCGATGCCT CGATTGCCCG TGCCTTCGAA
AACAAGGTCG GCTGGCTCGG TTATTATTGG GCGCCGACGG CCGTCCTCGG CAAATACGAC
ATGACGCGGC TGAGCTTCGG CGTCGGCCAC AACAAGACCG AGTGGGACCG CTGCACGGCA
GTTGCCGGCT GCATGAGGCC GGAACTCAAT TCCTACCCGG TATCGCGCGC CTTCACCTTG
ATGACCAGGT CTTTTGCCAG CCGCTCAGGA CCTGTCACCA CCTATCTCAA AACCCGCAAA
TGGGACAATC AGACGATCAA TCAGGTTCTC GCCTGGCAAG ACGAAAACCA CGAAAGCAAC
GAGGATGCCG CCATCCATTT CCTTCGCAAT TACGAGGGTC TGTGGATGAA ATGGGTTCCG
GCCGATGTAG CCGAAAAGGT CAAGGCGAGC TTATAA
 
Protein sequence
MKTFLIPAVF AAALSIVAPA EAAECGKVSI AEMKWASAGI AANFDKIILE KGYGCSVTIV 
DGDTLPTFAS MNEKGTPDIA SEYWINSVRA LLDQAVNAGR LVQGAEILAD GAVEGWWIPK
FIADANPDIR SVEDALKHPE LFPAEDDASK GAVYNCPPDW SCQISTTNLF KALAADKKGF
ELVETGSPER LDASIARAFE NKVGWLGYYW APTAVLGKYD MTRLSFGVGH NKTEWDRCTA
VAGCMRPELN SYPVSRAFTL MTRSFASRSG PVTTYLKTRK WDNQTINQVL AWQDENHESN
EDAAIHFLRN YEGLWMKWVP ADVAEKVKAS L