Gene Rleg2_6321 details

Gene Information

Locus tag: Rleg2_6321
Symbol:
ID: 6983394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011370
Strand:
Start bp: 271909
End bp: 272979
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 57%
IMG OID: 643399324
Product: extracellular solute-binding protein family 1
Protein accession: YP_002284080
Protein GI: 209552164
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
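
The coordinates and lengths listed in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 271909
end_bp = 272979
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 272979 - 271909 + 1 = 1071 bp, and 1071 / 3 = 357 codons, of which 356 encode amino acids.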


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.374689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT ACCTACTTGC CACCACTATG ATCGTGTCGA TCGCAACCGC AGCGCGGGCC 
GATGTCGTTG TGATGTCATG GGGTGGCGAC TACGGTGCCG GACAGATCGC TGCCTTCAAC
AAGCCGTTTA CGGAGCAGAC CGGGATCAAA TCCAGCATGG TCGATTCCGA TAACCCGACA
GCGCTGATCA AGTCCATGGT TGAAGCCAAA AACGTGACCG TCGACGTCGT TGAAGTCGAG
TATCCCGATG CGATCCGCGG ATGCGACGAG GGCCTGCTCG AACCCGTCGA TCCCGCCATC
CTCCCGGCAG GCTCAGACGG AACCGCTGCC AACGACGACT TCATGAAAGG CGCGGTCACC
GAGTGCGGCG TAGCGACAGT CGTCTATTCA TGGGTCTTTG CCTACGACAA CAAAAAATTC
ACTGACGGTC CGAAGACCGT AGCGGACTTC TTCGACACCA AAAAATTTCC GGGAAAACGC
GCTCTTCGGA AACAGCCGAA ATTCGCACTT GAAATGGCGC TTATCGCCGA CGGAGTTTCC
ACGGCGGATG TCTACAAGGT CCTCAATACC AAAGAAGGCG TTGACCGTGC CTTCGCAAAG
CTCGGCACGG TGAAGGGCGA CTTGATCTGG TACCAGGCGA ATGCGGAGGC AGCACGCTTG
CTGGCAGATG GAGAGGTAGT GATGTCTTCG GGCTCCGCAA ACCGCTTCTT CAACGCCGCA
GTATCCGAAG GGAAGCCCTT CACCACAGTG TGGGACGGGC AAATCTACGA CTTTGCCATG
TTCGTAATTC CCAAGGGAGC TCCGCATCTC GACGAAGCGA AGAAGTACTT GGCCTTCGCA
ACTGACACGA AGCAGCTTGC CGCGATGGCC ACGGAACTTC CTTTCGGTCC AGCAAGAATG
TCTGCGGTCC CGCTCGTGCA TTTCTTCAAA GACGGAAAGA CCGACATTCG CCCGCACATG
CCCACTAACC CCGACAACCT GAAGAACGGT CTCGCCGTGT CTTCGGATTT CTGGGCCGAT
CACGAAGCCG AATTGACGGA GCGCTTCAAC GCGTGGCTCG CCACGAATTG A
 
Protein sequence
MKKYLLATTM IVSIATAARA DVVVMSWGGD YGAGQIAAFN KPFTEQTGIK SSMVDSDNPT 
ALIKSMVEAK NVTVDVVEVE YPDAIRGCDE GLLEPVDPAI LPAGSDGTAA NDDFMKGAVT
ECGVATVVYS WVFAYDNKKF TDGPKTVADF FDTKKFPGKR ALRKQPKFAL EMALIADGVS
TADVYKVLNT KEGVDRAFAK LGTVKGDLIW YQANAEAARL LADGEVVMSS GSANRFFNAA
VSEGKPFTTV WDGQIYDFAM FVIPKGAPHL DEAKKYLAFA TDTKQLAAMA TELPFGPARM
SAVPLVHFFK DGKTDIRPHM PTNPDNLKNG LAVSSDFWAD HEAELTERFN AWLATN
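
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation sketch. This is a minimal illustration, not the method the database used: it assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons), and the function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced here. Only the first and last lines of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codons in the conventional T, C, A, G ordering, paired with their amino acids.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGAAGAAAT ACCTACTTGC CACCACTATG ATCGTGTCGA TCGCAACCGC AGCGCGGGCC"
last_line = "CACGAAGCCG AATTGACGGA GCGCTTCAAC GCGTGGCTCG CCACGAATTG A"

print(translate(first_line))
print(translate(last_line))
```

Each full display line holds 60 bases, so every line starts in frame. The two printed strings correspond to the first 20 and last 16 residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` would give a value to compare with the reported GC content.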