Gene Rleg_6388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6388 
Symbol 
ID8017002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp99909 
End bp100892 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content62% 
IMG OID644828183 
Productaliphatic sulfonate ABC transporter substrate-binding protein 
Protein accessionYP_002979383 
Protein GI241554170 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.269939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.249456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTCC ACCCCCGCCA CCTTCTCCTG CCGGCAGTGA TTGCTCTCGG CCTAACCTCA 
CCCGCCGCGG CCGCTGATAC GGTGAAGCTG CGCTATCTCG CCAGCCAGGG TGGTCTTGCC
GCTCATGAAC TCGCCGCCGA ACTCGGCTAC TTCGACGGAA CGGGCATCAC GCTCGAAAAT
GTCGGCTACG CCCAGGGCGG CCCGGCCTCC CTGATCGCCC TGGCATCGGG TGATGTCGAG
ATCGGCAGCG CCGCAACTTC CGCCGTCCTG AATTCGATCA TCGGCGGCAA TGATTTCGTC
GCCGCCTATC CCTCAAACGG CATCAATAAC GAGGTGCAGT CGACCTTCTA CGTGCTGGAA
GACAGCCCGA TCAAGAGTAT CAAGGACATT GCCGGCAAGA GCATCGCGGT CAATACGCTC
GGCGCGCATC TCGACTATAC CATCCGCGAA GCCCTGCATT CCGTAGGCCT ACCGGCGGAC
GCAGCCAATC AGCTTGTCGT TCCCGGGCCG CAGCTCGAGC AGGTGCTGCG CTCCAAACAG
GTCGATATTG CCGCCTTCGG CTACTGGCAG ACGACCTTCG AAGGGGCTGC GCTGAAGAAT
GGCGGCCTGC GTCCGATTTT CGACGATACC GACGTGCTTG GAGACATCGC CGGGGGCTTC
GTGGTCCTGC GCCGCGATTT CGCTCGCGAA CATCCTGAAG CTGCAAAGAT TTTCGTCGAG
CAGTCGGCTC GCGCTCTCGA TTACGCCCGC GAGCATCCGG AGGAAACCAA GAAGATCCTC
GCCAAGGCGC TCAGCGAGCG TGGTGAGAAC GCGGATATCG CACAGTACTT CCGAGGCTAT
GGGGTGCGTG CCGGCGGCCT GCCGATCGAG CGCGACATCC AGTTCTGGAT CGACGTCCTG
GTTCGCGAAG GCAAGCTGAA GCAAGGCCAG CTGGCCGCCA AGGACATCCT CTTGACCGTC
GACGCCAAGC CGGCAAGCAA CTGA
 
Protein sequence
MTFHPRHLLL PAVIALGLTS PAAAADTVKL RYLASQGGLA AHELAAELGY FDGTGITLEN 
VGYAQGGPAS LIALASGDVE IGSAATSAVL NSIIGGNDFV AAYPSNGINN EVQSTFYVLE
DSPIKSIKDI AGKSIAVNTL GAHLDYTIRE ALHSVGLPAD AANQLVVPGP QLEQVLRSKQ
VDIAAFGYWQ TTFEGAALKN GGLRPIFDDT DVLGDIAGGF VVLRRDFARE HPEAAKIFVE
QSARALDYAR EHPEETKKIL AKALSERGEN ADIAQYFRGY GVRAGGLPIE RDIQFWIDVL
VREGKLKQGQ LAAKDILLTV DAKPASN