Gene Rleg_3393 details


Gene Information

Locus tag: Rleg_3393
Symbol:
ID: 8014270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3409991
End bp: 3411229
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 60%
IMG OID: 644825951
Product: extracellular solute-binding protein family 1
Protein accession: YP_002977178
Protein GI: 241206082
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
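
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3409991, 3411229)
print(length_bp)                  # 1239
print(protein_length(length_bp))  # 412
```

Under those assumptions the computed values match the Gene Length (1239 bp) and Protein Length (412 aa) fields in the record.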


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.205631
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAT CCCTTAACAA GACGCTTCTC GGTGCGGCAT TGATCGGCGC ATCCCTTGCG 
CCGCATGCTT TCGCCGAAAC GACGCTGAAC GCGCTTTTCA TGGCCCAGGC CGCCTATAGC
GAGGCCGATG TGCGCGCCAT GACCGACGCC TTCGCCAAGG CGAACCCCGA TATCAAGGTC
AATCTCGAAT TCGTTCCCTA TGAAGGCCTG CACGACAAGA CGGTGCTGGC GCAGGGTTCC
GGCGGCGGTT ACGACGTTGT CCTCTTCGAC GTCATCTGGC CGGCAGAATA CGCCAGCAAC
AAGGTGCTGG TCGACGTCTC CTCTCGCGTC ACCGACGAGA TGAAGAAAGG TGTGCTGCCG
GGAGCCTGGA CCACCGTGCA ATATGATAGC AAATATTACG GCATGCCGTG GATCCTCGAC
ACCAAATACC TGTTCTACAA CAAGGAGATC CTCGAAAAGG CCGGCATCAA GACTCCGCCC
AAGACCTGGG ACGAGCTGAC CGAACAGGCA AAGACCATCA AGGACAAGGG CCTGCTCGCC
ACGCCGATCG CCTGGAGCTG GTCGCAGGCC GAAGCCGCGA TCTGCGATTA CACCACGCTC
GTCAGCGCCT ATGGCGGCGA TTTCCTGAAG GACGGCAAGC CGGCCTTCCA GACCGGCGGT
GGCCTCGATG CACTGAAATA CATGGTCTCC AGCTATTCCT CGGGCCTCAC CAATCCGAAC
TCCAAGGAAT TCCTCGAAGA GGACGTCCGT AAGGTCTTCG AAAACGGCGA TGCCGCCTTC
GCGCTGAACT GGACCTACAT GTACAACATG GCCAACGATC CGAAGGACAG CAAGGTCGCA
GGCAAGGTCG GCGTCGTGCC GGCGCCGGGT GTTGCCGGCA AAAGCGAGGC TTCGGCCGTC
AACGGCTCGA TGGGCCTCGG CATCACCTCG GCCAGCAAGC ATCCTGATGA GGCCTGGAAA
TACATCACCT TCATGACCTC GCAGGCGACG CAGAATGCCT ATGCCAAGCT CAGCTTGCCG
ATCTGGGCGT CCTCCTATGA GGACCCTGAT GTCACCAAGG GTCAGGAAGA ATTGATCTCC
GCCGCCAAGA TCGGCCTTGC CGCGATGTAT CCGCGTCCGA CGACGCCGAA ATATCAGGAG
CTCTCGACCG CGCTGCAACA GGCGATCCAG GAATCGCTGC TCGGCCAGTC CTCTCCCGAA
GATGCGCTGA AGTCGGCGGC CGACAATAGC GGCCTCTGA
 
Protein sequence
MLKSLNKTLL GAALIGASLA PHAFAETTLN ALFMAQAAYS EADVRAMTDA FAKANPDIKV 
NLEFVPYEGL HDKTVLAQGS GGGYDVVLFD VIWPAEYASN KVLVDVSSRV TDEMKKGVLP
GAWTTVQYDS KYYGMPWILD TKYLFYNKEI LEKAGIKTPP KTWDELTEQA KTIKDKGLLA
TPIAWSWSQA EAAICDYTTL VSAYGGDFLK DGKPAFQTGG GLDALKYMVS SYSSGLTNPN
SKEFLEEDVR KVFENGDAAF ALNWTYMYNM ANDPKDSKVA GKVGVVPAPG VAGKSEASAV
NGSMGLGITS ASKHPDEAWK YITFMTSQAT QNAYAKLSLP IWASSYEDPD VTKGQEELIS
AAKIGLAAMY PRPTTPKYQE LSTALQQAIQ ESLLGQSSPE DALKSAADNS GL
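
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch of how such blocks might be cleaned, translated, and checked for GC content follows. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in permitted start codons). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration; only the first 30 bases and the last 9 bases of the gene sequence are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order. Table 11 uses the same
# assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTGAAAT CCCTTAACAA GACGCTTCTC"))  # MLKSLNKTLL
print(translate("GGCCTCTGA"))                         # GL
```

The two printed fragments agree with the start (MLKSLNKTLL) and end (GL) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.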