Gene Rleg2_6392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6392 
Symbol 
ID6983464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp38256 
End bp39896 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content62% 
IMG OID643399390 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002284146 
Protein GI209552231 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG AAATATACGA TGCATTGATG CGCCGTCAGG CAAGCCGCCG CGACGTCTTG 
CGCGGAACGG CAAGTGCGGC GGCTCTGCTC GGCCTGTCCG GCGCGATGGG CGGAATGCCG
GGCATGGCAT TCGCCGCCGA TGATCTGCGC GCCCAGATCC TGCAGATTCC GGGCGTCGGC
AAGGGCTCGC CGACGGACGC GGACTGGCAG AAAGTCGGCG AGCTTTGCCT CGGTCCGACC
AAGGCCAACG TCAAGCAGGG CGAATTCGCC GATGTCGAGC TGACCTTCAT GGGGCTCAAC
AACCAGAACC TGCACAACTT CCTGTTCCGC GGCTTCCTGA AACCGTGGGA AGCCTATACC
GGCGCCAAGA TCAACTGGAT CGACCTTGCG CAAGCCGACT ACAATGCCCG CCTTCAGCAG
TCGATCGCGA CCGGCACGGT CGATTTCGAC ATTCTGGAAA TGGGCGCACC CTTCGAAGGC
GATACCGCCG GACGAGGGCT TCTGGACGAG ATGCCTGACT GGGTTGCAAA GCAGATCGAG
GCTGACGACC TGGTGAGTTA CCTGAAGCCA CCTGTGGGAA CCTGGGACGG CAAGACCTAC
CGCGTGACGA TCGACGGCGA CTGCCACACC TTCGCCTATC GCAAGGATTA CTTCGGCGAA
GGCTCGATCA GCGGCATGGC CGAGCCGCCG AAGACATGGC AGGAAGTGAA CGCGGCCTCC
AAGGCGCTGA TCGGCAAGAC CGATCCGCTG ACCAGTCAGC CAGCCTACGG CTATCTCGAC
CCGCTCAAGG GCTGGGGCGG TTTCGGCTTC TATTTCATCG AGAACCGCGC GACGGCCTAT
GCCAAATATC CGGGTGATCC GGCCTGGCTG TTCGATCCTG AAAACATGAA GCCGCTTGTC
AATAACCCCG CATGGGTCCA GGCGATCCAG GACGTCCTCG ATCTGATCGC GGCCAAGGCC
TATCCGGCCG ATCAGATCAA TGCCGATCCC GGCACCACCG CCTTCTCGCA GTTCCTGGCG
GGCACCGGCG CAATGTTGAT GTGGTGGGGC GACGTCGGCT CCAGCGCGCG CACCTCGGAC
ACCTCGGTCG TCGGCGATGT GGTCGGTTTC GGCATCAACC GCGGCTCCAA CCGGGTCTAC
AACCGCAAGA CCGGGCAATG GGAAGACAAG TACAATGAAG CGCCCAACAT GGCCTATCTC
GGCTGGGGCA TCTATGTCAC CAAGCAGGTC TCCGGCGACG AGAAGAAGCG CAAGGCGGCC
TGGTCCGCCG CCGCCCATCT CGGCGGCAAG GATCTGTCAC TGTGGACGTC GGCCTATCCC
TCAGGTTTCC AGCCCTACCG CCAGTCCAAC TTCAACTACG ACGAATGGGA AAAGGCCGGT
TACGACCGCG CCTATATCGA GGATTATCTG GGTTCGAACG CGGACAGCTA TAACCACCCG
AACGCCGCCA TCGAACCGCG CATTCCCGGT ATCTTCCAAT ATTACTCGGT CGCCGAGGAC
GAATTGGCAA AGGGTTTTGC CGGGCAATAC AAGTCGGCGC AGGAGACCGC GGATGCGATT
GCGGCCGCCT GGGAAAAGAT CACCGACCAG ATCGGCCGCG ACAGCCAGCT GAAACTCTAT
CGGGCGAGCC TCGGGATCTG A
 
Protein sequence
MKVEIYDALM RRQASRRDVL RGTASAAALL GLSGAMGGMP GMAFAADDLR AQILQIPGVG 
KGSPTDADWQ KVGELCLGPT KANVKQGEFA DVELTFMGLN NQNLHNFLFR GFLKPWEAYT
GAKINWIDLA QADYNARLQQ SIATGTVDFD ILEMGAPFEG DTAGRGLLDE MPDWVAKQIE
ADDLVSYLKP PVGTWDGKTY RVTIDGDCHT FAYRKDYFGE GSISGMAEPP KTWQEVNAAS
KALIGKTDPL TSQPAYGYLD PLKGWGGFGF YFIENRATAY AKYPGDPAWL FDPENMKPLV
NNPAWVQAIQ DVLDLIAAKA YPADQINADP GTTAFSQFLA GTGAMLMWWG DVGSSARTSD
TSVVGDVVGF GINRGSNRVY NRKTGQWEDK YNEAPNMAYL GWGIYVTKQV SGDEKKRKAA
WSAAAHLGGK DLSLWTSAYP SGFQPYRQSN FNYDEWEKAG YDRAYIEDYL GSNADSYNHP
NAAIEPRIPG IFQYYSVAED ELAKGFAGQY KSAQETADAI AAAWEKITDQ IGRDSQLKLY
RASLGI