Gene Rleg_5977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5977 
Symbol 
ID8016335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp6387 
End bp8111 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content58% 
IMG OID644827289 
Productputative sugar ABC transporter, substrate-binding protein 
Protein accessionYP_002978489 
Protein GI241258605 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGGC ATTTATTGAC GACGACAGCA GCCGTGCTGC TGGCCATGAC CGGGTCGGCC 
TATGCCGGCA TGGACGAGGC AAAAACTTTT CTGGATAAAG AGATCGGCGA CGTGTCGACG
CTCTCTCGCG CCGACCAGGA AAAGGAAATG CAGTGGTTCA TCGATGCTGC GAAGCCTTTC
CAGGGGATGG ACATCAAGGT TGTTTCGGAA ACCTTGACCA CCCACAAGTA TGAGTCCGAG
GTTCTGGCTC CGGCCTTTAC TGCGATTACC GGGATAAAGG TCACGCACGA CGTCATTCAA
GAGGGTGACG TTGTCGAGAA GATCCAGACG CAGATGCAGA CCGGGCAGAA CCTTTATGAC
GGCTGGGTCA ACGACTCCGA CTTGATCGGC ACCCACTGGC GCTATCAGCA GGTGCGCAAC
CTGACCGACT GGATGGCCGG CGAAGGCAAG GACGTTACCA ACCCCGGCCT CGATATCGAC
GACTTCATCG GCACCAAGTT CACGACAGCG CCGGACAAGA AGCTCTACCA GCTTCCCGAC
CAGCAGTTCG CCAACCTCTA CTGGTTCCGT TACGACTGGT TCAACGACGA GAAGAACAAG
GCCGATTTCA AGGCGAAATA CGGCTACGAT CTCGGCGTTC CGGTCAACTG GTCGGCCTAC
GAGGACATTG CCGAATTCTT TACCGGCCGT GACGTCAACG GCAAGAAGGT CTTCGGCCAC
ATGGATTACG GCAAGAAGGA CCCGTCGCTC GGCTGGCGCT TCACCGACGC TTGGCTGTCG
ATGGCTGGCA ATGGTGACAA GGGTCTGCCG AACGGTCTTC CGGTCGATGA ATGGGGTATC
AAAGTCGACG AGAATTCGCG TCCCGTCGGT TCGTGCGTCG CGCGTGGCGG CGATACCAAC
GGCCCTGCAT CGGTCTATTC GATCCAGAAG TATCTCGATT GGTTGAAGGC GTACGCACCG
CCGGAAGCTC AGGGCATGAC CTTCTCGGAA TCCGGTCCGG TGCCGGCGCA GGGTAACGTC
GCGCAGCAGA TCTTCTGGTA CACGGCGTTC ACCGCAGACA TGGTCAAGCC TGGCCTGCCT
GTCATGAACG AAGACGGTAC GCCGAAATGG CGCATGGCAC CGAGCCCGCA TGGCGTTTAC
TGGAAAGACG GCATGAAGCT TGGCTATCAG GACGTCGGTT CCTGGACGCT GATGAAATCG
ACCCCGACGG ACCGCGCAAA AGCCGCGTGG CTCTATGCAC AGTTCGTGAC CTCGAAGACG
ATTGACGTGA AGAAGAGCCA GCTCGGTCTC ACCTTCATCC GCGAATCCAC CATTCGCGAC
AAGAGCTTTA CGGAGCGCGC TCCGAAGCTC GGCGGTCTGA TCGAGTTCTA TCGTTCGCCT
GCGCGTGTTC AGTGGTCGCC AACCGGCACC AACGTCCCCG ACTATCCGAA GCTGGCTCAG
CTTTGGTGGC AGGCTATCGG TGACGCCGCT GCCGGCGCCA AGACGCCGCA GGAAGCCATG
GATTCTCTTT GCGGCGAGCA GGAGAAGGTC ATGGGACGTA TCGAGAAATC GGGCGTCCAG
GGCGATATCG GCCCGAAACT CGCTGAAGAG CAAGACCTCG CTTACTGGAA CGCGGATGCG
GTGAAGAAGG GCAACCTTGC ACCGCAGCTG AAGATCGAGA ACGAGAAGGA AAAGCCGGTC
ACGGTCAACT ACGACGAACT CGTCAAGAGC TGGCAGTCGA AGTAA
 
Protein sequence
MRRHLLTTTA AVLLAMTGSA YAGMDEAKTF LDKEIGDVST LSRADQEKEM QWFIDAAKPF 
QGMDIKVVSE TLTTHKYESE VLAPAFTAIT GIKVTHDVIQ EGDVVEKIQT QMQTGQNLYD
GWVNDSDLIG THWRYQQVRN LTDWMAGEGK DVTNPGLDID DFIGTKFTTA PDKKLYQLPD
QQFANLYWFR YDWFNDEKNK ADFKAKYGYD LGVPVNWSAY EDIAEFFTGR DVNGKKVFGH
MDYGKKDPSL GWRFTDAWLS MAGNGDKGLP NGLPVDEWGI KVDENSRPVG SCVARGGDTN
GPASVYSIQK YLDWLKAYAP PEAQGMTFSE SGPVPAQGNV AQQIFWYTAF TADMVKPGLP
VMNEDGTPKW RMAPSPHGVY WKDGMKLGYQ DVGSWTLMKS TPTDRAKAAW LYAQFVTSKT
IDVKKSQLGL TFIRESTIRD KSFTERAPKL GGLIEFYRSP ARVQWSPTGT NVPDYPKLAQ
LWWQAIGDAA AGAKTPQEAM DSLCGEQEKV MGRIEKSGVQ GDIGPKLAEE QDLAYWNADA
VKKGNLAPQL KIENEKEKPV TVNYDELVKS WQSK