Gene Rleg2_6464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6464 
Symbol 
ID6983535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp129405 
End bp131129 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content59% 
IMG OID643399461 
Productputative sugar ABC transporter, substrate-binding protein 
Protein accessionYP_002284217 
Protein GI209552302 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.724776 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGGC ATTTATTGAC GACGACAGCA GCGGTACTGC TGGCCATGAC CGGGTCGGCC 
TATGCCGGCA TGGACGAGGC AAAGACTTTC CTGGATAAAG AGGTCGGCGA CATGTCGACG
CTCTCTCGCG CCGACCAGGA AAAGGAAATG CAGTGGTTCG TCGATGCTGC GAAGCCCTTC
GCCGGCATGG AGATCAAGGT CGTTTCGGAA ACCTTGACCA CTCACCAGTA TGAGTCCCAG
GTTCTGGCTC CGGCCTTTAC TGCGATCACC GGCATCAAGG TCACTCACGA CGTCATTCAA
GAGGGTGACG TTGTCGAGAA GATCCAGACT CAGATGCAGA CCGGTCAGAA CCTTTATGAC
GGCTGGGTCA ACGACTCCGA CTTGATCGGT ACGCATTGGC GCTATCAGCA GGTGCGCAAC
CTGACCGACT GGATGGCCGG CGAAGGCAAG GACGTTACCA ACCCCGGCCT CGATATCGAC
GACTTCATCG GCACCAAGTT TACGACGGCA CCGGACAAGA AGCTCTACCA GCTTCCCGAC
CAGCAGTTCG CCAACCTCTA TTGGTTCCGC TACGACTGGT TCAACGACGA GAAGAACAAG
GCCGACTTCA AGGCGAAGTA CGGCTACGAC CTCGGCGTTC CGGTCAACTG GTCGGCCTAT
GAGGACATCG CCGAATTCTT TACCGGCCGT GACGAGAACG GCAAGAAGGT CTTCGGCCAC
ATGGACTACG GCAAGAAGGA CCCGTCGCTC GGCTGGCGCT TCACCGACGC CTGGCTTTCC
ATGGCCGGCA ACGGTGACAA GGGTCTTCCG AACGGTCTTC CGGTCGACGA ATGGGGTATC
AAGGTCGACG AAAAGTCGCG TCCCGTCGGC TCATGCGTCG CGCGCGGCGG TGATACCAAC
GGCCCGGCTT CCGTCTACTC GATCCAGAAG TATCTCGACT GGTTGAAGGC CTACGCACCG
CCGGAAGCTC AGGGCATGAC CTTCTCGGAA TCCGGTCCGG TTCCGGCACA GGGTAACGTC
GCGCAGCAGA TCTTCTGGTA CACGGCCTTT ACCGCAGACA TGGTCAAGCC TGGCCTGCCT
GTCATGAACG AAGATGGTAC GCCGAAATGG CGCATGGCAC CGAGCCCGCA TGGCGTTTAC
TGGAAAGACG GCATGAAGCT CGGCTATCAG GACGTCGGTT CCTGGACGCT GATGAAATCG
ACCCCGACGG ACCGTGCGAA GGCCGCCTGG CTCTATGCAC AGTTCGTGAC CTCGAAGACG
ATTGACGTGA AGAAGAGCCA GCTCGGCCTC ACCTTCATTC GCGAATCCAC CATTCGCGAT
AAGAGCTTTA CGGAGCGCGC TCCGAAGCTC GGCGGTCTGA TCGAGTTCTA TCGTTCGCCG
GCGCGTGTTC AGTGGTCGCC AACCGGCACC AACGTTCCCG ACTATCCGAA GCTGGCTCAG
CTTTGGTGGC AGGCAATCGG CGATGCGTCT TCCGGTGCGA AGACACCGCA GGAAGCCATG
GATTCTCTTT GCGGCGAGCA GGAGAAGGTC ATGGGCCGTA TCGAGAAATC CGGCGTCCAG
GGCGACATCG GCCCGAAACT GGCCGAAGAG CATGATCTCG CTTATTGGAA CGCGGATGCG
GTGAAGAAGG GCAACCTTGC GCCGCAGCTG AAGATCGAGA ACGAGAAGGA AAAGCCGGTC
ACCGTCAACT ACGACGAACT CGTCAAGAGC TGGCAGTCGA AGTAA
 
Protein sequence
MRRHLLTTTA AVLLAMTGSA YAGMDEAKTF LDKEVGDMST LSRADQEKEM QWFVDAAKPF 
AGMEIKVVSE TLTTHQYESQ VLAPAFTAIT GIKVTHDVIQ EGDVVEKIQT QMQTGQNLYD
GWVNDSDLIG THWRYQQVRN LTDWMAGEGK DVTNPGLDID DFIGTKFTTA PDKKLYQLPD
QQFANLYWFR YDWFNDEKNK ADFKAKYGYD LGVPVNWSAY EDIAEFFTGR DENGKKVFGH
MDYGKKDPSL GWRFTDAWLS MAGNGDKGLP NGLPVDEWGI KVDEKSRPVG SCVARGGDTN
GPASVYSIQK YLDWLKAYAP PEAQGMTFSE SGPVPAQGNV AQQIFWYTAF TADMVKPGLP
VMNEDGTPKW RMAPSPHGVY WKDGMKLGYQ DVGSWTLMKS TPTDRAKAAW LYAQFVTSKT
IDVKKSQLGL TFIRESTIRD KSFTERAPKL GGLIEFYRSP ARVQWSPTGT NVPDYPKLAQ
LWWQAIGDAS SGAKTPQEAM DSLCGEQEKV MGRIEKSGVQ GDIGPKLAEE HDLAYWNADA
VKKGNLAPQL KIENEKEKPV TVNYDELVKS WQSK