Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6464 |
Symbol | |
ID | 6983535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 129405 |
End bp | 131129 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643399461 |
Product | putative sugar ABC transporter, substrate-binding protein |
Protein accession | YP_002284217 |
Protein GI | 209552302 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.724776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGGC ATTTATTGAC GACGACAGCA GCGGTACTGC TGGCCATGAC CGGGTCGGCC TATGCCGGCA TGGACGAGGC AAAGACTTTC CTGGATAAAG AGGTCGGCGA CATGTCGACG CTCTCTCGCG CCGACCAGGA AAAGGAAATG CAGTGGTTCG TCGATGCTGC GAAGCCCTTC GCCGGCATGG AGATCAAGGT CGTTTCGGAA ACCTTGACCA CTCACCAGTA TGAGTCCCAG GTTCTGGCTC CGGCCTTTAC TGCGATCACC GGCATCAAGG TCACTCACGA CGTCATTCAA GAGGGTGACG TTGTCGAGAA GATCCAGACT CAGATGCAGA CCGGTCAGAA CCTTTATGAC GGCTGGGTCA ACGACTCCGA CTTGATCGGT ACGCATTGGC GCTATCAGCA GGTGCGCAAC CTGACCGACT GGATGGCCGG CGAAGGCAAG GACGTTACCA ACCCCGGCCT CGATATCGAC GACTTCATCG GCACCAAGTT TACGACGGCA CCGGACAAGA AGCTCTACCA GCTTCCCGAC CAGCAGTTCG CCAACCTCTA TTGGTTCCGC TACGACTGGT TCAACGACGA GAAGAACAAG GCCGACTTCA AGGCGAAGTA CGGCTACGAC CTCGGCGTTC CGGTCAACTG GTCGGCCTAT GAGGACATCG CCGAATTCTT TACCGGCCGT GACGAGAACG GCAAGAAGGT CTTCGGCCAC ATGGACTACG GCAAGAAGGA CCCGTCGCTC GGCTGGCGCT TCACCGACGC CTGGCTTTCC ATGGCCGGCA ACGGTGACAA GGGTCTTCCG AACGGTCTTC CGGTCGACGA ATGGGGTATC AAGGTCGACG AAAAGTCGCG TCCCGTCGGC TCATGCGTCG CGCGCGGCGG TGATACCAAC GGCCCGGCTT CCGTCTACTC GATCCAGAAG TATCTCGACT GGTTGAAGGC CTACGCACCG CCGGAAGCTC AGGGCATGAC CTTCTCGGAA TCCGGTCCGG TTCCGGCACA GGGTAACGTC GCGCAGCAGA TCTTCTGGTA CACGGCCTTT ACCGCAGACA TGGTCAAGCC TGGCCTGCCT GTCATGAACG AAGATGGTAC GCCGAAATGG CGCATGGCAC CGAGCCCGCA TGGCGTTTAC TGGAAAGACG GCATGAAGCT CGGCTATCAG GACGTCGGTT CCTGGACGCT GATGAAATCG ACCCCGACGG ACCGTGCGAA GGCCGCCTGG CTCTATGCAC AGTTCGTGAC CTCGAAGACG ATTGACGTGA AGAAGAGCCA GCTCGGCCTC ACCTTCATTC GCGAATCCAC CATTCGCGAT AAGAGCTTTA CGGAGCGCGC TCCGAAGCTC GGCGGTCTGA TCGAGTTCTA TCGTTCGCCG GCGCGTGTTC AGTGGTCGCC AACCGGCACC AACGTTCCCG ACTATCCGAA GCTGGCTCAG CTTTGGTGGC AGGCAATCGG CGATGCGTCT TCCGGTGCGA AGACACCGCA GGAAGCCATG GATTCTCTTT GCGGCGAGCA GGAGAAGGTC ATGGGCCGTA TCGAGAAATC CGGCGTCCAG GGCGACATCG GCCCGAAACT GGCCGAAGAG CATGATCTCG CTTATTGGAA CGCGGATGCG GTGAAGAAGG GCAACCTTGC GCCGCAGCTG AAGATCGAGA ACGAGAAGGA AAAGCCGGTC ACCGTCAACT ACGACGAACT CGTCAAGAGC TGGCAGTCGA AGTAA
|
Protein sequence | MRRHLLTTTA AVLLAMTGSA YAGMDEAKTF LDKEVGDMST LSRADQEKEM QWFVDAAKPF AGMEIKVVSE TLTTHQYESQ VLAPAFTAIT GIKVTHDVIQ EGDVVEKIQT QMQTGQNLYD GWVNDSDLIG THWRYQQVRN LTDWMAGEGK DVTNPGLDID DFIGTKFTTA PDKKLYQLPD QQFANLYWFR YDWFNDEKNK ADFKAKYGYD LGVPVNWSAY EDIAEFFTGR DENGKKVFGH MDYGKKDPSL GWRFTDAWLS MAGNGDKGLP NGLPVDEWGI KVDEKSRPVG SCVARGGDTN GPASVYSIQK YLDWLKAYAP PEAQGMTFSE SGPVPAQGNV AQQIFWYTAF TADMVKPGLP VMNEDGTPKW RMAPSPHGVY WKDGMKLGYQ DVGSWTLMKS TPTDRAKAAW LYAQFVTSKT IDVKKSQLGL TFIRESTIRD KSFTERAPKL GGLIEFYRSP ARVQWSPTGT NVPDYPKLAQ LWWQAIGDAS SGAKTPQEAM DSLCGEQEKV MGRIEKSGVQ GDIGPKLAEE HDLAYWNADA VKKGNLAPQL KIENEKEKPV TVNYDELVKS WQSK
|
| |