Gene Information |
Locus tag | Rleg2_4780 |
Symbol | |
ID | 6977874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 414152 |
End bp | 415270 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393944 |
Product | ABC sugar transporter, periplasmic ligand binding protein |
Protein accession | YP_002278762 |
Protein GI | 209546844 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
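The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the single stop codon; both are common conventions, but neither is stated in the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp, Gene Length, Protein Length.
print(gene_length(414152, 415270))  # compare with "Gene Length"
print(protein_length(1119))         # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed 1119 bp and 372 aa.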
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.462835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCAATA GACGAATGTT TCTGTGCGGC GCTGCCGCCG TGTTGACGAT CGGCCTGGTG GGACCGGCTT TTGCCGATCC GGACTCGGCC TTGGCAAAAC TGCAGGAAAG CGTCCTGTCG AAGGGACCGT CGGGCGAAAG CCCGTCACCG GCCTCAGGCA TCAGTTTGAG CGATGAGGAA CTCGGCAAGA TCAAGGCGAT GAACGCCACG GCGGCCATTG TCATGCATTA TGGCGGCAAC GACTGGTCGC GGGCGCAGAT CAACGGTCTG CAGACGCAGT TCAAGACGAT GGGCATCAAG GTGATCGCGG TCACCGATGC CGGTTTCAAG CCGGAAAAGC AGGTGGCCGA CCTCGAAACG ATCATGGCGC AGAAGCCCAA TGTCATCGTT TCAATCCCGA CCGACCCGGC AGCGACGGCA AAGGCCTACA AGGCCGCAGC CGATGCCGGT GTCAAGCTGG TGTTCATGGA CAACGTCCCG GCGGGCTTCA AGGCGGGAAG TGACTACGTT TCCGTCGTCT CCGCAGACAA TTACGGCAAC GGCGTCGCCT CCGCCCATCT CATGGCAAAA TCGTTGAACG GCGAAGGCGA AATCGGCGTA GTCTTCCACG CAGCCGACTT CTTCGTCACG AAGCAGCGCT ACGACGCCTT TAAGGCGACG ATCGCCTCCG ACTATCCGAA GATCAAGATC GTCGCCGAAC AGGGGATCGG CGGCCCGGAC TTTTCAGGTG ACGCAGAAAA GGCGGCTTCT GCAATTCTGA CCTCCAATCC CAACGTCAAG GGCATCTGGG CCGTCTGGGA TGTACCGGCA GAAGGCGTGA TCGCCGCGGC GCGCAATGCC GGCCGTGACG ATCTCGTTAT CACCACGATC GACCTCGGCG AGAATGTCGC GATCTCGATG GCGCAGGGCA GTTTCGTCAA GGGCCTCGGA GCACAACGCC CGTTCGATGC CGGCGTTGTC GAAGCGAAAC TCGCAGGCTA TGCCCTCGTC GGCAAGGAAG CACCCGCCTT CGTGGCGCTG CCAGCCCTAC CAGTCACCCG CGACAACCTG CTCGATGCCT GGAAGACCGT CTACTCCACC GAGGCGACGG CCAACATCAA GACCAGCCTC GGCCAATAA
Protein sequence | MTNRRMFLCG AAAVLTIGLV GPAFADPDSA LAKLQESVLS KGPSGESPSP ASGISLSDEE LGKIKAMNAT AAIVMHYGGN DWSRAQINGL QTQFKTMGIK VIAVTDAGFK PEKQVADLET IMAQKPNVIV SIPTDPAATA KAYKAAADAG VKLVFMDNVP AGFKAGSDYV SVVSADNYGN GVASAHLMAK SLNGEGEIGV VFHAADFFVT KQRYDAFKAT IASDYPKIKI VAEQGIGGPD FSGDAEKAAS AILTSNPNVK GIWAVWDVPA EGVIAAARNA GRDDLVITTI DLGENVAISM AQGSFVKGLG AQRPFDAGVV EAKLAGYALV GKEAPAFVAL PALPVTRDNL LDAWKTVYST EATANIKTSL GQ
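The relationship between the two sequences above can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is given 5'→3' on the coding strand (it begins with ATG and ends with TAA), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG order. Translation table 11
# uses these same codon-to-amino-acid assignments; its alternative start
# codons are not handled here because this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCAATA GACGAATGTT TCTGTGCGGC"))
```

Passing the full gene sequence to `translate` should give a string comparable with the protein sequence once its block spacing is removed, and `gc_fraction` on the same input can be compared with the listed GC content.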