Gene Information |
Locus tag | Rleg2_2945 |
Symbol | |
ID | 6981690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3001953 |
End bp | 3002942 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643397656 |
Product | putative sugar ABC transporter, substrate-binding protein |
Protein accession | YP_002282439 |
Protein GI | 209550522 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
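The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Rleg2_2945).
length = gene_length_bp(3001953, 3002942)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```

Both results agree with the Gene Length and Protein Length fields of the record.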
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.64676 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAGG CATTATTGCT TACAGCTGCA GTTATGGCGC TGACGGCCGG TCAGGCGCTC GCCGCCAAGA AGCAGCTGGT CATCGTCGTC AAAGGCCTCG ACAACCCGTT CTTCGAGGCA ATCAACCAGG GTTGCCAGAA GTGGAACAAG GAAAACGCAT CCGCCGAATA TGAATGCTTC TACACCGGCC CGGCATCGAC CTCCGATGAG GCCGGCGAGG CGCAGATCGT CCAGGATATG CTGGGGAAAG CCGATACCGC CGCCATCGCC ATCTCGCCAT CGAACGCCAA GCTGATCGCC CAGACGCTGA AGACGGCCAA TCCGAGCGTT CCGGTGATGA CGCTCGATGC CGATCTTGCG GCCGATGATG CGGCGCTGCG CAAAACCTAT CTCGGCACCG ACAACTATCT GATGGGCGCC AAGATCGGTG AATACATCAA GAAGGCCAAG CCGAAGGGCG GCAAGATCTG CACCATCGAA GGCAATCCCG GCGCCGATAA CATTCTGCGC CGCGCCCAGG GCATGCGCGA TACGCTGAGC GGCCAGAAGG GGCTGGCGGC GCTGAAAGGT GAAGGCGGCT GGACTGAGGT GGCCGGCTGC CCGGTCTTTA CCAATGACGA TGGCGCCAAG GGCGTGCAGG CGATGACCGA CATCCTCGCC GCCAACACCG ACCTCGATGC CTTCGGCATC ATGGGCGGCT GGCCGCTGTT CGGCGCCCCG CAGCCCTATC GCGATCTGTT CAAGCCGATG GCCGACAAGA TCGCCAAGAA CGAGTTCGTC ATCGGCGCCG CCGACACGAT CGGCGACGAA GTGGCAATCG CCAAGGAAGG GCTGGTCACC GCGCTCGTCG GCCAGCGGCC GTTCGAAATG GGTTATAAGG CGCCAACCGT GATGATGGAC CTGATCGCCG GGAAGAAGGT CGAGGATCCG GTATTCACCG GGCTGGACGA ATGCACCAAG GATACCGTCG ACACCTGCAT TCAGAAGTGA
|
Protein sequence | MRKALLLTAA VMALTAGQAL AAKKQLVIVV KGLDNPFFEA INQGCQKWNK ENASAEYECF YTGPASTSDE AGEAQIVQDM LGKADTAAIA ISPSNAKLIA QTLKTANPSV PVMTLDADLA ADDAALRKTY LGTDNYLMGA KIGEYIKKAK PKGGKICTIE GNPGADNILR RAQGMRDTLS GQKGLAALKG EGGWTEVAGC PVFTNDDGAK GVQAMTDILA ANTDLDAFGI MGGWPLFGAP QPYRDLFKPM ADKIAKNEFV IGAADTIGDE VAIAKEGLVT ALVGQRPFEM GYKAPTVMMD LIAGKKVEDP VFTGLDECTK DTVDTCIQK
|
| |
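The sequences above are printed in blocks of 10 characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch, not the method used by the source database: it removes the grouping whitespace, computes GC content, and translates a coding sequence. It assumes that the amino acid assignments of translation table 11 match the standard genetic code, and it does not handle alternative start codons (not needed here, since this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Compact encoding of the standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping whitespace and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_30 = "ATGAGGAAGG CATTATTGCT TACAGCTGCA"
print(translate(first_30))  # MRKALLLTAA
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared against the GC content and Protein sequence fields.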