Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2382 |
Symbol | |
ID | 8013370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2388009 |
End bp | 2389004 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644824963 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002976193 |
Protein GI | 241205097 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0539771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00933627 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGACGT TTCTGATTCC GGCCGTCTTT GCCGCGGCTC TTTCTATCGT CGCGCCGGCT GAAGCCGCCG AATGCGGCAG CGTTTCGATC GCCGAAATGA AATGGGCCTC CGCAGGCATT GCGGCGAGCT TCGATAAGAT CATCCTGGAA AAGGGATACG GCTGTTCCGT CACCATTGTC GATGGCGACA CTCTGCCGAC CTTCGCCTCG ATGAATGAGA AAGGCACCCC CGACATCGCC TCGGAATACT GGATCAATTC CGTCAGGTCC CTGCTCGATC AAGCCGTCAA TAGCGGCCGG TTGGTCCAGG GGGCCGAAAT CCTGGCTGAC GGCGCCGTCG AGGGTTGGTG GATACCGAAA TTCATCGCCG ACGCCCACCC CGACATCCGC TCGGTCGAAG GCGCGCTGAA ACATCCCGAA CTCTTCCCCG CTGAGGACGA CCCGTCGAAA GGCGCAGTCT ACAACTGCCC CGCCGATTGG AGCTGCCAGA TATCGACCAC CAACCTGTTC AAGGCGCTTG CCGCCGACAA GAAGGGCTTC GAACTTGTCG AGACCGGCAG TCCCGAACAG CTGGATGCGT CGATTGCCCG CGCTTTCGAA AACAAGGTCG GCTGGCTCGG TTATTACTGG GCGCCGACCG CCATCCTCGG CAAATACGAC ATGACGCGCC TGAGCTTCGG CGTCGGCCAC AACAAGAATG AATGGGACCG CTGTACCGCG GTTGCCGGCT GCGCCAGGCC CGAGCTCAAC TCCTACCCGG TCTCGCGCGC CTTCACATTG ATGACCAGGT CCTTCGCCAG CCGCACCGGA CCTGTCACGA CCTATCTCAA GACCCGCAAA TGGGACAATG CGACGATCAA CCAGGTGCTC GCCTGGCAGG ACGAAAACCG CGAAACCAAT GAGGATGCCG CGATCTATTT CCTTCGCAAT TACGAGAGCC TGTGGACGAA ATGGGTGCCA GTCGATGTAG CCGAGAAGGT CAAGGCGAGC TTATAA
|
Protein sequence | MKTFLIPAVF AAALSIVAPA EAAECGSVSI AEMKWASAGI AASFDKIILE KGYGCSVTIV DGDTLPTFAS MNEKGTPDIA SEYWINSVRS LLDQAVNSGR LVQGAEILAD GAVEGWWIPK FIADAHPDIR SVEGALKHPE LFPAEDDPSK GAVYNCPADW SCQISTTNLF KALAADKKGF ELVETGSPEQ LDASIARAFE NKVGWLGYYW APTAILGKYD MTRLSFGVGH NKNEWDRCTA VAGCARPELN SYPVSRAFTL MTRSFASRTG PVTTYLKTRK WDNATINQVL AWQDENRETN EDAAIYFLRN YESLWTKWVP VDVAEKVKAS L
|
| |