Gene Information |
Locus tag | Rleg_4822 |
Symbol | |
ID | 8007210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 197443 |
End bp | 198444 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821752 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002973012 |
Protein GI | 241113177 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
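The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record rather than something it states, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 197443
end_bp = 198444

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"codons: {codon_count}, protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the listed Gene Length (1002 bp) and Protein Length (333 aa).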
Plasmid Coverage information |
Number of covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Number of covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAAC TACTCGCATC GACATGTCTG ACGTTCGGCC TGATCGGCGG GGCATCTTTC GCCAGCGCCG CCGAATGCGG CACTGTGACC ATCGCCAGCA TGAACTGGCA GAGTGCAGAG GTTCTCTCCA ACCTCGACAA GTTCATCCTG AACGAAGGTT ATGGCTGTGA GGCCGAAATC ACCGTCGGCG ATACCGTTCC GACGATCACC TCGATGGCCG AAAAGGGCCA GCCGGACATC GCACCGGAAG CCTGGATCGA CCTGCTGCCC GATGTCGTCA AGAAGGGAAC GGATGAAGGC CGGATCGTCC AGGTCGGCTC TCCCTTGCCC GATGGCGGCG TCCAGGGCTG GTGGATTCCC AAATATCTGG CCGACGCCCA CCCTGATATC AAGACTATCG GCGACGTGCT GAAGCATCCG GAACTCTTCC CCGCTCCTGA GGATGCGAAG AAGGGCGCCA TCTATAACGG TCCGCAGGGC TGGGGCGGCA CCGTGGTGAC CACGCAGCTC TACAAGGCCT TCGAAGCCGA TAAGGCCGGC TTCACCCTCG TCGATACCGG TTCTGCTGCC GGCCTCGATG GTTCGATCTC CAAGGCTTAC GAACGCAAGG AAGGCTGGGC CGGCTATTAC TGGGCGCCGA CCGCGCTGCT CGGCAAATAT GAAATGGTCA AGCTCGAAGC CGGCGTGCCG AATGACGCCG CCGAATGGAA GCGCTGCAAC ACCGTCGCCG ATTGCCCCGA TCCGAAGCCG AACGCATGGC CGGTCGACAA GATCGTCACC CTCGTTGCAA AGCCTTTCTC GGAAAAGGCC GGGCCGGAGG TCATGGACTA CCTGACGAAG CGCTCCTGGA GCAATGACAC GGTCAACAAG CTGATGGCGT GGATGACCGA CAACCAGGCG ACCGGCGAGG ACGGTGCCAA GCACTTCCTC AAAGAAAACA AGGACCTCTG GACCAAGTGG GTCTCGCCTG AGGCAGTCAC GAAGATCGAA GCTGCTCTTT AA
Protein sequence | MKKLLASTCL TFGLIGGASF ASAAECGTVT IASMNWQSAE VLSNLDKFIL NEGYGCEAEI TVGDTVPTIT SMAEKGQPDI APEAWIDLLP DVVKKGTDEG RIVQVGSPLP DGGVQGWWIP KYLADAHPDI KTIGDVLKHP ELFPAPEDAK KGAIYNGPQG WGGTVVTTQL YKAFEADKAG FTLVDTGSAA GLDGSISKAY ERKEGWAGYY WAPTALLGKY EMVKLEAGVP NDAAEWKRCN TVADCPDPKP NAWPVDKIVT LVAKPFSEKA GPEVMDYLTK RSWSNDTVNK LMAWMTDNQA TGEDGAKHFL KENKDLWTKW VSPEAVTKIE AAL
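The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the standard amino-acid assignments, which translation table 11 shares with table 1. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino-acid assignments; it differs only in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence above, used as short examples.
print(translate("ATGAAAAAAC TACTCGCATC G"))  # first seven codons
print(translate("GCTGCTCTTT AA"))            # final codons, ending in the TAA stop
```

Passing the full gene sequence string to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields. Note that this sketch does not handle alternative start codons, which table 11 would read as methionine at the initiation position; the gene here starts with ATG, so that case does not arise.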