Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4908 |
Symbol | |
ID | 6978002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 549940 |
End bp | 551019 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394065 |
Product | ABC transporter related |
Protein accession | YP_002278883 |
Protein GI | 209546965 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.326635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC ATCGTTTCGG CGGCATCAAG ATCCGCCATC TCTACAAGAT CTTCGGCCCC AACCCGGCCG CATATGTCGA TGCGGTGCAG AAGGGACTGT CGAAGACCGA GCTCAATGAA AAGCACGGCC ATGTGCTCGG CCTCAGGGAC ATCAATGTCG AGATCCCCTC CGGCCGCATC CAGGTCGTCA TGGGCCTTTC GGGCTCGGGC AAATCGACGC TGATCCGCCA TATCAACCGG CTGATCGATC CGACCGCCGG CGAAGTGCTG GTCGACGGCG TCGATGTCGT CAAGATGAAC GAGACCGAGT TGCGGGCGTT TCGCCGGCAC CAGACGGCGA TGGTGTTTCA GAAGTTCGCG CTTCTGCCCC ACCGCAACGT GCTCGACAAC ACCATCTTCG GCCTCGAAGT CCAGGGCATG GAGCGGTCAA AAGCCGTCGA CGTCGCCATG CGCTGGCTGG AGCGGGTCGG GCTGAAGGGC TTCGAGCAGC GCTATCCCAA CCAGCTCTCC GGCGGCATGC AGCAGCGTGT CGGCCTGGCG AGAGCTCTCT CCAACGATGC CCCCGTCCTC CTCATGGACG AGGCCTATTC GGCGCTCGAC CCGCTGATCC GCACCGACAT GCAGACCGTG CTTCTCGATA TCCAGAAGGA GATCAAGAAG ACCATCGTCT TCATCACTCA CGATCTCGAC GAGGCGCTGC GTCTCGGCGA CCAGATCGCC ATCCTGCGCG ACGGCGAAGT CATCCAGCAG GGCACCAGCC AGGACATCGT CCTGCGCCCC GCCGACGACT ACATCGCCAA CTTCGTCAAG GAGGTCAATC GTGGCAGGGT CATCCATGTC GAAGCCGTTA TGACGCCGCT GCACTCCGGT GCGGCGCCGG TTGGACTGGC AATTGTGGCG GGAACGACCG TCGAAGAGGC CGTCCGGATG CTGTCATCGG CGCCAGAGGG CGACGCCCGG GTGGTTTCGC CTTCCGGAGA AACCTTAGGG CTCGTCACCT TCCGCCAGCT CGCAAGTGCG ATGGTGAACT CACATGATAT GGCTCCCAAG CGCGACAGCG CCCTGTCGGT CGCCCTCTGA
|
Protein sequence | MAEHRFGGIK IRHLYKIFGP NPAAYVDAVQ KGLSKTELNE KHGHVLGLRD INVEIPSGRI QVVMGLSGSG KSTLIRHINR LIDPTAGEVL VDGVDVVKMN ETELRAFRRH QTAMVFQKFA LLPHRNVLDN TIFGLEVQGM ERSKAVDVAM RWLERVGLKG FEQRYPNQLS GGMQQRVGLA RALSNDAPVL LMDEAYSALD PLIRTDMQTV LLDIQKEIKK TIVFITHDLD EALRLGDQIA ILRDGEVIQQ GTSQDIVLRP ADDYIANFVK EVNRGRVIHV EAVMTPLHSG AAPVGLAIVA GTTVEEAVRM LSSAPEGDAR VVSPSGETLG LVTFRQLASA MVNSHDMAPK RDSALSVAL
|
| |