Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2187 |
Symbol | |
ID | 6980926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2242485 |
End bp | 2243735 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396906 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002281694 |
Protein GI | 209549777 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00891] putative sialic acid transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0329536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.242906 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTT TGGAAAGCCT GCGCCGGCTG ACGCCGCAGC AGCGCAACAC CGTCATCGCC AGCTATCTCG GCTGGACGCT CGATGCCTTC GATTTTTTCA TTCTCGTCTT CGTTCTCAAA TATATCGCCG AGGAATTCCA CACCGACGTT CCCGCCGTCT CGGTGGCGAT CTTCCTGACG CTCGCCATGC GGGCGCTGGG CGCGCTGATC TTCGGGTTGG CGGCCGACCG CTATGGCCGG CGCATCACGC TGATGGCCGA CGTGCTGCTC TATTCGCTGT TCGAATTCCT GACAGGTTTC TCCACCGGCC TCACCATGTT CCTGGTGCTC CGGGCGCTCT ACGGCATCGC CATGGGCGGC GAATGGGGCG TCGGCGCCTC GCTGGTCATG GAGACGGTGC CGGAGGAAAG CCGCGGCATC GTCTCAGGCA TCCTGCAGGC GGGTTATCCC TCGGGCTATC TGATCGCCTC GGTCGTGTTC TTCCTGCTCT TTCCCGTCAT CGGCTGGCGC GGCATGTTCT TCATCGGCGC GGCCCCGGCG CTGCTGGTGC TCTATATCCG GCGGAACGTC GAGGAGAGCC CCGCCTTTCT GAGACGGCAG GCCGAGGGGC GCCGGCCGTT CCTGACGGTG CTGCGCGAAA ATATTCCGCT GTTCATCTGG GCGGTGCTCT TGATGACGGC GTTCAATTTC TTCAGTCACG GCACGCAGGA TATCTACCCC ACCTTCCTCG AAACTCAGCG TAACTATTCG AGCTATACGG TCGGCGCCAT CGCCATCGTC TACAATATCG GGGCGATCTG CGGCGGGCTG TTCTTCGGGG CTCTGTCGCA GCGGATCGGC CGCAAGCGGG CGATTGTGAC GGCCGCACTG ATCGCCGTGC CCGTGGCACC TCTCTGGGCC TATTCGCCGG GGCCGGTGCT GCTCGCCATC GGTGCCTTTC TCATGCAGTT CTTCGTCCAG GGCGCCTGGG GCATCGTGCC GGTGCATCTG AACGAGTTGT CGCCGGACGA AGTGCGCGGC ACCTTTCCCG GCTTCGCTTA TCAACTCGGC AACCTGCTGG CCTCTGGCAA CGCCACGCTG CAGGCGGGGC TCGCCGCCCG CTGGAACGGC GATTACGCCT CAGCGCTGCT GATCGTCGCG GCGGTGGTGG CGCTCGTCGT CGCCCTGCTC GCCGGCTTCG GCTACGAGAA GAAGGATGTT CGCTTCGGCA TGGAGGAAGC CGAGGAGCCG CATGGCGCGA TGCGAATCTA G
|
Protein sequence | MSALESLRRL TPQQRNTVIA SYLGWTLDAF DFFILVFVLK YIAEEFHTDV PAVSVAIFLT LAMRALGALI FGLAADRYGR RITLMADVLL YSLFEFLTGF STGLTMFLVL RALYGIAMGG EWGVGASLVM ETVPEESRGI VSGILQAGYP SGYLIASVVF FLLFPVIGWR GMFFIGAAPA LLVLYIRRNV EESPAFLRRQ AEGRRPFLTV LRENIPLFIW AVLLMTAFNF FSHGTQDIYP TFLETQRNYS SYTVGAIAIV YNIGAICGGL FFGALSQRIG RKRAIVTAAL IAVPVAPLWA YSPGPVLLAI GAFLMQFFVQ GAWGIVPVHL NELSPDEVRG TFPGFAYQLG NLLASGNATL QAGLAARWNG DYASALLIVA AVVALVVALL AGFGYEKKDV RFGMEEAEEP HGAMRI
|
| |