Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1953 |
Symbol | |
ID | 8012992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1942664 |
End bp | 1943872 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644824542 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002975774 |
Protein GI | 241204678 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.366446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGATA TTTCCTGCGT GGAGCCGGCG GACGTGTCGG ATCGACACTT GCACGGCACG ACATCGCCGC GAGTGGATCG GACCCCATGG GCCGCGATGG CGGGCATCAT CGCGACCGTC ACGGTGTTTG CCGTGGCGCA GGGGCTGACC TATCCGCTGC TCAGCTTCAT CCTGGAACGG CAAGGGACGA CATCAGGCCT GATCGGCTTG TCGGCGGCGA TGACGCCGCT CGGTTTCATC CTATCGGCGC CCTTCATTCC TGCGCTTTCA CCGCGCGTGG GTGGGGCGCG GTTGGCGATC CTGTGTTCGA TCCTGGCCGC GCTCACTCTA ATGACGATTG CCTGGGCGCA GGACGTCTGG GCCTGGATGC CGCTGCGCTT CCTGCTCGGC GTCTTCGCCA ATCCGCTTTA CGTGATCAGC GAAACCTGGC TGATCTCGAT CACGCCGGCG CCACGCCGGG GCCGGATCAT GGGTCTCTAT TCATCGATCG TTTCGGGCGG CTTCGCCATC GGCCCGCTGT CGCTCTGGCT CACCGGCACG GAGGGTTGGC CGCCCTTCGC GATCGGCATT GCGGCCTTCC TCCTCTGCGG CCTGATCGTG CTTGCGGTCG TCCCACGCCT GCCCGACATG CCTGGTGAAG GCGAGGCAAC AACGGTGGGC CGCTTCTTCG CGCTAGCACC ACTGCTGTTG TTTGCCGTTT TTACCGCTGC CGCCTTCGAG CAGATCCTGC TTTCCCTCTT CGCGGTCTAT GGCGCAGCAC TCGGCAGCGC CGAGGAGCGC ATCGCTTCGC TCATCACCTG TTTCATCGCC GGCAATGCCG TATTGCAGAT TTTACTCGGG CGCTTGGCCG AACGGTTCGG CTCGACGCGG ATGATGCTCT TCTGCGTCCT GGCCTGCCTC GCCAGCTGTC TGCTGCTGCC GTCGGCCTTC AATTCGTGGC TCATCTGGCC GCTGCTTTTC GTCTGGGGCG GGGTCTCGTT CGGGATCTAC ACCCTGTCGC TGATCCTGCT CGGCGAGCGT TTCACCGGCC AGGCCCTGAT CGCCGGTAAC GCCGCCTTCG CCTTGGTATG GGGCATCGGC GGCATCGTCG GATCGCCCGC GACAGGACTG GCGATGCAAC TGATCGGACA TCAGGGTCTG CCATTGTCCC TTGTCCTGCT CAACTGTGGG CTGGCGGTGT TGCTGATGGC CAGGAGATGG CGGGGCTGA
|
Protein sequence | MNDISCVEPA DVSDRHLHGT TSPRVDRTPW AAMAGIIATV TVFAVAQGLT YPLLSFILER QGTTSGLIGL SAAMTPLGFI LSAPFIPALS PRVGGARLAI LCSILAALTL MTIAWAQDVW AWMPLRFLLG VFANPLYVIS ETWLISITPA PRRGRIMGLY SSIVSGGFAI GPLSLWLTGT EGWPPFAIGI AAFLLCGLIV LAVVPRLPDM PGEGEATTVG RFFALAPLLL FAVFTAAAFE QILLSLFAVY GAALGSAEER IASLITCFIA GNAVLQILLG RLAERFGSTR MMLFCVLACL ASCLLLPSAF NSWLIWPLLF VWGGVSFGIY TLSLILLGER FTGQALIAGN AAFALVWGIG GIVGSPATGL AMQLIGHQGL PLSLVLLNCG LAVLLMARRW RG
|
| |