Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4226 |
Symbol | |
ID | 8015903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4320206 |
End bp | 4322092 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826796 |
Product | General substrate transporter |
Protein accession | YP_002978005 |
Protein GI | 241206909 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.162126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAATG TCGCAAGCAT CGACGGCGCA AAGGCCGGTC CGATGACCGG TGAGGAGAAG AAGGTCATCT TCGCCTCTTC GCTCGGCACC GTTTTCGAAT GGTACGATTT CTATCTCTAT GGTTCGCTCG CCACCTATAT CGGCGCGACC TATTTCACCC AATATCCCGA GGCAACGCGT AACATCTTCA CGTTGCTCGC CTTTGCCGCC GGCTTCCTGG TGCGCCCCTT CGGCGCGCTG GTGTTCGGCC GTCTCGGCGA TCTCGTCGGC CGTAAATACA CCTTCCTGAT GACGATCATG ATCATGGGTC TGTCGACCTT CCTCGTCGGC ATCCTGCCGG GTGCCGCCAC GATCGGTATC GCAGCCCCGA TCATCCTGAT CGCGCTCCGT CTGCTCCAGG GTCTGGCGCT GGGCGGTGAA TATGGCGGCG CGGCCACCTA TGTCGCCGAA CATGCGCCGA ACGGGCGCCG CGGCTACTTC ACCTCGTGGA TCCAGACGAC GGCAACGCTC GGCCTGTTCC TGTCGCTGAT CGTCATCGTC CTGGTTCAAT ATCTGATGGG TGCGGCTCAG TTTGCCGCCT GGGGCTGGCG CATTCCGTTC CTGGTCTCGG TCGTCCTGCT CGGCATTTCC GTCTGGATCC GCCTGAGGAT GAACGAATCG CCGGCGTTCC AGCGGATGAA GGCAGAAGGC AAAGGCTCCA AGGCGCCGCT GACCGAGGCC TTCGGGACGT GGAAAAATGC CAAGATCGCG ATCATCGCGC TGCTCGGCGC CACCATGGGC CAGGCGGTCG TCTGGTACGG CGGCCAGTTC TATGCGCTGT TCTTCCTGCA GAACGTGCTG AAGGTGGACC TGTTTTCGGC CAATGTCATG GTGGCCATCG CACTTCTCCT CGGCACGCCC TTCTTCGTCA TCTTCGGCGG TCTCTCCGAC AAGATTGGCC GCAAGCCGAT CATCATGGCA GGCCTTTTCA TTGCGGCGGT GACCTATAAT CCGCTGTTCA AGGCGATGAC CTGGACGGCG AACCCGGCGC TTGCCGAAGC GCAGGCTTCG ATTCGGGCAA CGGTGACGGC CGATCCGGCT GATTGCAGGT TCCAGTTCAA CCCGACCGGG ACGACGAAGT TCACCAGTTC CTGCGACGTG GCAACGGCGT TCCTGACCAG GAACTCGGTG CCTTACGACG TTGTGCCCGG TACCGCCGGA CAGCCGGCAA CGGTGAAGGT CGGCAACGCG ACGATCCCAA GCTTCGACGT CGTCGCTGCC GGCGACAAGG CGAAGGGGAT GACCGCCGCC TTCGAAAAGA GCGTCAACAT CGCGCTCCAC GATGCCGGCT ATCCGCTGAA CCGCGGCGCC GTCAAGGTGC CGGATGCCAA GCTCGACGCC TTCATCGCAG CCAATCCCGA GCTGTCGCTC AACGCCGATG CCGTGCGCGC CGGCGAGAAG GAAACCGTGC CTGCGGCCAA GCTGGTCGAG ACCAAGCTGC TGACCGCGGA TGAGGCCAAT GGCGTCACCG ACATGACGGT CTACAATATC GCCAATGGCG GCACCTTCGC CATGGTCGCC GATCCGGCTC GCGTTAACTG GATCGGCACG ATCGCCGTGC TGTTCGTCCT TGTCTTCTAT GTGACGATGG TCTACGGCCC GATCGCCGCT CTGCTGGTCG AGCTTTTCCC GACCCGCATC CGCTATACCG GCATGTCGCT GCCCTATCAC ATTGGCAACG GCTGGTTCGG TGGCCTGCTT CCGGCGACGG CCTTCGCGAT GAGCGCTGCC GCGGGCGATA TCTACTACGG TCTCTGGTAC CCGATCGTCT TTGCGACGAT CACGCTGGTG ATCGGCTTGA TCTTCCTGCC GGAAACGAAG AACAGGGATA TCCACGCCAT GGATTGA
|
Protein sequence | MANVASIDGA KAGPMTGEEK KVIFASSLGT VFEWYDFYLY GSLATYIGAT YFTQYPEATR NIFTLLAFAA GFLVRPFGAL VFGRLGDLVG RKYTFLMTIM IMGLSTFLVG ILPGAATIGI AAPIILIALR LLQGLALGGE YGGAATYVAE HAPNGRRGYF TSWIQTTATL GLFLSLIVIV LVQYLMGAAQ FAAWGWRIPF LVSVVLLGIS VWIRLRMNES PAFQRMKAEG KGSKAPLTEA FGTWKNAKIA IIALLGATMG QAVVWYGGQF YALFFLQNVL KVDLFSANVM VAIALLLGTP FFVIFGGLSD KIGRKPIIMA GLFIAAVTYN PLFKAMTWTA NPALAEAQAS IRATVTADPA DCRFQFNPTG TTKFTSSCDV ATAFLTRNSV PYDVVPGTAG QPATVKVGNA TIPSFDVVAA GDKAKGMTAA FEKSVNIALH DAGYPLNRGA VKVPDAKLDA FIAANPELSL NADAVRAGEK ETVPAAKLVE TKLLTADEAN GVTDMTVYNI ANGGTFAMVA DPARVNWIGT IAVLFVLVFY VTMVYGPIAA LLVELFPTRI RYTGMSLPYH IGNGWFGGLL PATAFAMSAA AGDIYYGLWY PIVFATITLV IGLIFLPETK NRDIHAMD
|
| |