Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2420 |
Symbol | |
ID | 8013402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2420434 |
End bp | 2421669 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825001 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002976231 |
Protein GI | 241205135 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.260445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCA GCACCGAACG ACCACCGGTA GCGGCCATTG CCGCGCTCGG GCTGACGCAG ATCATCGGTT ATGGATCGCT CTATTACAGC TTCAGCATCC TCGCCCCCGA CATGGCCCGT GATTTGGGCT GGTCATCTGA ATGGATCTTC GGCGCGCTCT CTGTGGCCCT TCTGATCGGC GGTCTGGCCG CACCTCTCAT GGGCACGTGG ATTGATCGCT TCGGCGCAGG CAGGATCATG ACGTCGGGCT CGGCGATTGC CGCCGCCGCC CTCGTCGCCT GCGCCTTCGC GCCGGGAAAG ATCGCTTTCG TCGCGGCATT GATCGGCATC GAGATCGCCT CGAACCTGGT GCAATATGGC GCAGCCTTCG CCCTGCTCGT GCAGATCAGG CCGCAGGTCG CCCAGCGCAG CATCACCTAT CTGACCCTAA TCGCCGGCTT CGCCTCAACC ATTTTCTGGC CGATCACCAC GGCGCTGCAT GCGCATCTCT CATGGCAGAA CGTCTATCTG ATTTTCGCCG CGCTCAATCT CGTTGTCTGC CTTCCCATTC ATGCCTGGCT CTCCCGTGGC ATCAGCCAAA CCAGGGGACG GACCGGAGAG GAGGCCGCAA AACGCGCCGA GCCGAGCCTT CCGCCGTCGG TGCGCAGGCT GGCCTTTATC CTGATGGTGA CAGGCTTTGC TTTGGAAAGC TTCGTGAACT CGGCGCTTCT GGTTCACATG GTGCCTGTGA TGTCGGCCCT CGGCCTCGGC GCCATGGCAG TGGTGGTCGG AACGCTCTTC GGCCCGTCAC AGGTGCTGAG CCGCCTCATC AACATGGTCT TTGGCGAGAG CTTGTCGCAA GTGATGCTCG CGATCATCTG CGCCATCTTG CTGCCGACAG CCCTTGTCAT CCTCATCGCC ACCGCACCCT CGGTGCCTGG TGCGCTGGTC TTCGCCGTCG TCTTCGGCCT CGGCTCGGGG CTTAACAGCA TCGTCTACGG AACCTTGCCG CTCGCGCTCT TCGGAAGCGA TGGCTACGGC CGGCGGCAAG GACAAATCAT GTCGGTTCGT CTCGTCGTCT CCTCGATGGC GCCGTTCGCG CTTGCCTTCC TGATGGGCTA TCTCGGCGTA TCATGGTCAC TATCGATCGC TGCATTGCTC AGCACCGTCG CCGTTGCCGC ATTCTTCGCC ATCACGCGGC TGACGCGCCC GGTTGTTGCC CGGCCGGAAC CTGTTCCCAA TCCCGGAGAA GCTTGA
|
Protein sequence | MTISTERPPV AAIAALGLTQ IIGYGSLYYS FSILAPDMAR DLGWSSEWIF GALSVALLIG GLAAPLMGTW IDRFGAGRIM TSGSAIAAAA LVACAFAPGK IAFVAALIGI EIASNLVQYG AAFALLVQIR PQVAQRSITY LTLIAGFAST IFWPITTALH AHLSWQNVYL IFAALNLVVC LPIHAWLSRG ISQTRGRTGE EAAKRAEPSL PPSVRRLAFI LMVTGFALES FVNSALLVHM VPVMSALGLG AMAVVVGTLF GPSQVLSRLI NMVFGESLSQ VMLAIICAIL LPTALVILIA TAPSVPGALV FAVVFGLGSG LNSIVYGTLP LALFGSDGYG RRQGQIMSVR LVVSSMAPFA LAFLMGYLGV SWSLSIAALL STVAVAAFFA ITRLTRPVVA RPEPVPNPGE A
|
| |