Gene Rleg_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1953 
Symbol 
ID8012992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1942664 
End bp1943872 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID644824542 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002975774 
Protein GI241204678 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.366446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGATA TTTCCTGCGT GGAGCCGGCG GACGTGTCGG ATCGACACTT GCACGGCACG 
ACATCGCCGC GAGTGGATCG GACCCCATGG GCCGCGATGG CGGGCATCAT CGCGACCGTC
ACGGTGTTTG CCGTGGCGCA GGGGCTGACC TATCCGCTGC TCAGCTTCAT CCTGGAACGG
CAAGGGACGA CATCAGGCCT GATCGGCTTG TCGGCGGCGA TGACGCCGCT CGGTTTCATC
CTATCGGCGC CCTTCATTCC TGCGCTTTCA CCGCGCGTGG GTGGGGCGCG GTTGGCGATC
CTGTGTTCGA TCCTGGCCGC GCTCACTCTA ATGACGATTG CCTGGGCGCA GGACGTCTGG
GCCTGGATGC CGCTGCGCTT CCTGCTCGGC GTCTTCGCCA ATCCGCTTTA CGTGATCAGC
GAAACCTGGC TGATCTCGAT CACGCCGGCG CCACGCCGGG GCCGGATCAT GGGTCTCTAT
TCATCGATCG TTTCGGGCGG CTTCGCCATC GGCCCGCTGT CGCTCTGGCT CACCGGCACG
GAGGGTTGGC CGCCCTTCGC GATCGGCATT GCGGCCTTCC TCCTCTGCGG CCTGATCGTG
CTTGCGGTCG TCCCACGCCT GCCCGACATG CCTGGTGAAG GCGAGGCAAC AACGGTGGGC
CGCTTCTTCG CGCTAGCACC ACTGCTGTTG TTTGCCGTTT TTACCGCTGC CGCCTTCGAG
CAGATCCTGC TTTCCCTCTT CGCGGTCTAT GGCGCAGCAC TCGGCAGCGC CGAGGAGCGC
ATCGCTTCGC TCATCACCTG TTTCATCGCC GGCAATGCCG TATTGCAGAT TTTACTCGGG
CGCTTGGCCG AACGGTTCGG CTCGACGCGG ATGATGCTCT TCTGCGTCCT GGCCTGCCTC
GCCAGCTGTC TGCTGCTGCC GTCGGCCTTC AATTCGTGGC TCATCTGGCC GCTGCTTTTC
GTCTGGGGCG GGGTCTCGTT CGGGATCTAC ACCCTGTCGC TGATCCTGCT CGGCGAGCGT
TTCACCGGCC AGGCCCTGAT CGCCGGTAAC GCCGCCTTCG CCTTGGTATG GGGCATCGGC
GGCATCGTCG GATCGCCCGC GACAGGACTG GCGATGCAAC TGATCGGACA TCAGGGTCTG
CCATTGTCCC TTGTCCTGCT CAACTGTGGG CTGGCGGTGT TGCTGATGGC CAGGAGATGG
CGGGGCTGA
 
Protein sequence
MNDISCVEPA DVSDRHLHGT TSPRVDRTPW AAMAGIIATV TVFAVAQGLT YPLLSFILER 
QGTTSGLIGL SAAMTPLGFI LSAPFIPALS PRVGGARLAI LCSILAALTL MTIAWAQDVW
AWMPLRFLLG VFANPLYVIS ETWLISITPA PRRGRIMGLY SSIVSGGFAI GPLSLWLTGT
EGWPPFAIGI AAFLLCGLIV LAVVPRLPDM PGEGEATTVG RFFALAPLLL FAVFTAAAFE
QILLSLFAVY GAALGSAEER IASLITCFIA GNAVLQILLG RLAERFGSTR MMLFCVLACL
ASCLLLPSAF NSWLIWPLLF VWGGVSFGIY TLSLILLGER FTGQALIAGN AAFALVWGIG
GIVGSPATGL AMQLIGHQGL PLSLVLLNCG LAVLLMARRW RG