Gene Rleg2_4071 details


Gene Information

Locus tag: Rleg2_4071
Symbol:
ID: 6982842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4247299
End bp: 4248930
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 65%
IMG OID: 643398801
Product: protein of unknown function DUF894 DitE
Protein accession: YP_002283559
Protein GI: 209551642
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
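
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; `gene_length` and `protein_length` are hypothetical helper names, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4247299, 4248930)
print(length)                  # 1632
print(protein_length(length))  # 543
```

Both results match the Gene Length and Protein Length fields of the record.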


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCTACA CTTCGTCGAC ATTGGCGCCT TTGCGGCACG ATACCTACCG CACCATCTGG 
TTCGCCAGCC TGTCGTCGAA TTTCGGCGGC CTGATCCAGG CGGTCGGTGC CGCCTGGATG
ATGACGACGA TCACCGCCTC GGAGGACATG GTCGCGCTGG TGCAGACCTC GACGGCGCTG
CCGATCATGC TGTTTTCGCT GATTTCAGGG GCGCTCGCCG ACAATTACGA TCGCCGCCGG
GTGATGCTGA CTGCGCAGTG TATGATGCTG ACAGTCTCGG CGCTTTTGAC GGCCACGGCC
CTTCTCGGCT GGATCACACC CTGGCTGCTG CTCTTCTTCA CCTTCCTGAT CGGCTGCGGC
ACCGCGCTCA ACAACCCTTC CTGGCAGGCC TCGGTCGGCG ACATGGTGCC GCGCGCCGAT
CTGCCGGGCG CCGTCACGCT GAACAGCATG GGTTTCAACA TTACCCGCAG CGTCGGCCCG
GCCATCGGCG GTGTCATCGT CGCCGCCGCC GGCGCGGCGG CGGCCTTCGC GGTGAACACC
GTGAGCTACC TCGCTTTGAT CTATGCCCTG CTGCGCTGGC GCCCAAGCAC GCCGGTCTCG
ACCCTGCCGC GCGAGGCGCT CGGCAGCGCC ATCTTCGCCG GCCTGCGTTA TGTCTCGATG
TCGCCCAATC TCGAAAAGGT TCTCCTCCGG GGGCTGCTCT TCGGCATCGG CGCCAGCTCG
ATCCTGGCGC TGCTGCCGGT CGTGGCACTC GATCTCGTCG GCGGCGGCCC GCTGACCTAT
GGTTTCATGC TCGGCGCCTT CGGCATCGGC GCGATCGGCG GCGCGGTGTT GAATGCGCGG
CTGCGCCAGA TGCTGTCGAG CGAGATGATC ATCCGTCTGG CCTTTACAGG CTTCGCGCTG
AGCGCCGTCA TCGCTGCCTT CAGCCCAAGC GCAGTGCTGA CCTCGGCCGG GTTGCTCATC
TCCGGCGCCT GCTGGGTCTC CGCACTGTCG CTCTTCAACA CCATCGTCCA GCTGTCGACG
CCGCGCTGGG TGGTGGGACG GGCGCTGTCG CTCTACCAGA CCGTCACCTT CGGCGGCATC
GCCGGCGGCA GCTGGCTCTG GGGTGTGGCC GCCGATCGCT ACGGTGTCGC CGACGCGCTG
CTGATGTCAT CGGTCGTCAT GCTGCTCGGC ATCGTGATCG GCCTGCGCTT TTCCATGCCG
GCCTTTGCCT CGCTCAATCT CGATCCGCTG AACCGCTTCA CCGAGCCGGC TCTCAGCCTC
GACATCACCC CCCGCAGCGG CCCGATCGTC ATCCAGGTCG ATTATGAGAT CGGAGATGAC
GACCTTGCCG AATTCATGCA GCTGATGGGC GAACGCCGCC GTATCCGCAT CCGCGACGGC
GCCCGCAACT GGGCTTTGAT GCGCGATCTC GAAAATCCCG GGCTCTGGAC GGAAACCTAC
CATACGCCGA CCTGGGTCGA ATATATAAGA CACAACCAGC GGCGCACGCA GGCCGATGCC
GAAAACACCG ACAGGCTTCG TGCGCTTCAT CGCGGCGAAG GTCCGCTGCA TGTCCACCGC
ATGATCGAAC GCCAGGCCAT TCCATCCGGC GACGACGTCT TCCATAAAGC GCCGATCGAT
CTGCATCATT GA
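
A GC content figure such as the one reported for this gene can be recomputed directly from a sequence listing. This is a minimal sketch, assuming the sequence is supplied as text in blocks of ten bases separated by whitespace, as it is here; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input, not the gene above: 4 of 8 bases are G or C.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Passing the full gene sequence as one string, then multiplying by 100 and rounding, gives a percentage to compare with the GC content field.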
 
Protein sequence
MAYTSSTLAP LRHDTYRTIW FASLSSNFGG LIQAVGAAWM MTTITASEDM VALVQTSTAL 
PIMLFSLISG ALADNYDRRR VMLTAQCMML TVSALLTATA LLGWITPWLL LFFTFLIGCG
TALNNPSWQA SVGDMVPRAD LPGAVTLNSM GFNITRSVGP AIGGVIVAAA GAAAAFAVNT
VSYLALIYAL LRWRPSTPVS TLPREALGSA IFAGLRYVSM SPNLEKVLLR GLLFGIGASS
ILALLPVVAL DLVGGGPLTY GFMLGAFGIG AIGGAVLNAR LRQMLSSEMI IRLAFTGFAL
SAVIAAFSPS AVLTSAGLLI SGACWVSALS LFNTIVQLST PRWVVGRALS LYQTVTFGGI
AGGSWLWGVA ADRYGVADAL LMSSVVMLLG IVIGLRFSMP AFASLNLDPL NRFTEPALSL
DITPRSGPIV IQVDYEIGDD DLAEFMQLMG ERRRIRIRDG ARNWALMRDL ENPGLWTETY
HTPTWVEYIR HNQRRTQADA ENTDRLRALH RGEGPLHVHR MIERQAIPSG DDVFHKAPID
LHH
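
The protein sequence can be related to the gene sequence by translating codon by codon. The sketch below is an illustration, not the pipeline that produced this record. It assumes translation table 11 uses the standard amino acid assignments and accepts several alternative start codons (including GTG, which begins this gene), all read as methionine at the first position. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each position); assumed to be
# the standard assignments, which table 11 is understood to share.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCCTACA CTTCGTCGAC ATTGGCGCCT"))  # MAYTSSTLAP
```

The output matches the first ten residues of the protein sequence, with the initial GTG codon read as M rather than V.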