Gene Rleg2_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1670 
Symbol 
ID6980407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1700492 
End bp1702090 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content60% 
IMG OID643396395 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_002281185 
Protein GI209549268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0165754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCG CTAAAACGTC CGGGGGCACT TTCGCTCCTC TTGCGCAGCC CGTCTTTGCG 
GTTCTCTGGA TCGCTACGGT TCTCGGCAAC ACCGGCAGCT TCATGCGCGA CGTCGCCAGT
TCCTGGCTGA TGACGGATCT TTCTGCATCG CCTGCCGCAG TTGCCATGGT TCAGGCGGCC
GGAACCCTGC CGATTTTCCT GCTTGCCATT CCGGCAGGCG TTCTCACGGA CATTCTCGAT
CGCCGCAAAT TCCTGATCGC CGTCCAGCTT TTGCTGGCAT CAGTCAGCAT TTCGCTGATG
GTTCTGTCGC AGACGGGGAT GTTGTCGGTC AGCGCGCTGA TCGGTCTGAC CTTCCTTGGC
GGCATCGGTG CCGCTCTGAT GGGACCGACC TGGCAGGCGA TCGTGCCGGA ACTGGTGAAA
CGCGAGGATG TGAAGAGCGC GGTCGCTCTC AATTCGCTCG GCATCAATAT CGCCCGCTCT
ATCGGGCCAG CTGCTGGTGG CCTGCTTCTA GCAGCCTTTG GAGCCGGGAT CACTTATGGG
GCGGACGTTG CCAGCTACTT CGCCGTGATC GCGGCTCTGG TCTGGTGGCC AAGAGCAAAG
AATGCCGACG ACGTGCTTCA GGAGAACTTC TTCGGTGCGT TTCGGGCCGG ACTTCGCTAC
ACCCGCTCAA GCACCACGCT CCATGTGGTT CTGCTGCGCG CCGCAATCTT TTTCGCCTTC
GCCAGTGCTG TTTGGGCTCT TCTTCCCCTC GTTGCCCGGC AACTGCTCGA CGGTGGCGCC
AGCTTCTACG GTATCCTGCT TGGTGCCGTC GGCACAGGCG CGATCGGCGG TGCCTTGGTC
ATGCCCAAGC TGCGCCAACG CCTGAGTTCT GATGGTTTGC TTCTCGGCGC AGCACTCGTC
ACTGCAGTCG TCATGGGTGT CCTGTCGCTT GCCCCGCCGA AGATTGTCGC CATCATTGTT
CTTCTTTTCC TCGGTGGCGC ATGGATCACC GCGCTCACAA CGCTCAACGG CGCAGCGCAG
GCAGTGCTTC CCAACTGGGT GCGCGGTCGT GGCCTTGCCG TCTATCTGAC TGTCTTCAAC
GGTGCGATGA CAGCCGGAAG CCTAGGCTGG GGTGCGGTCG GCGAGGCTGT CGGCATCCAG
GCTACCTTGC TTATCGGAGC CGTCGGACTG CTCGTTGCCG GTTTCATCAT GCACCGCCTG
AAGCTTCCGA CCGGTGATGC CGACATGGTG CCCTCAAACC ATTGGCCCGA GCCGCTGGTG
GCTGAACCTG TTGCCCACGA TCGAGGCCCG GTTCTGATCT TGATCGAATA CAAGGTCGAA
AAGGAGCACC GCAGCGCATT CCTGCACGCC ATCGATCATC TCTCCAAGGA GCGTCGCCGC
GATGGTGCCT ATGGATGGGG TATCACGGAG GATTCGGCCG ACCCAGAAAA GATCGTCGAA
TGGTTCATGG TGGAATCCTG GGCCGAACAT CTTCGCCAGC ATAAGAGGGT TTCCAACGCT
GACGCCGACC TGCAAAGCAA AGTGCTCGGC TACCATATCG GTCCCGACAA ACCAGTTGTC
CGTCACTTCC TGACGATTAA TCGGCCTGAT GCCGCATAA
 
Protein sequence
MSAAKTSGGT FAPLAQPVFA VLWIATVLGN TGSFMRDVAS SWLMTDLSAS PAAVAMVQAA 
GTLPIFLLAI PAGVLTDILD RRKFLIAVQL LLASVSISLM VLSQTGMLSV SALIGLTFLG
GIGAALMGPT WQAIVPELVK REDVKSAVAL NSLGINIARS IGPAAGGLLL AAFGAGITYG
ADVASYFAVI AALVWWPRAK NADDVLQENF FGAFRAGLRY TRSSTTLHVV LLRAAIFFAF
ASAVWALLPL VARQLLDGGA SFYGILLGAV GTGAIGGALV MPKLRQRLSS DGLLLGAALV
TAVVMGVLSL APPKIVAIIV LLFLGGAWIT ALTTLNGAAQ AVLPNWVRGR GLAVYLTVFN
GAMTAGSLGW GAVGEAVGIQ ATLLIGAVGL LVAGFIMHRL KLPTGDADMV PSNHWPEPLV
AEPVAHDRGP VLILIEYKVE KEHRSAFLHA IDHLSKERRR DGAYGWGITE DSADPEKIVE
WFMVESWAEH LRQHKRVSNA DADLQSKVLG YHIGPDKPVV RHFLTINRPD AA