Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1670 |
Symbol | |
ID | 6980407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1700492 |
End bp | 1702090 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643396395 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_002281185 |
Protein GI | 209549268 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0165754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCG CTAAAACGTC CGGGGGCACT TTCGCTCCTC TTGCGCAGCC CGTCTTTGCG GTTCTCTGGA TCGCTACGGT TCTCGGCAAC ACCGGCAGCT TCATGCGCGA CGTCGCCAGT TCCTGGCTGA TGACGGATCT TTCTGCATCG CCTGCCGCAG TTGCCATGGT TCAGGCGGCC GGAACCCTGC CGATTTTCCT GCTTGCCATT CCGGCAGGCG TTCTCACGGA CATTCTCGAT CGCCGCAAAT TCCTGATCGC CGTCCAGCTT TTGCTGGCAT CAGTCAGCAT TTCGCTGATG GTTCTGTCGC AGACGGGGAT GTTGTCGGTC AGCGCGCTGA TCGGTCTGAC CTTCCTTGGC GGCATCGGTG CCGCTCTGAT GGGACCGACC TGGCAGGCGA TCGTGCCGGA ACTGGTGAAA CGCGAGGATG TGAAGAGCGC GGTCGCTCTC AATTCGCTCG GCATCAATAT CGCCCGCTCT ATCGGGCCAG CTGCTGGTGG CCTGCTTCTA GCAGCCTTTG GAGCCGGGAT CACTTATGGG GCGGACGTTG CCAGCTACTT CGCCGTGATC GCGGCTCTGG TCTGGTGGCC AAGAGCAAAG AATGCCGACG ACGTGCTTCA GGAGAACTTC TTCGGTGCGT TTCGGGCCGG ACTTCGCTAC ACCCGCTCAA GCACCACGCT CCATGTGGTT CTGCTGCGCG CCGCAATCTT TTTCGCCTTC GCCAGTGCTG TTTGGGCTCT TCTTCCCCTC GTTGCCCGGC AACTGCTCGA CGGTGGCGCC AGCTTCTACG GTATCCTGCT TGGTGCCGTC GGCACAGGCG CGATCGGCGG TGCCTTGGTC ATGCCCAAGC TGCGCCAACG CCTGAGTTCT GATGGTTTGC TTCTCGGCGC AGCACTCGTC ACTGCAGTCG TCATGGGTGT CCTGTCGCTT GCCCCGCCGA AGATTGTCGC CATCATTGTT CTTCTTTTCC TCGGTGGCGC ATGGATCACC GCGCTCACAA CGCTCAACGG CGCAGCGCAG GCAGTGCTTC CCAACTGGGT GCGCGGTCGT GGCCTTGCCG TCTATCTGAC TGTCTTCAAC GGTGCGATGA CAGCCGGAAG CCTAGGCTGG GGTGCGGTCG GCGAGGCTGT CGGCATCCAG GCTACCTTGC TTATCGGAGC CGTCGGACTG CTCGTTGCCG GTTTCATCAT GCACCGCCTG AAGCTTCCGA CCGGTGATGC CGACATGGTG CCCTCAAACC ATTGGCCCGA GCCGCTGGTG GCTGAACCTG TTGCCCACGA TCGAGGCCCG GTTCTGATCT TGATCGAATA CAAGGTCGAA AAGGAGCACC GCAGCGCATT CCTGCACGCC ATCGATCATC TCTCCAAGGA GCGTCGCCGC GATGGTGCCT ATGGATGGGG TATCACGGAG GATTCGGCCG ACCCAGAAAA GATCGTCGAA TGGTTCATGG TGGAATCCTG GGCCGAACAT CTTCGCCAGC ATAAGAGGGT TTCCAACGCT GACGCCGACC TGCAAAGCAA AGTGCTCGGC TACCATATCG GTCCCGACAA ACCAGTTGTC CGTCACTTCC TGACGATTAA TCGGCCTGAT GCCGCATAA
|
Protein sequence | MSAAKTSGGT FAPLAQPVFA VLWIATVLGN TGSFMRDVAS SWLMTDLSAS PAAVAMVQAA GTLPIFLLAI PAGVLTDILD RRKFLIAVQL LLASVSISLM VLSQTGMLSV SALIGLTFLG GIGAALMGPT WQAIVPELVK REDVKSAVAL NSLGINIARS IGPAAGGLLL AAFGAGITYG ADVASYFAVI AALVWWPRAK NADDVLQENF FGAFRAGLRY TRSSTTLHVV LLRAAIFFAF ASAVWALLPL VARQLLDGGA SFYGILLGAV GTGAIGGALV MPKLRQRLSS DGLLLGAALV TAVVMGVLSL APPKIVAIIV LLFLGGAWIT ALTTLNGAAQ AVLPNWVRGR GLAVYLTVFN GAMTAGSLGW GAVGEAVGIQ ATLLIGAVGL LVAGFIMHRL KLPTGDADMV PSNHWPEPLV AEPVAHDRGP VLILIEYKVE KEHRSAFLHA IDHLSKERRR DGAYGWGITE DSADPEKIVE WFMVESWAEH LRQHKRVSNA DADLQSKVLG YHIGPDKPVV RHFLTINRPD AA
|
| |