Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4071 |
Symbol | |
ID | 6982842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4247299 |
End bp | 4248930 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398801 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_002283559 |
Protein GI | 209551642 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTACA CTTCGTCGAC ATTGGCGCCT TTGCGGCACG ATACCTACCG CACCATCTGG TTCGCCAGCC TGTCGTCGAA TTTCGGCGGC CTGATCCAGG CGGTCGGTGC CGCCTGGATG ATGACGACGA TCACCGCCTC GGAGGACATG GTCGCGCTGG TGCAGACCTC GACGGCGCTG CCGATCATGC TGTTTTCGCT GATTTCAGGG GCGCTCGCCG ACAATTACGA TCGCCGCCGG GTGATGCTGA CTGCGCAGTG TATGATGCTG ACAGTCTCGG CGCTTTTGAC GGCCACGGCC CTTCTCGGCT GGATCACACC CTGGCTGCTG CTCTTCTTCA CCTTCCTGAT CGGCTGCGGC ACCGCGCTCA ACAACCCTTC CTGGCAGGCC TCGGTCGGCG ACATGGTGCC GCGCGCCGAT CTGCCGGGCG CCGTCACGCT GAACAGCATG GGTTTCAACA TTACCCGCAG CGTCGGCCCG GCCATCGGCG GTGTCATCGT CGCCGCCGCC GGCGCGGCGG CGGCCTTCGC GGTGAACACC GTGAGCTACC TCGCTTTGAT CTATGCCCTG CTGCGCTGGC GCCCAAGCAC GCCGGTCTCG ACCCTGCCGC GCGAGGCGCT CGGCAGCGCC ATCTTCGCCG GCCTGCGTTA TGTCTCGATG TCGCCCAATC TCGAAAAGGT TCTCCTCCGG GGGCTGCTCT TCGGCATCGG CGCCAGCTCG ATCCTGGCGC TGCTGCCGGT CGTGGCACTC GATCTCGTCG GCGGCGGCCC GCTGACCTAT GGTTTCATGC TCGGCGCCTT CGGCATCGGC GCGATCGGCG GCGCGGTGTT GAATGCGCGG CTGCGCCAGA TGCTGTCGAG CGAGATGATC ATCCGTCTGG CCTTTACAGG CTTCGCGCTG AGCGCCGTCA TCGCTGCCTT CAGCCCAAGC GCAGTGCTGA CCTCGGCCGG GTTGCTCATC TCCGGCGCCT GCTGGGTCTC CGCACTGTCG CTCTTCAACA CCATCGTCCA GCTGTCGACG CCGCGCTGGG TGGTGGGACG GGCGCTGTCG CTCTACCAGA CCGTCACCTT CGGCGGCATC GCCGGCGGCA GCTGGCTCTG GGGTGTGGCC GCCGATCGCT ACGGTGTCGC CGACGCGCTG CTGATGTCAT CGGTCGTCAT GCTGCTCGGC ATCGTGATCG GCCTGCGCTT TTCCATGCCG GCCTTTGCCT CGCTCAATCT CGATCCGCTG AACCGCTTCA CCGAGCCGGC TCTCAGCCTC GACATCACCC CCCGCAGCGG CCCGATCGTC ATCCAGGTCG ATTATGAGAT CGGAGATGAC GACCTTGCCG AATTCATGCA GCTGATGGGC GAACGCCGCC GTATCCGCAT CCGCGACGGC GCCCGCAACT GGGCTTTGAT GCGCGATCTC GAAAATCCCG GGCTCTGGAC GGAAACCTAC CATACGCCGA CCTGGGTCGA ATATATAAGA CACAACCAGC GGCGCACGCA GGCCGATGCC GAAAACACCG ACAGGCTTCG TGCGCTTCAT CGCGGCGAAG GTCCGCTGCA TGTCCACCGC ATGATCGAAC GCCAGGCCAT TCCATCCGGC GACGACGTCT TCCATAAAGC GCCGATCGAT CTGCATCATT GA
|
Protein sequence | MAYTSSTLAP LRHDTYRTIW FASLSSNFGG LIQAVGAAWM MTTITASEDM VALVQTSTAL PIMLFSLISG ALADNYDRRR VMLTAQCMML TVSALLTATA LLGWITPWLL LFFTFLIGCG TALNNPSWQA SVGDMVPRAD LPGAVTLNSM GFNITRSVGP AIGGVIVAAA GAAAAFAVNT VSYLALIYAL LRWRPSTPVS TLPREALGSA IFAGLRYVSM SPNLEKVLLR GLLFGIGASS ILALLPVVAL DLVGGGPLTY GFMLGAFGIG AIGGAVLNAR LRQMLSSEMI IRLAFTGFAL SAVIAAFSPS AVLTSAGLLI SGACWVSALS LFNTIVQLST PRWVVGRALS LYQTVTFGGI AGGSWLWGVA ADRYGVADAL LMSSVVMLLG IVIGLRFSMP AFASLNLDPL NRFTEPALSL DITPRSGPIV IQVDYEIGDD DLAEFMQLMG ERRRIRIRDG ARNWALMRDL ENPGLWTETY HTPTWVEYIR HNQRRTQADA ENTDRLRALH RGEGPLHVHR MIERQAIPSG DDVFHKAPID LHH
|
| |