Gene Information |
Locus tag | Rleg2_1424 |
Symbol | |
ID | 6980152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1448807 |
End bp | 1449997 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643396145 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002280944 |
Protein GI | 209549027 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.935376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.465976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATA CAGCTGTGCT AACGAAGGTC GATGGCGGCG AATTTACCGT TTTGTGCATC GGCGGGTTTC TTCTGTCGAT CGCCTATGGG GTGACCTTTC TGATCCCGGT GCTGGTGGGG CAACGCGGCG GCAACGAGGC GCTCGCCGGC CTGATCATTT CGGCCGCGAC CGTCAGCACC GTCATTCTGG TGATCCTGTC CGGCCACATC GCCGATGCCA TCGGCTCGGC GCGGGCGGTG GCGGTTTCAG CGCTCTTTCT GGCGGCATCC GGGCTGGGAT TTGCCACCGT CCCCGCGGCC GGGTTGAGCC TGATGACTGT CGGTTTCGTC CTCGGCATCG GCTGGGGCAC CTTCTATGCG CTCGGCCCCA TCCTGGTGGC GGCGATCACT GAACCCGAGC ACCGCATCCG GTTCTTCGCC CTGCTGTCGG GATCGATGAT GTCGGGCATC GGCGCCGGAC CGATCATCGG CCGCATTGCC ACCGGCTGGT CTTTGCCGAT CGAGACGGCC TTCGCGTTTG CATTCCTCTC CAGTCTCGCC GGCGGCGCGC TTTATTTCCT GCTGCATCTC AGGCTGACCA ATGCCGGCAA GATTTTGCCG CATGTCAACA AGATCTCCTT TGCCGCTGCG CGTCAGGTGA TCGGGTCGCG GGCGATCTAT TCCATCGCCA TGGTCGGCAT CGGCGGCGCG ATTTTCGGCG GGCTCTCCAG CTTTCAGACC AGCTACGCCA AGGCGCACGG TTTCGATTAT TCGCTGTTCT TCATCGGCTT CATGTCCGCC GCCATTCTGA GCCGGCTGTT CGTGGCGGGA TATGTGGTCA AGAAGGATCC ATTCTATTCG CTTCTGGTGC TGACGAGCCT GACGCTGGCA TCCATCCTGC TGTTTCTGGT GGTGACATCA AGTCAGCTGG CCTATCTCGG AGCCGCGGCA ATGCTGGGCC TGGGATATGG GCTAACCTAT TCCGTCATCA ACGGGCTGGC CGCAAATGAA GCGCCGGCAG GTCTCATGCC GCAATCGCTT CTCTTGTTCT CGCTGTCCTA TTTCATCGGT GTCTTCGGTT TTCCGCTGAT TGCCGGGAAC CTGATCGTTT CCTCCGGGAT TGAAGCCATG CTGCATGTCC TGCTACTGCT TGCCGTCGTC AATCTGGCAA TCGTGCTGTT TCGTATGGCC CGCCGCGCGA GGAATGGCTG A
|
Protein sequence | MSDTAVLTKV DGGEFTVLCI GGFLLSIAYG VTFLIPVLVG QRGGNEALAG LIISAATVST VILVILSGHI ADAIGSARAV AVSALFLAAS GLGFATVPAA GLSLMTVGFV LGIGWGTFYA LGPILVAAIT EPEHRIRFFA LLSGSMMSGI GAGPIIGRIA TGWSLPIETA FAFAFLSSLA GGALYFLLHL RLTNAGKILP HVNKISFAAA RQVIGSRAIY SIAMVGIGGA IFGGLSSFQT SYAKAHGFDY SLFFIGFMSA AILSRLFVAG YVVKKDPFYS LLVLTSLTLA SILLFLVVTS SQLAYLGAAA MLGLGYGLTY SVINGLAANE APAGLMPQSL LLFSLSYFIG VFGFPLIAGN LIVSSGIEAM LHVLLLLAVV NLAIVLFRMA RRARNG
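
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; both assumptions are consistent with the values listed above. The function names are hypothetical helpers, not part of any database API. `gc_percent` ignores the spaces that group the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Rleg2_1424).
print(gene_length(1448807, 1449997))  # 1191
print(protein_length(1191))           # 396
```

Passing the Gene sequence field to `gc_percent` gives a value to compare with the GC content reported for this gene.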