Gene Information |
Locus tag | Rleg_6859 |
Symbol | |
ID | 8022442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 310346 |
End bp | 312334 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644833725 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002984859 |
Protein GI | 241666775 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
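The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 310346
end_bp = 312334
gene_length_bp = 1989
protein_length_aa = 662


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length, computed_protein)  # prints: 1989 662
```

Under these assumptions both computed values agree with the Gene Length and Protein Length rows.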
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.810171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCGA GCGACGCCAA GATCACGGAG GCGGAATTGG AAAATGACCT CGGTCCCCGC CGCGTCTATT CGACCTCGCC ATCCGCCCCT TCACGCATGC CTGCCTTCGC GAGTGTACTC GCTGTTATTG CCATTCTGTA TTTCGGCAAG GAAGTACTTC TTCCTCTAGC AATCGCAGTC CTGTTGACGT TTGCGTTGGC TCCCATCTCC TCTCGTCTTC GAAAACTTGG GATGCCGCGT ATTCCGGCGG TAATCGTCAC CGTCGTGATC GCTTTTCTTG TTCTCGTCCT GTTCGGGCTT GTCGTAGCGG GACACGTAGC CGAAGTCGCC CAGAACCTTC CGGCCTACCA AGGCAACATC ATAGCAAAAA TTCGGTCTCT CCAGGAAAGT GGAACGGATA GCGGTATTGT GCGGCGCCTG ACATCCGTCG TTGAGAGCGT CGGTCGTGAG CTCAGCAACG CTGAGCAGCG CCCAGGTGCT CCAGGTACCG GATCAAGACC AAGGGAGCCC GTGCTCGTTG AGATATTCGC GCCTAGTAGA CCAATTGAAA CGCTTACTAG CCTGATTGGT CCTCTGCTTG GCCCCATCGC TTCATTGGGC TTGATCATCG TCGTCGTAAT ATTCATGCTT TTAGAGCGGG AAGAGCTTCG CGATCGCTTT ATCCGGCTCG TTGGCTATGG CGATCTGCAT CGCACAACAG AAGCCATCCA GGAGGCAGGT AGCCGCGTCG CGCAGTATCT TCTTATGCAG CTGGTGGTCA ACTGCGCCTA CGGTGTCCCA TTGGCGCTCG GACTATGGGC GGTTGGCATT CCAAATCCAG CGCTCTGGGG GATGCTAGCC ATCGTCCTTC GATTTGTCCC TTATATCGGG CCTGTAATCG CGACGGTTCT GCCGCTGTTC CTGGCCTTTG CAGTTGACCC TGGCTGGAGC CTTGTTCTCT GGGTTGGGGC CATCTTCCTT GTGCTGGAAT TGACCAGCAA CAACGTCATC GAGCCCTGGC TCTATGGTTC TCGTACCGGC CTCTCTCCGC TGGCAATCAT CGTCGCGGCG ATTTTTTGGG CGTGGCTGTG GGGACCAGTC GGTCTTGTGC TGTCCACGCC GCTGACCGTG TGTCTCGCAG TGTTGGGCCG GTATGTCCCG CAGTTCGAGT TCCTTGAGGT CGTTTTTGGT AGTGACCCTG TGCTCGATCC CAAGGAGCGG CTATACCAGC GGCTTCTTGC CGGCGATCCC GATGAGGCGA CCGATTACGC TGAGGAATTC CTCGAGGAGG ACTATCTGGA GGATTACTAT GGCAAGGTTG CCATTCCCGC CCTTCTACTC GCGGAGAAGG ATAGGCGTCG AGGCGTCTTG ACCGCGGAGC AGATGGAACA GGTGTTCGGG ACCGCCATCA CGCTGGTCTC GAATCTCGCG GAAATCGCGC AGGAAGAAGA GCAGGAAGAG GAAGAGGAAG AGGAACAGAA GGAAGCGGCA GGTCGGCCCA GCCCCCCCAA GGAAGGGAAT GGCGATGAAA GCGAACTACC TGACGGACGG GGCAAGACCG TCTTTTGCGT CGGAGGCAGG GGCCCGCTCG ACGACGCGTC GGCTGCGATG CTTGCCCAGA TACTTCAGGT ACAGGGAGCC GAGGTGGTCG CAGCGAGGCA TTCCGACATC CCCAATCGCC GCGCCATGAG CCTCGTTCCG AAACAATCGA ACGCCATCGT AGTTTGTTTC CTCAATGAGG ACTCGACAAG GCACGCCACC ATACTTGTTC GTCGGTTCAA GCGCATATAC CCAGCCATTC GCGTCGGCGC GGTCCTTTGG 
GCGGAAAACC AGAAGGAAAG GCAACCGCCC GCGCTTGGGG AGGCAGATTT CGTTGCCACA ACCTTGACCT CGGCTGCCCG CGAGGCACTC GCCGATGCAC CACCATCATT GGTGACGACC GCGCGCAAGA TTCGTACCCG GCGGTCCTCG AACAAGACAG GCATAGCCGC CGCGCGCTCC GGGATTTAG
|
Protein sequence | MSSSDAKITE AELENDLGPR RVYSTSPSAP SRMPAFASVL AVIAILYFGK EVLLPLAIAV LLTFALAPIS SRLRKLGMPR IPAVIVTVVI AFLVLVLFGL VVAGHVAEVA QNLPAYQGNI IAKIRSLQES GTDSGIVRRL TSVVESVGRE LSNAEQRPGA PGTGSRPREP VLVEIFAPSR PIETLTSLIG PLLGPIASLG LIIVVVIFML LEREELRDRF IRLVGYGDLH RTTEAIQEAG SRVAQYLLMQ LVVNCAYGVP LALGLWAVGI PNPALWGMLA IVLRFVPYIG PVIATVLPLF LAFAVDPGWS LVLWVGAIFL VLELTSNNVI EPWLYGSRTG LSPLAIIVAA IFWAWLWGPV GLVLSTPLTV CLAVLGRYVP QFEFLEVVFG SDPVLDPKER LYQRLLAGDP DEATDYAEEF LEEDYLEDYY GKVAIPALLL AEKDRRRGVL TAEQMEQVFG TAITLVSNLA EIAQEEEQEE EEEEEQKEAA GRPSPPKEGN GDESELPDGR GKTVFCVGGR GPLDDASAAM LAQILQVQGA EVVAARHSDI PNRRAMSLVP KQSNAIVVCF LNEDSTRHAT ILVRRFKRIY PAIRVGAVLW AENQKERQPP ALGEADFVAT TLTSAAREAL ADAPPSLVTT ARKIRTRRSS NKTGIAAARS GI
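The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration rather than the tool used to produce this record: it builds the codon table by hand from the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted), and it translates only the first 30 and last 9 bases of the gene sequence shown above. The names `CODON_TABLE` and `translate` are assumptions introduced for this example.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence in this record.
first_30 = "ATGTCTTCGA GCGACGCCAA GATCACGGAG"
last_9 = "GGGATTTAG"

print(translate(first_30))  # prints: MSSSDAKITE
print(translate(last_9))    # prints: GI
```

The first ten residues match the start of the protein sequence, and the final codons give the C-terminal residues G and I followed by the TAG stop codon.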
|
| |