Gene Information |
Locus tag | Rleg2_6432 |
Symbol | |
ID | 6983503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 93542 |
End bp | 95746 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643399429 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002284185 |
Protein GI | 209552270 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
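The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon is part of the CDS but not the protein.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 93542, End bp 95746.
length = gene_length_bp(93542, 95746)
protein = protein_length_aa(length)
print(length, protein)  # 2205 734
```

Under these assumptions the result matches the listed Gene Length (2205 bp) and Protein Length (734 aa).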
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.402351 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATTT CCACTCTTCC GCCGGACCGC AATCTTCCGA TCTCACCGAT GCGCTATGAT CCTCGCTTTC AGGTTGAGAC CAAGGCGGAC AACTTCGATC TGGCATCCGT CTTCGGTCTC ATCAGGCGAA GAATGGTCGT GATCATCACG ATAACCGTGC TGCTGATGGT GCCCGCGGTG ACGACGATCT CCGGGTTGAA GCCCACATAT CAGGCATGGG CCCGACTGAT TGTCCATCAG CCTCTCGCGA CATCACTTGG TGGGGACGAT ACCAGCCGCG GTGATGCACT TGACGTGAAA TCGGAGACTG AGCGGCTCTT ATCCAGAAGC ATCGCAGAGC GAGTCGTTCG CGAAATGCGG CTCAACATGC AGCCGGAATT CAATCCAGCG CTACGCAAGG TCTCGTTCAT TCAAAGGATC CGGGCAAAGC TGCGCAGCCT ATTCGACACG AAAAAACCCG CTTTGGCGGT AGGAGACGGC ATCGACGCCA CAATCCCGGA ATATTATCAG GCGCTCGGTG TGTGGCACGA AGACAAGAGT GAAGTGATCC AGATCAGCTT CACTGCGAGT GACGCTGAAC TCGCCGCAGC GGTGCCGAAT CGGCTCGTCA GCATTTACCT GGATGAGCGG AAGGGTAGTG TCAGTGGCCG CCTGGTTTTA GCGGAAGAAT GGATCCGACA GCGAATCACC GAACAGCAGG TTCGCGCGGA TATTGCTCGC GACGCTGCCC GCAACTACCA AGAATCGATG GACATCGTCT CGAAAGAGGA CGCTCAGGAT GAACGTATCA AAACACTCCT GGAACTCACC GGCCGCGAAG GAAAAATCGA GGAAAGCCGG GTTGAGGTAA AGGCGACGAT ATCCGCGCTT GAGGCAACGG ATAACGCTTC GGTCGCGGTG CAGAACACCA TTGTCCCGGA CAGCATTACC TCAAAGCAAC GCGATCTTCA CGCACAAGAG CAAGATCTCG CCCGTCTTCT TGAGACATAT GACGGCAATG CCGAGGCCGT AGTGGATTTG CGTAACAAGA TCGAACAATC CCGTATCGAT CTCGACCTTG CGACCAAACA ATATCTCCAG GCAATGCGTG CCAAGCTCGC CGCACTTGAA CATGAAGCCA ACGCCGTGCA GTCCATCCTG GCTGTCGCTC AGGAAAAACG CACACGCGAT GCCCTGGCGC AGAATGAATT GACGCGACTG CAGCGTATTG CCGAAAAGGA GCAAGCGGCG CTCGACAAGC TTGAGGACCA GCGCCGTGAC CTTGCCGCCA AGGCCATGCT GCCTGGGGCG GAATTGGAGG TTTTCTCACC GGCTTCAGTG CCGCTCGTCT CACAGGGACG TGGACGGCTT TCCTATCTGA TAGGCGCCCT CCTGGCTGCA ATCTCGGCAG CCGTGACGGC TGCTTTCGTG GTCGAAATGT GGGACCGGTC GGTCCGCAGT TTCGACCAGC TTGCCGGAAT TGCACGCACC ATACCGGCTG GACTTATCCC GCGTCTGGCG CGAAGTGAAA ATTCGAAGAC GATGTTCGCA CATATGCCGG TCCCGATGTT TGACGAAGCA ATCCGTACCG TGGTGCTTTC ACTCAAGCAG TCCAATGGCG GCAAATTGCC GAACAGTATT GTCGTTACCT CGGCTCATAG CGGAGATGGC AAATCGCTTG TCGCGCGATC GCTGGCCATG GAACTTGCCG CTGCCGCGAT TCCGGTCCTG CTGGTCGATG GCGACCTTCG ACGGGGAAAA CTCGATGCGT TTTTCAGGTC GGGGGTCATG TGCGGACTGA ACGAATTCCT GAACGGTCAG 
GCCGATTTCC GCGACATCGT CTACCACCAT CCAAGCGGCA TCGATTTCAT TCCATCTGGA AAATTCAGTC TCCAGCGGCG GGTCCGTCCG AACGGCCTGG CAGAAATCAT CGAGATGGCG GTCGCGGCCG GCCAGATTGT CATCTTCGAC AGCGCGCCTG TACTCGCCTC GGCAGATACC GTGCATTTGA CCTCCCTGGC GGAAAGAACA CTGGCAATTG TTAGATGGGG AAAGACGAGC CGACGCGCGG TCGAGTTCAG CCTGCAGCAA ATGAAAAGCT CGCGAAATTC GGAAATCATC GTCGCGATCA ACAACGTAAA CCCCGAAAAG CACGCATTGT ATAACTTCAG CGACTCGGAA TTATTTTCGA AATCTCTGAT GAAATACTAT AAATTCAAAG CATGA
|
Protein sequence | MTISTLPPDR NLPISPMRYD PRFQVETKAD NFDLASVFGL IRRRMVVIIT ITVLLMVPAV TTISGLKPTY QAWARLIVHQ PLATSLGGDD TSRGDALDVK SETERLLSRS IAERVVREMR LNMQPEFNPA LRKVSFIQRI RAKLRSLFDT KKPALAVGDG IDATIPEYYQ ALGVWHEDKS EVIQISFTAS DAELAAAVPN RLVSIYLDER KGSVSGRLVL AEEWIRQRIT EQQVRADIAR DAARNYQESM DIVSKEDAQD ERIKTLLELT GREGKIEESR VEVKATISAL EATDNASVAV QNTIVPDSIT SKQRDLHAQE QDLARLLETY DGNAEAVVDL RNKIEQSRID LDLATKQYLQ AMRAKLAALE HEANAVQSIL AVAQEKRTRD ALAQNELTRL QRIAEKEQAA LDKLEDQRRD LAAKAMLPGA ELEVFSPASV PLVSQGRGRL SYLIGALLAA ISAAVTAAFV VEMWDRSVRS FDQLAGIART IPAGLIPRLA RSENSKTMFA HMPVPMFDEA IRTVVLSLKQ SNGGKLPNSI VVTSAHSGDG KSLVARSLAM ELAAAAIPVL LVDGDLRRGK LDAFFRSGVM CGLNEFLNGQ ADFRDIVYHH PSGIDFIPSG KFSLQRRVRP NGLAEIIEMA VAAGQIVIFD SAPVLASADT VHLTSLAERT LAIVRWGKTS RRAVEFSLQQ MKSSRNSEII VAINNVNPEK HALYNFSDSE LFSKSLMKYY KFKA
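The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, assuming the blocks are separated only by whitespace; `clean_sequence` and `gc_percent` are hypothetical helper names. It shows how a GC percentage such as the listed "GC content" could be recomputed from the gene sequence. The example uses only the first two blocks of the gene sequence, so its output is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character display blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in this record.
example = clean_sequence("ATGACGATTT CCACTCTTCC")
print(len(example), gc_percent(example))  # 20 45.0
```

Passing the full cleaned gene sequence to `gc_percent` would give the value to compare against the GC content field.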