Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0100 |
Symbol | |
ID | 8011341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 96024 |
End bp | 97955 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644822691 |
Product | hypothetical protein |
Protein accession | YP_002973950 |
Protein GI | 241202854 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTTG TTCTAAACAC CTTTGGCAGG TTGCAGCTCG TTGACGGGGA GGGGAGCCTC GTCGCCTTTC CCGAGAAAGG TTTGCTGCTT CTCGTCTATT TGTTGACGAC CGGTGAAGGT TCGGCGGATC GAACGACCTT GGCGCGTTTT CTGTGGGGCG ATGCCGATAG GGACGTTGCG CTTTCGACGT TGCGTAAGCT GATTTCGAGA GTGAAGGCCC GTCAAGCCGA ACTCGGAATA AACATTCTCT CATCCCAGGG CAACATGGTC TCTCTCGACC GAAAGTCCTT GTCTTCCGAC CTCCTGCTAT CCGAGACCGA CGAAGCGGTC GCGTCGTTCT CTCTGCTTAA ACATCTTGTG AAGCTGCTGA ACCAACCCTT TCTGGGGCCG GTTCACTGCC ACAGCCGCGA GTTTCAACAG TGGCTTGCCG AGCGCGAAAA ATGCCATATC GACCTTCTCG CAAATACATT GAAAACGGTA TCGCGACGAG CGCAGTCGAG AGCGGAATCA GAACTCCTGC GAAAGGCCGC CATTATCCTG TTTCGGACGG AACCGAAGGA TCCGGATACG CTGCAGTTGC TGATAGAGAT ATTCAAGGCG GAGGAAGAGG TTGAATCGCT TCGGACCTAT TTCGAACAGC GGCGTAATTC GATTTCGCGA GGGATCGCGG TACGCGGCGC ATCCGACGGC GCTGACACAA AACCCGTTCG CCCGGCGTTA GTGCCGTCAA GGGAAAAACA CGTAACCGCT GCTTCCCTTG AGCCCGAGGA CGTCAGCATT GCAATTCCTC GCCTGGTGCT GCTTCCACCC AGAAATCAAT CCATTCATCC CCAAGCCGGT TTTTTAGCTG CGTCTTTGGT GGAGGATATC ACGATTGGAT TTTGCGCCTT CAACAGCCTG CAGGTCATAG CCCCATATTC GGCGGTGCAA ATTGGCCACC ACATGGAGAC CCAGAAGGCC TTCTTTGAAC GACATCACGT CAATTACATT CTCGACACTC GGATCAGCAA TGCGGGCGAT GACGTCACCC TGTTCGCCCA ACTGATCTTT TTTGACCAGA ATCAAATTGT CTGGGCAGAG AGGTTCAGCC TTGATCATCG GGATCTTGTC AAAGACAGGA GGACTGTCTC TCGACGGATT GCTCTTTCCA TATCCAGCGA AATCGAGCGC CATGAGGCGT TGCGCGAGGA TTTGAACCCG GCTGCCTACC ATCGATATCT CGTTGGCAGG CGGCATCTGG CGCGGCTGAC ACTTCCGAAT CTACGGCGTG CGCGCAAGGA GATGAAAGCC GCGCTCAGCC TCAGCCCCGA TTTCGCACCG GCGCTGAGTT CAATGGCGCG GACTTACTCC AAGGAATGGT TGTTGACCGC GCGGGGTGAT ATCGATCTGT TGAAAACGGC AGAGATCTTG GCAAAGCAGG CCACCGAAAC GCGTCCAGAT TTTGCCGATG GATATCGCGA GTTCGGCGTG GCGAAATTGT TGCAGGGTGC ATTTGACGAA AGCGCCGAGG CAATGGAAGT GGCGGAGAGC CTTGCCCCGC ACTATGCGGA TGTTATTGCC GACTACGCCG ACACTTTGGT TCATTGTTCG CTCCCTGCCA TCGCCTTGCG AAAGATCGAG CGGGCAATCG AGCTGAACCC GCTCAGCCCC GACACCTATT TCTGGACCGC TGCCGGCGCA AATTATGCCC TTGGCGAATT CGAAGCTTCG CTGGATTACA TTGGGCAGAT GGCCGATGCC AGTTTGGCCG ACAGGCTAGC GGCCGCAAGC TGGGCCATGT TGGGCCACCA GGACAAGGCG CGGATCTTCG TCAGGAGGTT TCGCGAAGTC AATCCGGACT TCGACGTGGA CAAATGGCTG TCTGCGGTTC CGAGTAAGGA GCAATGGCAT AAGGATCTTT ACCGAGAAGG CCTGAAGAAA GCTGGATTTT AA
|
Protein sequence | MAFVLNTFGR LQLVDGEGSL VAFPEKGLLL LVYLLTTGEG SADRTTLARF LWGDADRDVA LSTLRKLISR VKARQAELGI NILSSQGNMV SLDRKSLSSD LLLSETDEAV ASFSLLKHLV KLLNQPFLGP VHCHSREFQQ WLAEREKCHI DLLANTLKTV SRRAQSRAES ELLRKAAIIL FRTEPKDPDT LQLLIEIFKA EEEVESLRTY FEQRRNSISR GIAVRGASDG ADTKPVRPAL VPSREKHVTA ASLEPEDVSI AIPRLVLLPP RNQSIHPQAG FLAASLVEDI TIGFCAFNSL QVIAPYSAVQ IGHHMETQKA FFERHHVNYI LDTRISNAGD DVTLFAQLIF FDQNQIVWAE RFSLDHRDLV KDRRTVSRRI ALSISSEIER HEALREDLNP AAYHRYLVGR RHLARLTLPN LRRARKEMKA ALSLSPDFAP ALSSMARTYS KEWLLTARGD IDLLKTAEIL AKQATETRPD FADGYREFGV AKLLQGAFDE SAEAMEVAES LAPHYADVIA DYADTLVHCS LPAIALRKIE RAIELNPLSP DTYFWTAAGA NYALGEFEAS LDYIGQMADA SLADRLAAAS WAMLGHQDKA RIFVRRFREV NPDFDVDKWL SAVPSKEQWH KDLYREGLKK AGF
|
| |