Gene Information |
Locus tag | Rleg2_0534 |
Symbol | |
ID | 6979250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 547623 |
End bp | 549557 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643395246 |
Product | hypothetical protein |
Protein accession | YP_002280057 |
Protein GI | 209548140 |
COG category | [S] Function unknown |
COG ID | [COG4907] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.24043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGTC GGTTTTTCGG ATTTTGTTTG GCTCTGCTCT TGATGCTGGC CGCCTCGGCG GCCTTCGCCG CCGAGGTCAT CGACAATTTT GCCTCCGCCA TCCAGCTGGA AAAGAGCGGC GCGATGACGG TGGCGGAGAC GATCACCGTC AATGCCGAGG GCAACCGGAT CAATCACGGC ATTTTCCGTG ACTTCCCGCT CTATTTCACC GATGCCGAAG GCCGTCGCCG CAGCGTCGAT TTCGACATGG TGTCGGTCAC CCGCGACGGC GAGGACGAGC CATGGCACAC GGAATCGATT TCCGGCGGCA TCCGCATCTA TGCCGGTTCC GCCGATGTGT CGGTGACGCC GGGCCGTCAC CGCTACGTCT TCACCTACAA GACCAACCGG CAGATCCGCT ATTTCGACGA TCATGACGAA CTCTACTGGA ACGTCACCGG CAATGGCTGG ATCTTCCCGA TCCGCTCGGC GACGGCAACG GTGACGCTGC CACCGGGTGT CACCGCCACC GAGACGACGT TTTTCACCGG CCCGCAGGGT GCGACGGGAA AGAACGCCCG CGTCAGCGAG ACAGGGGCCG GCCTTGTCTT CGCCACCACT GCGCCGCTCG ATGCCAACGA AGGGCTGACT TTCGCAGTCC GCATGCCGAA GGGGTCGATC GATCCGCCGA GCGCGGATAT GGAAAGCACA TGGTGGCTCA AGGACAACCG CAATTATTTC ATCGGCTTCG GCGGCTTGAT CCTGGTTTTC GCTTATTATC TCAGATCATG GCTGAAGGTC GGCCGCGATC CCGCCCGCGG CGTCGTCGTG CCGCGCTGGG ACGCGCCTGA CGGCATCTCG CCGGCGCTGG TCAACTACAT CGACAATAAG GGATTTTCCG GCGGGGGTTG GACGGCGCTG GCTTCAACCG CGCTCAATCT TGCGGTGCGC GGTTACGTCA AGCTCGAAGA CCTGAAGAAT TCGATCGTCA TCCGCGGCAC CGGCAAACCG CTCGGCAAGG AGAAATTCCA GGCCGGCGAG GCCGAATTGC TGAGGGTCGC CGGCGGCGCC GGCGCGACGC TGACGATCGA CAAGGCCAAT GGCGAGCGGG TGAAGTCCGT CGGCCAGAGC TTCCGGTCGG CGATCGAGAA AGAGCATCGC GGCAAATATT ACAATTCCAA CATCGGCTAC ACCACCGGCG GCATCGCGCT CAGCGCCGCC GCCTTGGTGG CGCTGTTCGT TTTCGGCTCG CTGCAGCCGG ACACGATCGC GCTGATGCTG GTCCCGACGG CGATCTCGGT TTTCGTTGCC GTCTTCGTCG CCGGTTTCAT GAGGTCGATG CATCGCGGCA ACTCGCTGTT CGGCAAGATC GTCGCCATTA TCGCCACGGC GGTCGGCGTT TTCGTCGGCA TCAGCATTCT TGCCGTCGTC GTCCTGGCGC TTGCTTCGTC GCTGGTGCAG CTGCACGAAA CGCCGATGCT CTTTGCCGTC GGCGGCATCG TGCTGCTCAA CATCCTCTAT TTCTTCATCA TGGGTGCGCC GACCCCGCTT GGCGCCCGGA TGATGGACGG CATAGACGGC CTGCGCCAAT ATCTGACGCT GGCCGAAAAA GACCGGATGA ACACGGCGGG CGCGCCTGAA ATGTCGCCGC AGCATTTCGA GACGCTGCTG CCCTACGCCG TGGCGCTCGG TGTCGAGAAA CCCTGGTCGC GCGCCTTCGA GACCTGGCTC GCAGCAGCCG CGGCCGGCGC GGCCGCCGCC TATGCGCCCA CCTGGTATTC CGGCAATTTC AACAGCGGCA GTTTCTCCGA CCGTGTCGGC 
GGCTTTTCCT CGTCGATGGC TTCGACTATC GCCTCGACCA TACCCTCGCC GCCGCCGTCG AGCTCGTCCT CCGGTTTTTC CGGCGGTGGT TCGTCCGGCG GCGGTGGCGG AGGCGGGGGA GGCGGGGGCT GGTAA
|
Protein sequence | MGRRFFGFCL ALLLMLAASA AFAAEVIDNF ASAIQLEKSG AMTVAETITV NAEGNRINHG IFRDFPLYFT DAEGRRRSVD FDMVSVTRDG EDEPWHTESI SGGIRIYAGS ADVSVTPGRH RYVFTYKTNR QIRYFDDHDE LYWNVTGNGW IFPIRSATAT VTLPPGVTAT ETTFFTGPQG ATGKNARVSE TGAGLVFATT APLDANEGLT FAVRMPKGSI DPPSADMEST WWLKDNRNYF IGFGGLILVF AYYLRSWLKV GRDPARGVVV PRWDAPDGIS PALVNYIDNK GFSGGGWTAL ASTALNLAVR GYVKLEDLKN SIVIRGTGKP LGKEKFQAGE AELLRVAGGA GATLTIDKAN GERVKSVGQS FRSAIEKEHR GKYYNSNIGY TTGGIALSAA ALVALFVFGS LQPDTIALML VPTAISVFVA VFVAGFMRSM HRGNSLFGKI VAIIATAVGV FVGISILAVV VLALASSLVQ LHETPMLFAV GGIVLLNILY FFIMGAPTPL GARMMDGIDG LRQYLTLAEK DRMNTAGAPE MSPQHFETLL PYAVALGVEK PWSRAFETWL AAAAAGAAAA YAPTWYSGNF NSGSFSDRVG GFSSSMASTI ASTIPSPPPS SSSSGFSGGG SSGGGGGGGG GGGW
|
| |
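The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon (the gene sequence above ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Rleg2_0534 on NC_011369).
length = gene_length_bp(547623, 549557)
residues = protein_length_aa(length)
print(length, residues)  # prints: 1935 644
```

Both results agree with the Gene Length (1935 bp) and Protein Length (644 aa) fields listed in the record.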