Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2209 |
Symbol | |
ID | 8013218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2211562 |
End bp | 2212662 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644824795 |
Product | hypothetical protein |
Protein accession | YP_002976025 |
Protein GI | 241204929 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0305153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.169215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGCC GAAACCTGTC TCTTGCCCTT CTCCTTGCCC TTGCGGCGCC GGCCGCCGCG CAAAGCAGCG CCGTCTGTGA GGACCTGCGC GGTCGCCTTG CCGACTTGCC GCGATCGATC GGCAATGGCA ACGGCCCGGA GGCGCGCCAA TATTCCAGTG CCATGGCCGA ACAGAACCTC GAGCTGCGCA AGGTTCGCAA CGAGCTGCGC AGCAATGACT GCACCTCGGG CAGCATGGTC GTGATCGGCG GCGAGAATGC CGATTATTGC GCCGAACTCT CGCAGTCCGA AGCCCGCATG ATCGACAATA TCCGCTATCT CCAGGACCGC CGCAACGAAC TGGCCGGCCA GAACGGTGCC GATGACGGCG CGCGCCGCGA ACTGGTCGCG GCTCTCGACC GCAATGGCTG CAACAGCGAA AATTTCTATG CTCCGAGCGA TCGCAGCGCC AACGAACCGG CTCCGAGCGT CGAGGAACAG GCGATGCGCA CTGATACCTT CATTCCGCTC GGCGGCGGTG AAGAGGTCGA TCCGCGCTAC GACCTGCCGC GGGCCGAGAT GCTCTCGCCG GTCAGCACCA TGTGTGTGCG CAGCTGCGAC GGTGGCTTCT TCCCGATCAG CTCGAACGCC ACCTCGGTCG ATTTCGGCCG CGACGCCCAG ACCTGCGCCA AGATGTGCCC GGGCATCGAG ACCGAACTGT TCTATCGTGA CGTGACGAGC ACCGAAGCCT CGAACATGAT CTCGGTCGCA ACGGGCACGC CCTACAGCGC CATGAAAAAT GCCTTTGCCT ACAAGAACCG CACGCCCGGG GAAAAGTCCG CTTGCACCTG CAATCTCACC GCCTATTACG AGGAGATGCG CGGCAAGCAG ACGTTGAGCG AACCGCCGCA GCAGGGCTCG ATCACCACCA TCCGCACCAA TCCCCCGGCG AAGGATGCGG CAGCGCAGAT CGCACCGCAG CCGTCCGTTC CCGAGCGCCC CTACGATCCG GCGCAGAACC GCGTCCGTCA GGTCGGCCCG CAGTTCCTGG CCGGCGATCA GGGCTCGATC GATCTCGCCA ACCCGGCAAC GTCAGGTCCG CAGCCGCAAC AGCAGCAGTG A
|
Protein sequence | MKRRNLSLAL LLALAAPAAA QSSAVCEDLR GRLADLPRSI GNGNGPEARQ YSSAMAEQNL ELRKVRNELR SNDCTSGSMV VIGGENADYC AELSQSEARM IDNIRYLQDR RNELAGQNGA DDGARRELVA ALDRNGCNSE NFYAPSDRSA NEPAPSVEEQ AMRTDTFIPL GGGEEVDPRY DLPRAEMLSP VSTMCVRSCD GGFFPISSNA TSVDFGRDAQ TCAKMCPGIE TELFYRDVTS TEASNMISVA TGTPYSAMKN AFAYKNRTPG EKSACTCNLT AYYEEMRGKQ TLSEPPQQGS ITTIRTNPPA KDAAAQIAPQ PSVPERPYDP AQNRVRQVGP QFLAGDQGSI DLANPATSGP QPQQQQ
|
| |