Gene Information |
Locus tag | Rleg_5241 |
Symbol | |
ID | 8007415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 652136 |
End bp | 653128 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644822149 |
Product | ectoine utilization protein EutC |
Protein accession | YP_002973409 |
Protein GI | 241113574 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2423] Predicted ornithine cyclodeaminase, mu-crystallin homolog |
TIGRFAM ID | [TIGR02992] ectoine utilization protein EutC |
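The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(652136, 653128)
print(length_bp, protein_length(length_bp))  # 993 330
```

Under these assumptions the computed values agree with the reported gene length (993 bp) and protein length (330 aa).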
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.326235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCGGA TGATCATTCT GACGGAAGCG GAACTGCGGA AAGTCATCGC GCTTGATCGC GATGCGGTTG ATTGCGTCGA GGCCGCTTTC GCAGCGCTTG CGACCAAGGC TGTCGCCATG CCGCCGATCC TGCGGCTCGA CATTCCGGAA TATCGGGGCG AAGTCGACGT AAAGACCGCC TATGTGCCCG GCATCGAGGG CTTCGCAATC AAGATCAGCC CCGGCTTCTT CGACAACCCC AAGATCGGCC TGCCGAGCAC CAACGGCATG ATGGTGCTGC TGTCGAGCCG AACCGGACTG GTGCAGGCGC TGCTCTTGGA CAACGGCTAT CTCACCGACG TGCGCACCGC AGCGGCCGGC GCCGTCGCGG CAAAACATCT GTCGCGGGAA AATGCGTCCG TGGCCGCGAT CTTCGGCGCC GGCATGCAGG CGCGGCTGCA GCTCGAGGCA CTGACGCTGG TCCGGCCGAT CCGCGAAGCG AGGATATGGG CGCGCGATTC TGCCAAGGCG CAAAGCGTGG CAGCGGAACT GGCCGCAAAG CTCGGCTTTT CCGTCACCGC CACACCGGAC GCCAGAGGCG CAGTGACCGG CGCCGATCTC ATCGTTACCA CCACGCCTTC CGAAACCCCG ATCATCGAGG CCGGGTGGCT GGAACCCGGA CAGCATCTGA CGGCCATGGG CTCGGACACC GAACACAAGA ACGAGATCGA TCCGGCCGCC ATTGCGGTTG CTGACCTCTA CGTCGCCGAC AGCCTGAAGC AGACGCGCCG TCTCGGCGAG TTGCATCACG CAATCGATGG CGGCCTGGTC GCAGATGACG CGATCTTTGC CGAGCTCGGC CAGATCGTTG CCGGCCGGAC GCGGGGACGG ACGCGCAACG ACCAGATCAC CATTGCGGAC CTGACCGGAA CCGGCATCCA GGACACCGCC ATCGCCACGC TCGCCTTTAC CCGCGCCGGC GCGGCCAATG CCGGGACCAC ATTCGAAAGC TGA
Protein sequence | MSRMIILTEA ELRKVIALDR DAVDCVEAAF AALATKAVAM PPILRLDIPE YRGEVDVKTA YVPGIEGFAI KISPGFFDNP KIGLPSTNGM MVLLSSRTGL VQALLLDNGY LTDVRTAAAG AVAAKHLSRE NASVAAIFGA GMQARLQLEA LTLVRPIREA RIWARDSAKA QSVAAELAAK LGFSVTATPD ARGAVTGADL IVTTTPSETP IIEAGWLEPG QHLTAMGSDT EHKNEIDPAA IAVADLYVAD SLKQTRRLGE LHHAIDGGLV ADDAIFAELG QIVAGRTRGR TRNDQITIAD LTGTGIQDTA IATLAFTRAG AANAGTTFES
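The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any check such as GC content or stop-codon position. The sketch below is one possible way to do that; the helper names are hypothetical, the stop-codon set is the one used by NCBI translation table 11, and the toy input is not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in NCBI translation table 11


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Hypothetical toy input; paste the record's gene sequence here to check it.
toy = clean_sequence("ATGAGC CGGTGA")
print(len(toy), ends_with_stop(toy))  # 12 True
```

Applied to the full gene sequence, `gc_fraction` could be compared with the reported GC content of 65%, and `ends_with_stop` with the terminal `TGA` shown in the record.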