Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5242 |
Symbol | |
ID | 8007416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 653161 |
End bp | 654342 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644822150 |
Product | ectoine utilization protein EutD |
Protein accession | YP_002973410 |
Protein GI | 241113575 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | [TIGR02993] ectoine utilization protein EutD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.490883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGC CCAAACTCAA ATTCTCGCTC GGCGAATATG CCGCGCGGCT GGAAAAGACA CGGCGTGCCA TGGAGGCGAA GGGTGTCGAC CTGCTGATCG TCAGCGATCC GTCGAATATG GCCTGGCTGA CCGGCTATGA CGGCTGGTCC TTCTACGTGC ACCAGGCGGT GATCGTGCCG CCGCAGGGCG AGCCGATCTG GTTCGGCCGC GGCCAGGATG CCAACGGCGC CAAATTCACT GCCTATCTGA AGCACGACAA CATCGTCGGT TATCCCGATC ACTACGTGCA GTCGACCGAG CGCCACCCGA TGGACTACCT CTCGGGCATC CTGACCGAGC GCGGCTTCGG CAAGCTGACG ATCGGTGTCG AGATGGACAA TTACTGGTTT TCGGCGGCGG CCTTTGCGGC GCTGCAAAAA CATTTGCCGA ACGCGCGCTT TGTCGACGCG ACCGCCCTCG TCAACTGGCA GCGAGCCGTC AAGAGCGACA CCGAGATCGG CTATATGCGC AATGCCGCCC GAATCGTCGA GGCGATGCAC GCCCGCATCT TCGACAAGAT CGAAGTCGGC ATGCGCAAGT GCGATCTGGT CGCGGAAATC TATGATGCCG GCACCCGCGG CGTCGACGGC ATCGGCGGTG ATTATCCGGC GATCGTGCCG CTGCTGCCGT CCGGCGTCGA GGCATCGGCA CCGCACCTGA CCTGGGACGA CCGGCCGCTG AAGAAGGGCG AGGGCACCTT CTTCGAGATC GCCGGCTGCT ACAACCGCTA TCACCTGCCG CTGTCGCGCA CCGTCTTCCT CGGCAAGCCG ACGCAGGCCT TTCTTGATGC CGAAAAGGCG ACGCTGGAAG GCATGGAAGC CGGTCTTGCA GTCGCCAGAC CCGGCAATAC CTGCGAGGAT ATTGCCAACG CCTTCTTCGC GGTGCTGAAG AAATACGGGA TCGTCAAGGA TAACCGCACC GGTTATCCGA TCGGCCTTTC CTATCCGCCG GACTGGGGCG AGCGCACTAT GAGCCTGCGA CCGGGCGATC GGACGGAGCT GAAGCCCGGC ATGACCTTCC ATTTCATGAC CGGTCTCTGG CTCGACGACA TGGGTTTCGA AACGACCGAG AGCATCCTCA TCACCGACAG CGGCGTCGAG TGCTTCGCCA AAGTGCCGCG CAGGCTGATG GTCAAGGATT GA
|
Protein sequence | MTKPKLKFSL GEYAARLEKT RRAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP PQGEPIWFGR GQDANGAKFT AYLKHDNIVG YPDHYVQSTE RHPMDYLSGI LTERGFGKLT IGVEMDNYWF SAAAFAALQK HLPNARFVDA TALVNWQRAV KSDTEIGYMR NAARIVEAMH ARIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPL KKGEGTFFEI AGCYNRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VARPGNTCED IANAFFAVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR PGDRTELKPG MTFHFMTGLW LDDMGFETTE SILITDSGVE CFAKVPRRLM VKD
|
| |