Gene Information |
Locus tag | Rleg2_5521 |
Symbol | |
ID | 6978615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1169650 |
End bp | 1170831 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394620 |
Product | ectoine utilization protein EutD |
Protein accession | YP_002279438 |
Protein GI | 209547520 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | [TIGR02993] ectoine utilization protein EutD |
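The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both assumptions match the numbers in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1169650
END_BP = 1170831


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1182 393
```

The result agrees with the listed gene length of 1182 bp and protein length of 393 aa.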
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00590765 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCAGC CCAACCTCAA ATTCTCGCTC GGCGAATATG CCGCGCGGCT GGAAAAGACG CGGCGCGCCA TGGAGGCGAA GGGCGTCGAC CTGCTTATTG TCAGCGACCC GTCGAACATG GCCTGGCTGA CCGGTTATGA CGGCTGGTCT TTCTACGTGC ACCAGGCAGT GATCGTGCCG CCGCAGGGCG AGCCGATCTG GTTCGGCCGC GGCCAGGATG CCAACGGCGC CAAGTTCACC ACCTATCTGA AGCACGACAA CATCGTCGGT TATCCCGATC ATTACGTGCA GTCGACCGAG CGCCATCCGA TGGATTACCT CTCGGGCATC CTGACGGAGC GCGGCTCTAG CAAGCTGACG ATCGGCGTCG AGATGGACAA TTACTGGTTC TCGGCGGCCG CCTTCGCCGC GCTGCAGAAA CATCTGCCGC ATGCGCGCTT CGTCGACGCG ACGGCGCTGG TCAACTGGCA GCGCGCGGTC AAGAGCGAGA CCGAGATCAA ATATATGCGC AATGCCGCCC GCATCGTCGA AGCGATGCAT GCCCGCATCT TCGACAAGAT CGAAGTTGGC ATGCGCAAAT GCGATCTAGT CGCGGAAATC TATGATGCCG GCACTCGCGG CGTCGACGGC ATCGGCGGCG ATTATCCAGC GATCGTGCCG CTGCTGCCGT CCGGCGTTGA AGCATCCGCG CCGCATCTAA CCTGGGACGA CCGGCCGCTG AAGAAGGGCG AGGGCACCTT CTTCGAGATT GCCGGCTGCT ACCACCGCTA TCACCTGCCA CTGTCGCGCA CCGTCTTCCT CGGCAAGCCG ACGCAGGCCT TTCTCGATGC CGAGAAGGCG ACATTGGAAG GCATGGAGGC CGGTCTTGCC GTTGCCAAGC CCGGCAACAC CTGCGAGGAC ATCGCCAACG CCTTCTTCGC CGTGCTGAAG AAATACGGCA TCGTCAAGGA CAACCGCACC GGTTACCCGA TCGGTCTGTC CTATCCGCCG GACTGGGGCG AGCGCACCAT GAGCCTGCGG CCGGGCGACC GGACCGAGTT GAAGCCCGGC ATGACTTTCC ATTTCATGAC TGGCCTCTGG CTCGACGACA TGGGTTTCGA AACGACCGAG AGCATCCTGA TCACCGAGAG CGGTGTCGAA TGTTTCGCCA ATGTGCCGCG CAGGCTGATG GTCAAGGATT GA
|
Protein sequence | MTQPNLKFSL GEYAARLEKT RRAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP PQGEPIWFGR GQDANGAKFT TYLKHDNIVG YPDHYVQSTE RHPMDYLSGI LTERGSSKLT IGVEMDNYWF SAAAFAALQK HLPHARFVDA TALVNWQRAV KSETEIKYMR NAARIVEAMH ARIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPL KKGEGTFFEI AGCYHRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VAKPGNTCED IANAFFAVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR PGDRTELKPG MTFHFMTGLW LDDMGFETTE SILITESGVE CFANVPRRLM VKD
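The sequences above are displayed in space-separated blocks of 10 letters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first two blocks of the gene sequence are used, as a short demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Join the whitespace-separated display blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence (demonstration only).
prefix = clean_sequence("ATGACCCAGC CCAACCTCAA")
print(len(prefix), gc_fraction(prefix))
```

Passing the full gene sequence to `clean_sequence` and comparing the resulting length and GC fraction with the listed 1182 bp and 62% is one way to confirm that a copy of the record is complete.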