Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5522 |
Symbol | |
ID | 6978616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1170839 |
End bp | 1171846 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394621 |
Product | ectoine utilization protein EutE |
Protein accession | YP_002279439 |
Protein GI | 209547521 |
COG category | [R] General function prediction only |
COG ID | [COG3608] Predicted deacylase |
TIGRFAM ID | [TIGR02994] ectoine utilization protein EutE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00516127 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAGA TTGACTTGCG GCCGTCGCCG ATCAGCGCGA CGGTGGATTT CGCCGCCGAG GGCGTCCAGC ACGGTTTCCT GAGGCTGCCT TACAGCCGCG ACGATTCCGC CTGGGGTTCG GTGATGATCC CGATAACGGT CGTCAGGAAC GGCAAGGGAC CGACGGCGCT GCTGACCGGC GGCAATCATG GCGACGAGTA TGAAGGACCG ATCGCGCTTT TCGACCTTGC CCGTTCGCTG AAGGGCGAGG AGGTGAGCGG CGCTGTCATT GTCGTGCCGG CGATGAATTA TCCGGCATTC CTGGCGGGAA CCCGAACCTC GCCGATCGAC AGGGGCAATA TGAACCGCAG CTTTCCGGGC CAGCCGGACG GCACGGTGAC GCAAAAGATC GCCGACTATT TCCAGCGCGT GCTCCTGCCG ATGGCTGATC TGGTTCTCGA TTTCCATTCC GGCGGCAAGA CGCTCGATTT TCTCCCGTTC TGCGCAGCCC ATATCCTGTC GAACAAGCAA CAGGAAGCGA AGGCTTTCGA TTTCGTCACG GCCTTTGCCG CACCCTATTC GATGAAGATG CTGGAGATCG ATGCAGTGGG CATGTACGAC ACTGCCGCCG AGGAGATGGG CAAGGTCTTC ATCACCACGG AACTCGGCGG CGGCGGGACG GCTACGGCCA AGAGTGCGGC GATTGCCAAG CGCGGCACCA TGAACGTGCT GCGCCACGCC GGGATCGTTG CGGGCGCCGC CGATATCGGT CCGACCACCT GGCTCGACAT GCCGGACGGC CGGTGTTTTT CCTTCGCTGA GGAGGGCGGG TTGATCGAGC CCGTCATCGA TCTCGGTGAA GCCGTCGGTA AGGATGCGGT CATCGCTCGC ATCTATCCGA CCGGGCGGAC CGGAGTGGCC CCCCACGAGG TCCGCGCCGG CATGGATGGC ATCCTCTGCG CCCGGCATTT TCCCGGACTG GTCAAGTCAG GCGATTGCGT CGCCGTGGTC GCGATCGTTA CCGGCTGA
|
Protein sequence | MTEIDLRPSP ISATVDFAAE GVQHGFLRLP YSRDDSAWGS VMIPITVVRN GKGPTALLTG GNHGDEYEGP IALFDLARSL KGEEVSGAVI VVPAMNYPAF LAGTRTSPID RGNMNRSFPG QPDGTVTQKI ADYFQRVLLP MADLVLDFHS GGKTLDFLPF CAAHILSNKQ QEAKAFDFVT AFAAPYSMKM LEIDAVGMYD TAAEEMGKVF ITTELGGGGT ATAKSAAIAK RGTMNVLRHA GIVAGAADIG PTTWLDMPDG RCFSFAEEGG LIEPVIDLGE AVGKDAVIAR IYPTGRTGVA PHEVRAGMDG ILCARHFPGL VKSGDCVAVV AIVTG
|
| |