Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4043 |
Symbol | |
ID | 6982814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4217813 |
End bp | 4218970 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643398773 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002283531 |
Protein GI | 209551614 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAGA ACACAGGCGA TACGGCAGCG CAGGGAGCCG GGTCAGGGCA GATTTCAGTA GAGGCGAGAA TAAGCGATCT TGTCCGGTTG GGCATCATCG GGCTTTTCGC CTATTGGACG ATCGTCTTGA TTGCTCCCTT CGCGCTGATC GTTATTTGGT CGGCCATTCT GGCGGTGGCG CTATTCCCGA TATTCCAAGC GCTCTGCAGG CTGCTTGGAA ACAGGCCCGT CATCGCAGCC AGTATCATCG TCGTCTTTTG CCTCGTCCTG ATCATTGCGC CCCTGGCTCT GGTCGCGGTC AATTTTGCCG ACACCGCGCA GGCATTGATC GGCAAGTTGC GGGTGGGGGA ATTCACGCTC CCCTCGGCGC CGGCCGCCAT TCGGGAGTGG CCCGTTGTCG GCGAGCGGCT CCATGACGCG TGGAATCAGA TTGCAAGCGA TCTGGCCGCA ACGATTATCA AGTTTCAGGC GCCCATTCGT GAAGTGACGG CCGTTATCGT CACAAAGCTT GCCTCGATCG GCGGCGGCGT GTTGAGCTTT GTCGTTTCGA TCATGCTTTC GGGAATATTT CTCACGCGGT CAGCACGCCT GGCGGCGGCC ATACAAGTGC TGGCAAACCG GATCGCCGGT GAAAAGGGTG TCGGCTTTGC CCGGCTGGCG GGAGCCACGG TGCGCAATGT ATCGCGGGGT GTCATCGGCG TTGCCTTCCT GCAGACATTG CTCTGCGGAT TGTGCTTTGC TTTCTTTGGC GTCCCGGCGC GTGGGGCGCT GACATTCGTG ATCTTCATGT TTTGCCTGAT GCAGCTGGGG CCTGGGCTCG TGCTTCTTCC CGTTATCATC TGGTCGTGGT TTTCGTGGTC GCCCGCCGCT GCTTTTGCCT TTACCGCCAT TACCGTGCCC ATCATGCTCA TCGACAACAT ATTGAAGCCC GTGCTGATGG CGCGGGGGCT CTCGACCCCG ATGCCGGTCA TCCTGATCGG AGTCATCGGC GGCACACTTT CCCACGGGCT GCTGGGCTTA TTTCTGGGGC CGGTCGTGCT CAGCGTCTTC TACGAGCTGC TGAAAGCCTG GGCCTGGCCC TCAGTCCAGA CCGCGTCGGA AAACAGCGGC CCAGCCAAGC TCGATGCTCT GCCGGAACGC ATCGAGCACA GGCAATGA
|
Protein sequence | MAENTGDTAA QGAGSGQISV EARISDLVRL GIIGLFAYWT IVLIAPFALI VIWSAILAVA LFPIFQALCR LLGNRPVIAA SIIVVFCLVL IIAPLALVAV NFADTAQALI GKLRVGEFTL PSAPAAIREW PVVGERLHDA WNQIASDLAA TIIKFQAPIR EVTAVIVTKL ASIGGGVLSF VVSIMLSGIF LTRSARLAAA IQVLANRIAG EKGVGFARLA GATVRNVSRG VIGVAFLQTL LCGLCFAFFG VPARGALTFV IFMFCLMQLG PGLVLLPVII WSWFSWSPAA AFAFTAITVP IMLIDNILKP VLMARGLSTP MPVILIGVIG GTLSHGLLGL FLGPVVLSVF YELLKAWAWP SVQTASENSG PAKLDALPER IEHRQ
|
| |