Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2851 |
Symbol | |
ID | 6981595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2901813 |
End bp | 2902997 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643397563 |
Product | protein of unknown function DUF1228 |
Protein accession | YP_002282347 |
Protein GI | 209550430 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0859226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAGC GCATCCGTCA TTCTCCGCCG AATCTCGTAT CCACTGCCGC CGCCGGCGCC GTTGCGATGG CGGCGGCCAT GGGGTTCGGG CGTTTCTCCT ACACGCCGAT CCTGCCCGGC ATGATGAGCG GGGTGCCGCT TTCTGCGGCG GATGCAGGTT TTATCGCTTC GGCGAATTTC GTCGGTTATC TCGTCGGCGC CGTGCTTGCC GCCTATGGCT GGGCGGCGGG GCGCGAACGG CTGGTGGCGC TGCTGGCACT GCTTGCCACC GCAATCCTGC TCGCCGCCAT GGCCGCCACA AATTCCGTCG CAGTCTTTGC CGTCATCCGC TTCCTGGCCG GCCTCGCCAG CGCTTTTGCG ATGGTTTTCA CCTCATCGAT CGTGCTCAGC CACGGGGCTG CCGCCGGCAA CGACCATGTG CAGGCAGCGC ATTTCGGCGG GCCGGGGGCG GGGATCGCGC TGTCCTCGAT CATGGTGTTT CTCATCGGCC TCGGCTTTCA CGGCGGCCAG GACAGCTGGC GCGCCGACTG GATCGGCGGC GCGCTCTACT GCGCGGCAAG CCTCGTCGTG GTCTTCCTGC TCTTGCCGTC GGCGCCGGCG CAATCGGCGC GTACGGGCAA GGAGCCGGCC CTCGTCTGGA GCCGGCCGAT GGTGCTGATC ACGCTGTCCT ACGGCCTGTT CGGCTTCGGC TATGTGATCA CCGCGACCTT TCTCGTCACC ATCGCCCGCC TGTCCTCCAC GGGTGCCTTC GTCGAATTTC TCTGCTGGTT CATTGCGGGC CTGACGTCGG CGGTGGCGCT ATTTGCCTGG AAGCCGCTGG TCAGGCCGCT CGGGCTCGGC GGCGTCTATG TCGCAGCACT TCTGGTCGAA GCCGCCGGCG TGCTTGCAAC GGTGATGCTG CCGCACTCTG CCGCACCGCT GATCGGCGGG GCGCTGTTCG GGGCGACCTT CCTGGCGATC ACCGCTTACG GCCTGCAGAT CGGCCGCAAA CTTTCCCCGG AGAGCCCGCG GCGGATCCTG GCGATGATGA CCGCAGCCTT CGGTGTCGGC CAGATCGTCG GGCCTGTCGT TGCCGGCTGG ATCGCCGAAC GCTCGGGCAG TTTCACCGTT CCGACGGTGA TTGCCGCCGC CGCACTTCTT GTCTGCGCGG CGCTTGTGAT GCCGGTGATC AAGAAAATCG CTTAA
|
Protein sequence | MLERIRHSPP NLVSTAAAGA VAMAAAMGFG RFSYTPILPG MMSGVPLSAA DAGFIASANF VGYLVGAVLA AYGWAAGRER LVALLALLAT AILLAAMAAT NSVAVFAVIR FLAGLASAFA MVFTSSIVLS HGAAAGNDHV QAAHFGGPGA GIALSSIMVF LIGLGFHGGQ DSWRADWIGG ALYCAASLVV VFLLLPSAPA QSARTGKEPA LVWSRPMVLI TLSYGLFGFG YVITATFLVT IARLSSTGAF VEFLCWFIAG LTSAVALFAW KPLVRPLGLG GVYVAALLVE AAGVLATVML PHSAAPLIGG ALFGATFLAI TAYGLQIGRK LSPESPRRIL AMMTAAFGVG QIVGPVVAGW IAERSGSFTV PTVIAAAALL VCAALVMPVI KKIA
|
| |