Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0942 |
Symbol | |
ID | 8012087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 928172 |
End bp | 929446 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644823526 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002974777 |
Protein GI | 241203681 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.429612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCTGG AAATTGGAAT TGTGGCGTTT CTCACCATCC TGAATGGTGT GCTCGCCATG TCGGAGCTGG CCGTCGTGTA TTCTCGAACA GCTCGCCTAA AGGTCCTCTC CGACAATGGA AGCAAGGGTG CAGCTCAAGC GATCAAACTT GCTGAAAACC CTGGTCGTTT TCTCTCAACG GTGCAGATCG GCATCACGCT GGTCGGCGTT CTATCCGGCG CTTTCTCGGG GGCCACGCTC GGCGGCCGCC TGAGCGGATG GCTAGAAGCC CAGGGAATGT CATCGACGGC CGCTGATGCC ATTGGCGTAG GTTCAGTCGT CGTGGCAATC ACATATCTTT CGTTGATCGT CGGCGAACTT GTTCCAAAGC AGATCGCATT GCGGGAACCC GAAGCGGTTG CGGCCAGGGT CGCACCCGCT ATGGCGGTCC TTTCAAAAAT TGCGCTGCCA CTCGTGTGGC TTCTAAACGC CTCCGGAAAC CTTGTGCTCA AACTCTTGGG CCAAACAGGA AAAGCTGGCG AAAATGTCTC TGACGCAGAA ATCAAAACTG TTCTGGCCGA GGCGCAGTCG GCTGGAGTGA TCGAAAGCGA AGAGTCCGCG ATGATATCAG GTGTCATGCG GCTGGCGGAT CGCACTGCCC GAGCGCTTAT GACGCCCCGA CGGGACGTCG AAATTATTGA TATCGACGAC AGCCTTGATG AAATTCGGAC CCAGTTGCAC AGGACGAAGC GGTCGCGGTT GCCCGTTCGA AAAGGCAGTT CGGACGAGGT GATCGGCATC CTTCCGGTCA AGGACTTCTA CGACGCGATG TCCGAACACG GCAGCGCCGA CATCAAGGCT CTGACGCAAG ACGTCCCGGT GGTTTCAGAC CTTTCAACTG CCATCAATGT TATTGAAGCC ATCAGGAAAT CGCCCGTTCA CATGGTGCTG GTTTTCGACG AGTACGGCCA CTTCGAGGGG GTTGTCTCGT CAGGTGACAT TTTGGAAGCA ATCATGGGGG CTCTGCAGGA GGGACCGGTC GATGAACAGG CCATCGCTCG GCGAGACGAC GGCTCTTATC TCGTGTCGGG CTGGACGCCA ATTGACGAGT TCGCTGAATT CTTAAACCTC AAGCTCGATG GCGACTTGGA ATATCAGACT GTCGCCGGCC TGGTGTTGGA AGAGTTGAAA CATCTGCCGG AATTGGGCGA GAGCTTCACG AGAGATGGAT GGCGCTTCGA AGTCGTCGAT CTCGACGGGA GGCGCGTGGA CAAAATACTT GTGTCGGCTG AGTGA
|
Protein sequence | MFLEIGIVAF LTILNGVLAM SELAVVYSRT ARLKVLSDNG SKGAAQAIKL AENPGRFLST VQIGITLVGV LSGAFSGATL GGRLSGWLEA QGMSSTAADA IGVGSVVVAI TYLSLIVGEL VPKQIALREP EAVAARVAPA MAVLSKIALP LVWLLNASGN LVLKLLGQTG KAGENVSDAE IKTVLAEAQS AGVIESEESA MISGVMRLAD RTARALMTPR RDVEIIDIDD SLDEIRTQLH RTKRSRLPVR KGSSDEVIGI LPVKDFYDAM SEHGSADIKA LTQDVPVVSD LSTAINVIEA IRKSPVHMVL VFDEYGHFEG VVSSGDILEA IMGALQEGPV DEQAIARRDD GSYLVSGWTP IDEFAEFLNL KLDGDLEYQT VAGLVLEELK HLPELGESFT RDGWRFEVVD LDGRRVDKIL VSAE
|
| |