Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4577 |
Symbol | |
ID | 8015328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4702881 |
End bp | 4704419 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644827154 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_002978354 |
Protein GI | 241207258 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0615808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCATG CCCCGGCCCG CCGATTTTTC GCCAGCGTAG CGCTCTGCGC CACGATTGCC GGTTTTTCCG TCAGCATCGC CGCCGGTTTC GATCTGCCGC GGAAACGCAC CGACCTCTCC GAGGCCGATC TGAAACGCGT CGCCGCCGTC ACCCGGCCGA CAGCGGATTT TTCCAAGGCC GAACAATACG AAGCCATGCA GGCAGGGGCT ACGACCTCGA TAGACCCTGT CACCGAAGAC AGCTTCTCGC ATATTTCGGC CAATATCCCC TTCGAGGAAG AGCAGAATTT CAAGCTCGGC AACGCGCTCT TCCGCAAGCT CTGGGTGTCC GCTCCCTCCT CGACGCAGGC TTCCGATGGT CTCGGGCCGC TGTTCAACGC CCGCTCCTGC ATGAGCTGCC ATGTCAATGA CGGCCGCGGC AAACCACCGG AGGGAGGCCC GAGCGCCACC TCGATGTTCC TGCGGCTTTC CCGCGCCGCC ACGACGCCGG AGGAAGAAAA GGCGGTCGCA AGTGCCGATG TCGTCAATTT TCCCGATCCG GTCTACGGCC ATCAGCTGCA GGACCTTGCC GTTCCCGGCC TTGCTGCAGA AGGAAAGATG GCGATCAGCT ACCAGGAAGA GAGGGTGACG CTCGGCGACG GCGAGACCGT ATCGCTGCGC CTGCCGAGTT ATGCGGTGAC GAACCTCGGT TATGGACCAC TCCACCCCGC GACGACGATT TCGCCGCGTG TCGCCTCGGC GATGATCGGC CTCGGACTGA TCGAGGCCAT TCCCGAGGCC GATATCCTGG CCCATGCCGA TCCTGATGAT GCCGACGGCG ACGGCATCTC CGGCAAGGCA GCAATCGTGC GCGACCACCG CAGCGGCAAG ATCGCGCTCG GACGATTCGG CTGGAAGGCA CAGAACGCCA CGGTGCGCGA CCAGAGTGCC GATGCCTTCG CCAACGATAT CGGCATCTCG ACGCCCGATC ACCCGGATGC GCAGGGCGAT TGCACCCGGG CCGAAGAGAA ATGCCGTGAT ATGCCAACCG GCGTGCAGAA GCGGCTGGGC GCCGAAGAAG CGCCGGGGCC CATTCTCGAC CTCGTCACCT TCTATTCCGG AAATCTTGCC GTTCCGGCGC GGCGCAAGGC GAGTTTCCCC GAGACGCTGC AGGGCAAGCG GATCTTCTAC GAAAGCGGCT GTATTTCCTG CCATCTGCCG AAATTCGTCA CCCGCCGGGA TACGCCGGAC AAGGCACAGT CCTTCCAGCT GATCTGGCCC TATTCCGACT TTCTTCTGCA CGACATGGGC GACGGGCTTG CCGACGGGCA GCAGGTCGGC CTTGCAAGCG GACGTGAATG GCGCACGCCG CCGCTATGGG GTATAGGACT GACCCGAACT GTCAGCGGAC ACAGCTTTTT CCTGCATGAC GGCCGTGCGC GTGATCTCAC CGAAGCGATC CTCTGGCATG GCGGCGAAGC TGAAAAGGCC CGCAACGCTT TCTCCTCCCT GCCGAAAGAC GACAGGGCGG CCCTGATTAC ATTCCTGGAG TCACTTTGA
|
Protein sequence | MSHAPARRFF ASVALCATIA GFSVSIAAGF DLPRKRTDLS EADLKRVAAV TRPTADFSKA EQYEAMQAGA TTSIDPVTED SFSHISANIP FEEEQNFKLG NALFRKLWVS APSSTQASDG LGPLFNARSC MSCHVNDGRG KPPEGGPSAT SMFLRLSRAA TTPEEEKAVA SADVVNFPDP VYGHQLQDLA VPGLAAEGKM AISYQEERVT LGDGETVSLR LPSYAVTNLG YGPLHPATTI SPRVASAMIG LGLIEAIPEA DILAHADPDD ADGDGISGKA AIVRDHRSGK IALGRFGWKA QNATVRDQSA DAFANDIGIS TPDHPDAQGD CTRAEEKCRD MPTGVQKRLG AEEAPGPILD LVTFYSGNLA VPARRKASFP ETLQGKRIFY ESGCISCHLP KFVTRRDTPD KAQSFQLIWP YSDFLLHDMG DGLADGQQVG LASGREWRTP PLWGIGLTRT VSGHSFFLHD GRARDLTEAI LWHGGEAEKA RNAFSSLPKD DRAALITFLE SL
|
| |