Gene Information |
Locus tag | Rleg2_4314 |
Symbol | |
ID | 6983088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4479878 |
End bp | 4481416 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643399042 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_002283798 |
Protein GI | 209551881 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
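The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which does not encode an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Rleg2_4314.
length_bp = gene_length(4479878, 4481416)
print(length_bp)                  # 1539
print(protein_length(length_bp))  # 512
```

Both results agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields reported in the record.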
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCATG CCCCGGCCCG CCGACTGATC GCCCGCGCAA TGTTCTGCGC CACGCTTGCC GGTTTTCCCG CAGCTCTCGC CGCCGGCTTC GATCTGCCGG CGAAGCGCAC CGACCTTTCC GAAGCCGATC TGAAACGCGT CGCCGACGTC ACCCGGCCGA CCAGGGATTT TTCCAAGGCC GAACAATATG AGGCGATGCA GGCGGGCGGC GCGACCTCGA TCGATCCGGT TACCGAAGAC AGTTTCTCGC ATATTTCGGC CAACATTCCC TTCGAGGAAG AGCAGAATTT CAAGCTCGGC AATGCGCTTT TCCGCAAGCT CTGGGTTTCC TCGCCCTCCT CGACGCAGGC CTCCGATGGG CTCGGCCCTC TCTTCAACGC CCGCTCCTGC ATGAGCTGTC ACGTCAATGA CGGCCGCGGC AAACCGCCGG AAGGGGGTCC AAGCGCCGTC TCGATGTTCC TGCGGCTTTC CCGCGCGGCC GCAACCCCGG AGGAAGAAAA GGCGATCGCG GGCGCGGATA TCCTCAATTT CCCCGATCCG GTCTATGGCC ATCAGCTGCA GGATCTTGCC GTTCCCGGCC TTGCCGCCGA AGGCCGGATG ACGATCCGCT ATGACGAGGA GACGGTGACG CTTGGTGACG GCGAAACCGT GTCTCTACGC CGCCCGCATT ATGCGGCGAC CAACCTGGCC TATGGACCGC TCGATGCGGC AACGACGATC TCGGCGCGTG TCGCCCCCGC GATGATCGGG CTCGGGCTGA TCGAGGCCAT TCCCGCGGCC GATATCCTTG CCCATGCCGA CCCTGAGGAT GCCGATGGCG ACGGCATCTC CGGCAAGGCG GCGATCGTCC GTGATCACCG CAGCGGCGAG ATCGCGCTCG GCCGCTTCGG CTGGAAGGCG CAGAACGCCA CGGTGCGCGA CCAGAATGCC GACGCCTTCG CCAACGATAT CGGTATCTCG ACACCCGACC ACCCGGACGC GCATGGCGAC TGCACCAAGG CGGAGGAGAA ATGCCTCGAT ATGCCGACCG GCGTGCAGAA ACGGCTGGGC GCGGAAGAAG CGCCAGGCCC CATTCTCGAC CTCGTGACCT TCTATTCCGA AAATCTTGCC GTTCCGGCGC GCCGCAAGGC GAGCTTCCCC GAGACGCTGA AGGGCAAACG GATTTTCTAC GAAACCGGCT GCATTTCCTG CCATGTGCCG AAATTCGTCA CCCGCCGGGA TTCACCCGAC AAGGCGCAGT CCTTCCAGCT GATCTGGCCC TATTCCGACT TCCTTCTGCA CGATATGGGC GACGGGCTTG CCGACGGGCA GCAGGTCGGT CTTGCAAGCG GACGTGAATG GCGCACGCCA CCGCTATGGG GTATAGGACT GACCCGGACT GTCAGCGGAC ACAGCTTTTT CCTGCATGAC GGCCGTGCGC GGGATCTCAC CGAGGCGATC CTCTGGCACG GCGGCGAAGC AGACAAGGCC CGCAACGCTT TCTCCTCCCT GTCGAAAGAC GACAGGAAGG CCCTGATTAC ATTCCTGGAG TCACTTTGA
Protein sequence | MSHAPARRLI ARAMFCATLA GFPAALAAGF DLPAKRTDLS EADLKRVADV TRPTRDFSKA EQYEAMQAGG ATSIDPVTED SFSHISANIP FEEEQNFKLG NALFRKLWVS SPSSTQASDG LGPLFNARSC MSCHVNDGRG KPPEGGPSAV SMFLRLSRAA ATPEEEKAIA GADILNFPDP VYGHQLQDLA VPGLAAEGRM TIRYDEETVT LGDGETVSLR RPHYAATNLA YGPLDAATTI SARVAPAMIG LGLIEAIPAA DILAHADPED ADGDGISGKA AIVRDHRSGE IALGRFGWKA QNATVRDQNA DAFANDIGIS TPDHPDAHGD CTKAEEKCLD MPTGVQKRLG AEEAPGPILD LVTFYSENLA VPARRKASFP ETLKGKRIFY ETGCISCHVP KFVTRRDSPD KAQSFQLIWP YSDFLLHDMG DGLADGQQVG LASGREWRTP PLWGIGLTRT VSGHSFFLHD GRARDLTEAI LWHGGEADKA RNAFSSLSKD DRKALITFLE SL