Gene Rleg_4577 details


Gene Information

Locus tag: Rleg_4577
Symbol:
ID: 8015328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4702881
End bp: 4704419
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 63%
IMG OID: 644827154
Product: protein of unknown function DUF1111
Protein accession: YP_002978354
Protein GI: 241207258
COG category: [C] Energy production and conversion
COG ID: [COG3488] Predicted thiol oxidoreductase
TIGRFAM ID:
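
The coordinates and lengths listed above agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends with a single stop codon (the record does not state either explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4702881, 4704419)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Under these assumptions, 1539 bp corresponds to 513 codons, of which the last is the stop codon, leaving the 512 amino acids reported in the record.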


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0615808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCATG CCCCGGCCCG CCGATTTTTC GCCAGCGTAG CGCTCTGCGC CACGATTGCC 
GGTTTTTCCG TCAGCATCGC CGCCGGTTTC GATCTGCCGC GGAAACGCAC CGACCTCTCC
GAGGCCGATC TGAAACGCGT CGCCGCCGTC ACCCGGCCGA CAGCGGATTT TTCCAAGGCC
GAACAATACG AAGCCATGCA GGCAGGGGCT ACGACCTCGA TAGACCCTGT CACCGAAGAC
AGCTTCTCGC ATATTTCGGC CAATATCCCC TTCGAGGAAG AGCAGAATTT CAAGCTCGGC
AACGCGCTCT TCCGCAAGCT CTGGGTGTCC GCTCCCTCCT CGACGCAGGC TTCCGATGGT
CTCGGGCCGC TGTTCAACGC CCGCTCCTGC ATGAGCTGCC ATGTCAATGA CGGCCGCGGC
AAACCACCGG AGGGAGGCCC GAGCGCCACC TCGATGTTCC TGCGGCTTTC CCGCGCCGCC
ACGACGCCGG AGGAAGAAAA GGCGGTCGCA AGTGCCGATG TCGTCAATTT TCCCGATCCG
GTCTACGGCC ATCAGCTGCA GGACCTTGCC GTTCCCGGCC TTGCTGCAGA AGGAAAGATG
GCGATCAGCT ACCAGGAAGA GAGGGTGACG CTCGGCGACG GCGAGACCGT ATCGCTGCGC
CTGCCGAGTT ATGCGGTGAC GAACCTCGGT TATGGACCAC TCCACCCCGC GACGACGATT
TCGCCGCGTG TCGCCTCGGC GATGATCGGC CTCGGACTGA TCGAGGCCAT TCCCGAGGCC
GATATCCTGG CCCATGCCGA TCCTGATGAT GCCGACGGCG ACGGCATCTC CGGCAAGGCA
GCAATCGTGC GCGACCACCG CAGCGGCAAG ATCGCGCTCG GACGATTCGG CTGGAAGGCA
CAGAACGCCA CGGTGCGCGA CCAGAGTGCC GATGCCTTCG CCAACGATAT CGGCATCTCG
ACGCCCGATC ACCCGGATGC GCAGGGCGAT TGCACCCGGG CCGAAGAGAA ATGCCGTGAT
ATGCCAACCG GCGTGCAGAA GCGGCTGGGC GCCGAAGAAG CGCCGGGGCC CATTCTCGAC
CTCGTCACCT TCTATTCCGG AAATCTTGCC GTTCCGGCGC GGCGCAAGGC GAGTTTCCCC
GAGACGCTGC AGGGCAAGCG GATCTTCTAC GAAAGCGGCT GTATTTCCTG CCATCTGCCG
AAATTCGTCA CCCGCCGGGA TACGCCGGAC AAGGCACAGT CCTTCCAGCT GATCTGGCCC
TATTCCGACT TTCTTCTGCA CGACATGGGC GACGGGCTTG CCGACGGGCA GCAGGTCGGC
CTTGCAAGCG GACGTGAATG GCGCACGCCG CCGCTATGGG GTATAGGACT GACCCGAACT
GTCAGCGGAC ACAGCTTTTT CCTGCATGAC GGCCGTGCGC GTGATCTCAC CGAAGCGATC
CTCTGGCATG GCGGCGAAGC TGAAAAGGCC CGCAACGCTT TCTCCTCCCT GCCGAAAGAC
GACAGGGCGG CCCTGATTAC ATTCCTGGAG TCACTTTGA
 
Protein sequence
MSHAPARRFF ASVALCATIA GFSVSIAAGF DLPRKRTDLS EADLKRVAAV TRPTADFSKA 
EQYEAMQAGA TTSIDPVTED SFSHISANIP FEEEQNFKLG NALFRKLWVS APSSTQASDG
LGPLFNARSC MSCHVNDGRG KPPEGGPSAT SMFLRLSRAA TTPEEEKAVA SADVVNFPDP
VYGHQLQDLA VPGLAAEGKM AISYQEERVT LGDGETVSLR LPSYAVTNLG YGPLHPATTI
SPRVASAMIG LGLIEAIPEA DILAHADPDD ADGDGISGKA AIVRDHRSGK IALGRFGWKA
QNATVRDQSA DAFANDIGIS TPDHPDAQGD CTRAEEKCRD MPTGVQKRLG AEEAPGPILD
LVTFYSGNLA VPARRKASFP ETLQGKRIFY ESGCISCHLP KFVTRRDTPD KAQSFQLIWP
YSDFLLHDMG DGLADGQQVG LASGREWRTP PLWGIGLTRT VSGHSFFLHD GRARDLTEAI
LWHGGEAEKA RNAFSSLPKD DRAALITFLE SL
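
The sequences above are displayed in blocks of 10 residues separated by spaces, so they need to be cleaned before any analysis. The following is a minimal Python sketch that strips the grouping whitespace and computes GC content. The helper names are hypothetical, and it is an assumption that the GC content reported in the record was computed over the gene sequence in this way.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record, as displayed.
first_line = "ATGTCGCATG CCCCGGCCCG CCGATTTTTC GCCAGCGTAG CGCTCTGCGC CACGATTGCC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(gc_percent(dna))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.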