Gene Rleg_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0203 
Symbol 
ID8011432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp211311 
End bp213323 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content56% 
IMG OID644822796 
Productprotein of unknown function DUF87 
Protein accessionYP_002974053 
Protein GI241202957 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.187802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAG ATGACAGGAA GAGAGCGATT GGTAAAATCG TTTCGGTCGC TGCCGACCGG 
TTCGTAGTCG AGATGCATTC TGGAACGGAC AATTTCACCG TAGTCGGCTT CGACGACGTC
CATTACGTCG CTCGGCTCGG GTCCTTCCTA ATGGTTCCAG TCCAAGATGA ATACGTGGTC
GTCGAGGTTG TCGGCCTCCG CGAACGGGAT TCTGGAAATC CGGGTCACGG CAGCGGCGAC
ATGGACAAGG CAGCTTCAGC GAAATACCTA GACGTGGTGC CAGTCGGCAT GCTTCCGCAG
GTTCGCGGCG CTAAATTTCG CTTCGGCGTC TCAATCTATC CATCTCTCTA TGCAGACGCG
CTCTACGCGC TCGACGCAGA ACTCGACAGG GTTTTCGAGA CGGAAGTCGT TACCGAGCAG
CCGCCGGATC CCGGCGGCCA GGCGCTTCCC ACCCGATACA ACGCTCTCTC GATCGGGAGG
TCGGTCGTGT TCGAAGGTTA CGACGTAAAG GTCAAGATCG ACGAATTCTT TGGCGCACAC
ACAGCGGTTC TCGGAAACAC AGGCAGCGGG AAGTCCTGCA CGATCTCTTC GGTGCTGCAG
TCGCTATTTC AGAAACCGGA CGAACACCGC GCACGCGGCG CGACATTCAT CGTCTTTGAC
GTGAATGGCG AGTACTGGCA ATCGCTATCC CCGTTGGCAG CGGATGAAGG TATTGGGGTT
TCACGTCTGG TCCTTGATGG ATCCGCAGAA CCCGGCCGCT TCAGGCTCCC TCACTGGTTC
CTCGACCAGA CCGAATGGGA GCTGCTTCTC CAAGCGAGCG AGCGCACGCA GATGCCCGTG
CTAAGAACAG CTCTGGGTCT GACCGGATTA TTCAGAAAGG ACTCCCCGGA AGCCTTGCTC
GTCAAAGAGC ATTTCATGGC TAGATGCATC ATCGAGTGCT TTCGGGGAGC GGACGGCGAC
TCTCCCGTGT CGAAGTTCCA TCGCGTCGTC TCTCTCCTAC AGAGGTACCC GACCAAGGAT
CTTAACCTGG CACTGCTGCG CGCCTACGGC GCCAATTTCC AATTCGGGAA CTTTGCCAAC
AATAATCTCG TGCCGTTCCT TGAAAAGGTC GGAGAGAAGG TTCGGGAAGA GATCAAGCTT
CCCTCATACG ACCGCACCCC TTTTGCTTTC GATGATCTCG AGGAGTGTCT GGACTTTGCG
ATCCTCTATG AAGAGTCGCA TGGCAATCGG CAAATACGCG ACTACTGCTC GCAAATGGTG
ACCCGGCTGA AATCCTTGAA GGAGCGGTCC GACTTCCGAT TTCTGCGTCA CGAGCTACCA
ACCGAAGGCG ACGCACCGAC AACTGGAACG TTTCTTAAAA CATTGCTCGG CCTCCGTGAA
GCTGGCCCCG GCGGCAAGCT CATCAAAGAC GCCCAGATCG TCGTCGTCGA CATGAACGAC
GTCGAGGACG AGGTCGTCGA ACTCGTCTCA TCCGTTCTCG CGAGAATGAC TTTCAGGCTG
CTCAGGCAGG CCGACCCCAG GAACCGCTTC CCCGTACATC TTCTCCTTGA GGAAGCCCAC
CGATACATTT CTGAAACTCC GTCGCGATTT GCGATCGACG CTCACCGTAT ATACGAGAGG
ATTGCGAAAG AGGGCAGGAA GTACGGCCTC TTTCTTCTCG TCGCCTCTCA ACGACCGAGC
GAGCTGTCGA AGACGGTGCT TTCCCAGTGT TCTAACTTCG TAGTCCATCG TATCCAGAAT
CCTGATGACC TGTCCCAGAT CCGGCAAATG ACACCGTTCA TCTCTGACGC TGTCCTGAAG
CGCCTACCGT CGCTGCCAAA GCAACACGCT CTTGTATTTG GAACATCCGT CAATCTTCCC
ACTACGTTTC GCGTTCGCAA CGCTGATCCC TTGCCAAAAA GCGACGACGC CAAGATCCGA
GACCTATGGT TTCATGGAGC TGATCGAGCT GCCGGTATCA GCTTTTCTCA ATCATCAGCG
CCGCCAGGCC TGCTCGAGAG TGAAGCATGT TGA
 
Protein sequence
MSRDDRKRAI GKIVSVAADR FVVEMHSGTD NFTVVGFDDV HYVARLGSFL MVPVQDEYVV 
VEVVGLRERD SGNPGHGSGD MDKAASAKYL DVVPVGMLPQ VRGAKFRFGV SIYPSLYADA
LYALDAELDR VFETEVVTEQ PPDPGGQALP TRYNALSIGR SVVFEGYDVK VKIDEFFGAH
TAVLGNTGSG KSCTISSVLQ SLFQKPDEHR ARGATFIVFD VNGEYWQSLS PLAADEGIGV
SRLVLDGSAE PGRFRLPHWF LDQTEWELLL QASERTQMPV LRTALGLTGL FRKDSPEALL
VKEHFMARCI IECFRGADGD SPVSKFHRVV SLLQRYPTKD LNLALLRAYG ANFQFGNFAN
NNLVPFLEKV GEKVREEIKL PSYDRTPFAF DDLEECLDFA ILYEESHGNR QIRDYCSQMV
TRLKSLKERS DFRFLRHELP TEGDAPTTGT FLKTLLGLRE AGPGGKLIKD AQIVVVDMND
VEDEVVELVS SVLARMTFRL LRQADPRNRF PVHLLLEEAH RYISETPSRF AIDAHRIYER
IAKEGRKYGL FLLVASQRPS ELSKTVLSQC SNFVVHRIQN PDDLSQIRQM TPFISDAVLK
RLPSLPKQHA LVFGTSVNLP TTFRVRNADP LPKSDDAKIR DLWFHGADRA AGISFSQSSA
PPGLLESEAC