Gene Rleg_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0201 
Symbol 
ID8011431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp206135 
End bp207463 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content59% 
IMG OID644822794 
Producthypothetical protein 
Protein accessionYP_002974051 
Protein GI241202955 
COG category[R] General function prediction only 
COG ID[COG3950] Predicted ATP-binding protein involved in virulence 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0853474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGC GTCTCGACAA GCTCTCGCTG ACAAATTTTC GGTGCTTCGC TAATTGCGAA 
ATCGAATTCC ATTCCGGTCT CACAGTTCTG GTCGCCCAGA ATGGGAGCGG TAAGACCGCG
GTGCTCGACG CCGCCGGCGC TGCTCTTTCG GTCTTCGTTA ACACGCTTTA TCCGCTCGAG
AAGATTTGGC GGATCGAGCG GAGCGATGTG CGCTTGATCC CCGGTCAGGA GCATAAAATG
TCTCCCTGCC TCCCAGTCGA ATACGAAGCG CAAGCGACCG TTCAGGCCAC TTCAGTAACT
TGGAGGAGCG CCGTCAGAAC GTACGGTGAC AAAGTACGCC CGAGCACCCG GAACCTCGGC
CCGATAAGTA TGGCTGCACA ACCATTCTTA TCCGATACGG CGGTACTTCC GCTCATCGCC
TATTATGGAA CGGGCCGCCT CTGGAACGAG CAGCGCCAGA CCGAATATCG CCGTTCCTCC
GTTACGAATG TTGATGAGCG CGTAGCCGGA TATGCCGACT GCCTCACCTC GTCTTCGTCC
TTCAAGGGCA TTTCGGCGTG GTTTGAGCAT CGCTTCAGGC AGACGGCGTC GCCGTTATAT
CGGGAGAGCC TGCAAACTAA TCTCGCGATG ATCGAAGGTG TGAAGACCGC CACCGACACC
GTTCTGCAGC CAACCGGCTG GTCGAATCTT CGTTGGGATG ACGAGCTTCA TACTCTGACA
GCCAAGCACG ACGGGCGCGG CGAACTGCCG TTGTCCATGC TGAGCGATGG TGTCCGCACC
ATGCTTGCGC TTGTCGCAGA CGTCGCGCGG CGTTGCGCTA GTCTCAATCC ACAACTGAGC
GATCAGGCCG CGCTCAAGAC TCCTGGCGTC CTCATCGTAG ATGAAGTGGA TATGCATCTA
CATCCAAGCT GGCAGCAGCA GGTTCTTGGC TTGCTGCAAA GCGCGTTTCC TGCCCTACAG
ATCATCGTTA GCACGCATAG CCCGCATGTG CTGTCGACGG TCGAAAAATC ATCGATCCGT
GTCCTTCACG TCAAGAACGG GGACGTAGTG GTTGAGACGC CTTTGGTTCA GACCCGCGGG
GTCGAGAGTG CGGATGTCCT CGCTACCGTA ATGGACGTGG ACCCCGTTCC AGCGCTTGAG
GAATCGACCG CTTTAAGCGC CTATCGTAAG CTCATCGAGG CGGGCGAAGC CGAGGGCCAG
GAAGCGTCAG CCCTGCGGTA TCGCCTGAAC GCGCACTACG GCGAAAGCCA TCCGGTCATG
CTCGAGGCGG ATCGCCTCAT CCGTTTCCAA CGGTTCCGGC TCGCCAAGAG CCGGCCGGAG
AGCGCGTGA
 
Protein sequence
MALRLDKLSL TNFRCFANCE IEFHSGLTVL VAQNGSGKTA VLDAAGAALS VFVNTLYPLE 
KIWRIERSDV RLIPGQEHKM SPCLPVEYEA QATVQATSVT WRSAVRTYGD KVRPSTRNLG
PISMAAQPFL SDTAVLPLIA YYGTGRLWNE QRQTEYRRSS VTNVDERVAG YADCLTSSSS
FKGISAWFEH RFRQTASPLY RESLQTNLAM IEGVKTATDT VLQPTGWSNL RWDDELHTLT
AKHDGRGELP LSMLSDGVRT MLALVADVAR RCASLNPQLS DQAALKTPGV LIVDEVDMHL
HPSWQQQVLG LLQSAFPALQ IIVSTHSPHV LSTVEKSSIR VLHVKNGDVV VETPLVQTRG
VESADVLATV MDVDPVPALE ESTALSAYRK LIEAGEAEGQ EASALRYRLN AHYGESHPVM
LEADRLIRFQ RFRLAKSRPE SA