Gene Rleg2_2820 details


Gene Information

Locus tag: Rleg2_2820
Symbol:
ID: 6981564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2868230
End bp: 2869360
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 62%
IMG OID: 643397532
Product: hypothetical protein
Protein accession: YP_002282316
Protein GI: 209550399
COG category: [S] Function unknown
COG ID: [COG5345] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
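
The coordinate and length fields above agree with each other, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2868230, 2869360)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both values match the Gene Length and Protein Length fields listed in the record.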


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0366551
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGACC GGATAGCCGG TTTTTTCAGG CTGATCGGCC AGACGATCGG TCGCTGGGCC 
CGCTTGTTTT CCGCCTGGGC CTTCTGGCCC TTCCTCGCCG CGCATGGCTG GTATCAGCGC
CGGAGTTGGA TGATCCGGCT TCCGGTTATC GCGTTCGTGG CGCTCTTCGT CGTGCTCTAC
GGCTATTTCT TCTGGCAGAC GCAGGTCTGG TCGAATTTCA ATACCGCCTT TGTCGACCAA
TACCGGCTTT CCGAACGCAA GGTTGCTGCC GGGCAGGACG TACCGGTCGC GGAGGGAGGC
AACAGCTCAG CCGCCAAGAC CTGCCAGCGT TCGGCCATCG TCGACGTCAC GGCCGACCTG
ACCGACTTCA ACGTCAACCA GAACGCATGG ATCTCCTCGA TGCTGCTCTA TAAAATGGGC
TTTTTCGGCA TCGACTGGGA TCACACGCCC TTCCTCGACA ACAAGGCGTC GTTCCAGCGC
GGCATCAACC AGGCGGTCCG GCGGACGTCG GCGGAGCTTG TCGATACGCT GGGGCGGGTG
CGCGGCACGT CGGGCATCAA CAACGATCTG CAGAGCGCCC GCGGTAACCT GCAGTTCGAC
GAATACAGCT GGTATTTCGG GCTCAATCCC TTCGGGCCGA AGACGCCGAC GCCCTCTTAT
TACCGCTCGG CGATCGGCAG CCTGCGCAAG TTCAACACCG ATCTTTCCGC CTGCAGCGTC
ATTTTCGACG GCCGCGCCGA CAACCTCATG CAGTTCATCG ACCGCATCGC CAACGATCTC
GGTGGTACGT CCGACATGCT CGCCGAACGC TCGGAAAATC ACAATCGCGG TTGGTTCGAT
ACGCGCGCCG ACGACCGGTT CTGGTTTGCC TACGGCCAGC TCTACGCCTA CTACGCCATC
CTTGCCGCCG CGCAGGCGGA TTTCTCGCAA GTGGTGCAGG AGCGCAATCT CGGAGCGGTC
TGGGGCAGCA CGATGCGGCA GTTCCAAGCG GCGCTGCGTA TTCAGCCGGC GATCATCTCG
AACGGGCGCG AGGACGGCTG GATCATGCCG AGCCATCTCG CCACCATGGG CTTCTATATT
CTGAGGGTGC GCTCGAACCT CGTCGAGATC CGCTCGGTGC TCGACCGCTA G
 
Protein sequence
MLDRIAGFFR LIGQTIGRWA RLFSAWAFWP FLAAHGWYQR RSWMIRLPVI AFVALFVVLY 
GYFFWQTQVW SNFNTAFVDQ YRLSERKVAA GQDVPVAEGG NSSAAKTCQR SAIVDVTADL
TDFNVNQNAW ISSMLLYKMG FFGIDWDHTP FLDNKASFQR GINQAVRRTS AELVDTLGRV
RGTSGINNDL QSARGNLQFD EYSWYFGLNP FGPKTPTPSY YRSAIGSLRK FNTDLSACSV
IFDGRADNLM QFIDRIANDL GGTSDMLAER SENHNRGWFD TRADDRFWFA YGQLYAYYAI
LAAAQADFSQ VVQERNLGAV WGSTMRQFQA ALRIQPAIIS NGREDGWIMP SHLATMGFYI
LRVRSNLVEI RSVLDR
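
The gene and protein sequences above are printed in blocks of ten characters. The following minimal Python sketch assumes the spaces and line breaks are display formatting only. It removes them, computes GC content, and translates a coding sequence. The codon table is built from the NCBI amino-acid string for translation table 11, which assigns the same amino acids as the standard code. The helper names are hypothetical and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11; '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First twelve bases of the gene sequence as printed in the record.
cds_start = clean_sequence("ATGCTTGACC GG")
print(translate(cds_start))  # MLDR
```

The four residues produced from the first twelve bases match the start of the listed protein sequence. Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared with the GC content field.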