Gene Rleg2_4047 details

Gene Information

Locus tag: Rleg2_4047
Symbol:
ID: 6982818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4221410
End bp: 4223218
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 61%
IMG OID: 643398777
Product: hypothetical protein
Protein accession: YP_002283535
Protein GI: 209551618
COG category: [S] Function unknown
COG ID: [COG5616] Predicted integral membrane protein
TIGRFAM ID:
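
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4221410
end_bp = 4223218

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed to be a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1809 bp and protein length of 602 aa.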


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.136047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGATC TTGTCGTGGG GGCGATCATG AGAAACGCGA CAAACGTCAA TCTATTCAGC 
AGTCTTGGGC CGACGGCTGA GGAAATCTGC CGGCAGGCCG AGAGAATTCT CGCAAGTGAA
GAATTTCACG CGCCGCAGCG CGGCCGAAAT TTTCTGGAGT TTGTCGTCAA CGAGACGCTC
GCCGGCCGAT CTGGCTTCCT GAAGGCGTTC ACCATCGCAA ATGTGGTTTT TGGCAGGGAA
GCGTCCTTCG ATCCGCAGAA TGATCCGGTC GTCCGGATCG AGGCCGGCCG GATACGAAAG
GCCCTGGAAC GGTATTATCT CGTCGCGGGC CAGGCGGACG AGGTCATTAT TACGATGCCG
AAAGGCGGAT ATGTCCCGCA TTTCGAATAT GTCCGCGATG CGGCGATGGC GCCGCCGCTG
AACGAGCCGG AAAATGCTCG GATCGCAGAC CTCGCGCATC AAGCTCATCC GCTTCCCGAG
GCGCACCCTC CGACGCTGCC GGCGGGTCGC GGCCGCGGGA TCGCTCTACC GGCCATCTTC
GTTCTTCTTC TCCTCGCATT GGTGTCTGCA CTTTTTATTG CCCGAAGTGT CCCGGTGCGA
GCGCCGCCGC CGGCAGCCGC CGTGCCGACG GTTGCCGTCG AGGTGTTCGC CGAAAGCAGC
TCCGTCGATT CCAGGGCGGA TATCGCCCGC GGCCTGAGGG ATGATATTAT CGGCCAGCTT
GCTGAGCGCG ATGAGGTCAT CGTCGTCGCC GATCCGTCGA CGGGTGATCG TGCCGTTGCT
GCCGACTACG CGCTGCAAGG CAACATCCAG ATGGACGGCA GCAGGTTACG TTCGGTCGCC
AGGCTGGTGC GCCAAAGGGA CGGGGTCGTC ATCTGGGCCG ATAATTTCGA TGCCGATTTT
CGCGCTCAAA ACAAGCTTGG AATTCAAGCA AACGTCGCCC GGCAAATCGC CGGTGCGATA
GCGCAGCCCC ATGGCGCGAT CTTTCAGGCC GAGCCAGCGA TGATCGCGCG GTCGGGCCTA
AAGGCAGACC AAAATGCTGA AGCCTGCACC CCGGCCTATA ACAGCTATCT CCAGACGATG
ACTGCGCAAA GCCATGGTGT CGTGCGCGAG TGCCTGCGAC AGGCAACCCA GCGTAACCCT
GATAGCGCGA CGTCCTGGGC GCTCTTGTCC CTTGTCTATC TTGACGAAGT GCGATTTCGC
TACAGGCTCG GCACCCCATC TTCGGCCGAA CCTCTCGAAT TGGCGAATGC TTCAGCGCAA
CGGGCGGCAT CGCTCGCGCC GGACAATACC CGCGTGCTCC GCGCCGTTAT GCTGGTGAAT
TTCTTCCGGG GAGATATCGA CAAAGCACTG GCGGCTGGAA CGGCGGCCTA CGCTGCAGAT
CCCGACGATG TGGAAGTTGC GGGCGAGTAC GGTTTGCGTT TGGCGATGGC GGGGAAATGG
CAATCGGGCT GCGAGTTGGT TTCGATCGCA TTCGACAAAA ATGTCGGCCC CAGAGGCTAT
TACGCGGGCG GCATGGCGAT GTGCGCCTTT ATGCGGGGCG ATATCGACGC GGCGGAACAA
TGGTCGAGAA TATCCGATCT CGACTTCAAT CCCATGCGCC ACCTTGCCCT GCTGTCCATT
CTCGGGGCAG CGGGCAAAAC GGCTGAGGCC AAGCTGGAGC AGGATTGGCT CCTTGCCAAC
GCGCCGGCAT TGATGACGAA CATCCGCCAG GAAATTTCCC TACGCTTGCA GCGGCCGGAG
GATCAGGAAA GGGTCCTTGC GGGACTGCGG GCCGCAGGCG TCGCCATTGA CCTGCCGTCG
GGAGGATAA
 
Protein sequence
MRDLVVGAIM RNATNVNLFS SLGPTAEEIC RQAERILASE EFHAPQRGRN FLEFVVNETL 
AGRSGFLKAF TIANVVFGRE ASFDPQNDPV VRIEAGRIRK ALERYYLVAG QADEVIITMP
KGGYVPHFEY VRDAAMAPPL NEPENARIAD LAHQAHPLPE AHPPTLPAGR GRGIALPAIF
VLLLLALVSA LFIARSVPVR APPPAAAVPT VAVEVFAESS SVDSRADIAR GLRDDIIGQL
AERDEVIVVA DPSTGDRAVA ADYALQGNIQ MDGSRLRSVA RLVRQRDGVV IWADNFDADF
RAQNKLGIQA NVARQIAGAI AQPHGAIFQA EPAMIARSGL KADQNAEACT PAYNSYLQTM
TAQSHGVVRE CLRQATQRNP DSATSWALLS LVYLDEVRFR YRLGTPSSAE PLELANASAQ
RAASLAPDNT RVLRAVMLVN FFRGDIDKAL AAGTAAYAAD PDDVEVAGEY GLRLAMAGKW
QSGCELVSIA FDKNVGPRGY YAGGMAMCAF MRGDIDAAEQ WSRISDLDFN PMRHLALLSI
LGAAGKTAEA KLEQDWLLAN APALMTNIRQ EISLRLQRPE DQERVLAGLR AAGVAIDLPS
GG
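
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 is assumed to share (the two tables differ in permitted start codons, not in codon meanings).

```python
from itertools import product

# Standard codon-to-amino-acid assignment, in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Remove the spaces and line breaks used in the listing; upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, in the block layout of the listing.
print(translate("ATGCGTGATC TTGTCGTGGG G"))  # → MRDLVVG
```

The short example reproduces the first seven residues of the protein sequence listed above. Passing the complete gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.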