Gene Rleg_2045 details

Gene Information

Locus tag: Rleg_2045
Symbol:
ID: 8013076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2039314
End bp: 2041911
Gene Length: 2598 bp
Protein Length: 865 aa
Translation table: 11
GC content: 57%
IMG OID: 644824631
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_002975862
Protein GI: 241204766
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCT TCTATCTCGC GACCGCCTCA CTTTTTCTCG TCATCTCCAT TGCCAATTCT 
CCCGCAATTG CTGCAAGCGC AGATGACCCC ACATCAGAAG CCAAGTGCCG CCATCTGGCG
CGTGATGGCC TAGATATCGA CAATCTCGCT TATTGGCGGA ATGCGCTTAC CGCTTGCGAA
GTCTATTTGC AGGCAGAACC AGGCTCCATC GACGCCCGCT ATTTTTCGCT CGCCAGCAAA
TTCAAACTTC AGCAATTCGA GCCGGCACTT GCCGGATTCG AGGAGTTGGC GGCCCAGGGA
CACGCACCCT CCATGGATTA CCTGGCGGAG GCAGCTGCTT ACGGCCTCGG GCGTGAGCGC
GATCTCAATG CCGCGCTGTC CTGGCTGAGA AAGGCAAGAG CAACAAACGA TCCGCGCTCC
GCCGAATATC TCGGAGAGAT GTACATGTTG GGCTGGGGCG TTGCGCAGGA CTTTGTCATG
GCGCGCTCCT ATTTCGAACT TGCAGATCGG CTAGGCTATA TTCCTGCCAA GAGATCACTT
GCTTGGATGT ACCTGGACGG GATTGGTGTT CCCGTTAACG CAGAGCGGGG TTTTGCACTG
CTTTCAGCCG CAGCAGAACA GGGCCATGAG AGAGCGACAA CGGACATCGC ATTCCTTTAC
ACGAAAGGCA TTGGAACGGA GAAGAACGCC AGCCGCGCCG TTGCCATGCT GGACGACCTG
GTGCGGAAAG GTTCCGCCGA TGGGATGAGG ACGCTTGCTC GATACTATCT GAGAGATGGG
GACAAGACGC AGAGAAAGAC GGCGCTCGAT CTCCTTGAGA AAGCAGTGGC ACTTGGCGAC
GGCAATGCGA TGAGGACGCT GGCAGACGTC TACGTGGCGG GCAGGGAGAC GAAGCGTGAC
TTCGCAAAGG CCGAGAGCCT GCTGGATCAG GCAATTGCGC AAGACATACT CGCTGCCTAT
CGTAGCAAGG CTTTGCTTTT ATTGAAGATG CCCGATCCGC GCTATGACCA GGCCATGTAC
TGGATGAAGG AAGCTGCCTC TCGTGGGAAT GCCCGTGCCA TGGCAGATGT GGGCGCCATG
TATGAGCAAG GCAAAGGTGT GCCGGTAGAC GAGAGCGAAG CCCGCATCTG GTATGGAAAA
TCTGCCGAAC TCGGAGATTC CAGCGGCGCC CGACGTTTCG GTCAATCGCT CTATAGGGGA
ACGGGTGGTC CTCGCGACGT CGAAAACGGC GTGAAATGGC TGGAGAAATC TGCGGCCGCC
GGCGATACAG ACGCCATGCG GGTGCTTGCC TATGGTTATG AAAATGGCGG CGGCCTTCCC
AAGGATGTGG TCAAAGCTTT CGAATGGTTC CAGAAGGCGG CGGAGGCAGG AGATACCGAC
GCATTTCTCG AAGTTGCGGA CCGCACCTAT GACGGTTCGG GTACGACTGC CGACGCCCAA
AAATCCTTTG TCTGGTATCT CAAGGCCGCC GAAAATGGTT CGGCCAAGGC CCAGTATTGT
GTTGGGCTTT TGTATGAAAG AGGGGAGGGT ACGACTCAGG ATTCGAGAGA AGCCGTACAC
TGGTTCAAGG CCGCCGCAGA GAACGGATAC ATCGACGCCT TCGCCGAGCT TGGCCAAATG
TACGCCAATG ATGCCAATCT GCCGCGGGAC GACGGCAAGT CGATCGACTA TCTGGAAAAG
GGCGCCGCCG CCGGTAACGT CACGGCGATG GTGCTGCTCG GCATCAAATA TGAAGATGGC
GATGGTGTCG CCCGCGACTA TGCGAAAGCG ATCGAGTGGT ACGAAGCGGC GGCGAACAAG
AAATCTGCTG ACGCGATGTA CAGGCTCGGG ACGCTCTATC GAGGAGCGGA TGGAAGCCAG
AAAGATTTCG GCAAAGCTCT GGAATGGCTG ACGAAGGCCG GGGCCCATGG CAATGCGAAA
GCGCAATACG CCCTGGGCGA TATCTACGAA TACGGGCAGG GCGTGCCCAT TGATCGCTCC
AAGGCGCTCA GCTGGTTCAT GATGGCGGCC CTCAAGCAAT ATCCTGAAGC GATGAACGCC
GTCGGCTACT ACTACCAGAA TGGCATCGGC ACCAAAGAGG ACCAGACCAT CGCCCGCAAC
TGGTTTCAGA AGGCGGCCGA TGCGGGATCT GCGGCCGGCG CTCTTAATCT TGCGTGGTAT
TACGAGAACG GCAAAGATCA AGATCAGGCT ATCGCATTCC AATATTATAA GAAATCCGCC
GAGCTGAATT CGGCCGGCGG CATGTTCGCG CTCGGGCGTT TCTACGACGA TGGCCTCGGA
ATTGCTGTCA ACCGGCAAGA AGCCATCAAG TGGTATCTGC GGGCGATGGA CACTGGCTAC
AATAGAGCCG CCTATCGGCT GGCCTATGCC TATGATGCGA CCTTCAGTTC GGATAATGCA
GCCGAAAACA TCCTTCTGGC AATGCGGCAG GGAGACCGGC AGATTGCCGA TGAGCTGAAG
GCGTTTTCGC CTTTCACCCG CACGGCACTG AAAAGAAGCC TCTTGCGACG CGGCCTTTAC
GATGGGCCGC TGGACGATGA GATCGACCAG CCCTTGAAGT CGGCACTCAA GAATTACACA
ATGACCTATG GGGGATAG
 
Protein sequence
MKLFYLATAS LFLVISIANS PAIAASADDP TSEAKCRHLA RDGLDIDNLA YWRNALTACE 
VYLQAEPGSI DARYFSLASK FKLQQFEPAL AGFEELAAQG HAPSMDYLAE AAAYGLGRER
DLNAALSWLR KARATNDPRS AEYLGEMYML GWGVAQDFVM ARSYFELADR LGYIPAKRSL
AWMYLDGIGV PVNAERGFAL LSAAAEQGHE RATTDIAFLY TKGIGTEKNA SRAVAMLDDL
VRKGSADGMR TLARYYLRDG DKTQRKTALD LLEKAVALGD GNAMRTLADV YVAGRETKRD
FAKAESLLDQ AIAQDILAAY RSKALLLLKM PDPRYDQAMY WMKEAASRGN ARAMADVGAM
YEQGKGVPVD ESEARIWYGK SAELGDSSGA RRFGQSLYRG TGGPRDVENG VKWLEKSAAA
GDTDAMRVLA YGYENGGGLP KDVVKAFEWF QKAAEAGDTD AFLEVADRTY DGSGTTADAQ
KSFVWYLKAA ENGSAKAQYC VGLLYERGEG TTQDSREAVH WFKAAAENGY IDAFAELGQM
YANDANLPRD DGKSIDYLEK GAAAGNVTAM VLLGIKYEDG DGVARDYAKA IEWYEAAANK
KSADAMYRLG TLYRGADGSQ KDFGKALEWL TKAGAHGNAK AQYALGDIYE YGQGVPIDRS
KALSWFMMAA LKQYPEAMNA VGYYYQNGIG TKEDQTIARN WFQKAADAGS AAGALNLAWY
YENGKDQDQA IAFQYYKKSA ELNSAGGMFA LGRFYDDGLG IAVNRQEAIK WYLRAMDTGY
NRAAYRLAYA YDATFSSDNA AENILLAMRQ GDRQIADELK AFSPFTRTAL KRSLLRRGLY
DGPLDDEIDQ PLKSALKNYT MTYGG
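
The sequences above are printed in blocks of ten characters, and the record's coordinates, gene length and protein length are related by simple arithmetic. The following is a minimal Python sketch of how those values could be cross-checked. The function names are hypothetical and not part of any database API. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the numbers in this record.

```python
def join_blocks(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table of this record.
start_bp, end_bp = 2039314, 2041911
gene_length = span_length(start_bp, end_bp)
print(gene_length)                           # 2598, the listed gene length
print(expected_protein_length(gene_length))  # 865, the listed protein length

# First line of the gene sequence as printed in this record.
first_line = "ATGAAGCTCT TCTATCTCGC GACCGCCTCA CTTTTTCTCG TCATCTCCAT TGCCAATTCT"
print(len(join_blocks(first_line)))          # 60 bases on a full line
```

Passing the whole gene sequence block through `join_blocks` and then `gc_fraction` would give a value to compare against the listed GC content of 57%.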