Gene Rleg_0163 details


Gene Information

Locus tag: Rleg_0163
Symbol:
ID: 8011394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 162020
End bp: 164020
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 61%
IMG OID: 644822754
Product: Sigma 54 interacting domain protein
Protein accession: YP_002974013
Protein GI: 241202917
COG category: [R] General function prediction only
COG ID: [COG4178] ABC-type uncharacterized transport system, permease and ATPase components
TIGRFAM ID:
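
The start and end coordinates, gene length, and protein length listed in this record agree with one another. The minimal sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(162020, 164020)
print(gene_len, protein_len)  # 2001 666
```

The result matches the Gene Length (2001 bp) and Protein Length (666 aa) fields.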


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG CTAAACTTAA ACCGAAATCG GTCGACGGCA CCGAACCGCA CGGTGCCGGG 
AACTCGCCCC AGGAGACGGC ATCGACTGTC GAGGTCATGC CGCCACCAGA TGCCATCGAG
CCCGATCCGG AGTTGACACC TGAAGAGGCC GAGCAGGCGC GCAAGCGGTA TCTGCTCAAA
CGTTTCTGGA TCAGCGCGCG CCGTTACTGG GGTCGCGGTG GCGACAAGTT CGCTTGGCCC
TTCTCGATCG GGCTATTGGC CCTGATCGGC ATGAATGTCG GCTTCCAGTA CGGAATCAAT
CTGTGGAACC GCGGGATCTT CGACGCCATA GAGCGACACG ATGCCGGCAC CGTCTATTTC
CTGACCGCCG TATTCGTGCC GCTTGTGCTC GGAACCGTCG CCATTGTCAC GATACAGGTC
GCCGTTCGCA TGATGATCCA ACGTCGCTGG CGTTCCTGGC TGACAACATC AGTCATCGCG
CGCTGGCTTG CAAACGGCCG TTACTATCAG TTGAATCTCA TCGGCGGCGA CCACAAGAAC
CCGGAAGCGC GCATTTCCGA GGATTTGCGG ATTGCCACCG AAGCACCCGT CGATTTCATC
GCCGGTGTCA TTTCCGCATT TCTGGCGGCC TCGACCTTCA TCGTGGTGCT CTGGACGATC
GGCGGGGCTC TCACTCTGCC GATCGCAGGT TTCCCCGTTA CCATTCCCGG CTTTCTCGTC
GTCACTGCGG TCCTCTACGC CGCGATCACC TCTACTTCGA TGGCGGTCAT CGGCCGCTAT
TTCGTCCACG TCTCCGAGGC CAAAAATCAA GCAGAAGCCG AGTTTCGCTA CACGCTGACG
CATGTCAGGG AAAACGGCGA GAGCATCGCG CTTCTCGGCG GCGAAGAGGA GGAGCGTAAC
GACCTCGATA AGACCTTCGC CAATGTGCTA AGGCAATGGG CGCTGCTTGC CCGCCAGCAC
ATGCGCACAA CGCTTGTGTC GCATGGGTCG ATGCTGATTG CGCCAGTCGT CCCGGTCCTG
CTTTGCGCAC CAAAATTTCT CGAAGGCAGC ATGAGCCTCG GACAGGTCAT GCAGGCCGCC
TCTGCTTTTG CCATCGTTCA GGGCGCGTTC GGCTGGCTGG TCGACAACTA TCCCCGTCTT
GCCGATTGGA ATGCCTGTGC ACGGCGCATC GCCTCGCTGA TGATGTCGCT CGACGGGCTG
GAGCGCGCCG AACAGAGCGA CTCGCTCGGG CGCATCAAGC ATGGTGAAAC CGAAGGCGAG
GCGATGCTCA GCCTCAACGA TCTCTCCGTG TCGCTTGACG ATGGCACCGC CGTGGTGACG
GAAACCCGGG TCGAAATCGA GCCCGGCGAG CGGGTGCTTG TGTCCGGTGA ATCCGGGTCG
GGCAAGAGCA CGCTGGTGCG GGCCATCGCG GGTCTTTGGC CGTGGGGCGG CGGCAGCGTC
AATTTCCATG CCGACCGGCG ATTATTCATG TTGCCGCAAC GGCCCTATAT CCCTTCGGGC
ACGCTTCGCC GTGCGGTCGC CTATCCGGGC GCCGCCGATA GCTGGCCGCT GGACGAGATC
AAGGCGGCTC TCGACAAGGT GGGACTGGAT TATCTGAACG ACAAGATCGA GGAAGATGCG
CCCTGGGACC AGACCTTGTC GGGTGGCGAA AAGCAGCGGC TCGCCTTTGC GCGTCTGCTG
CTGCACCAAC CCGATATCAT CGTGCTGGAT GAAGCAACGG CAGCACTCGA TGAGAAGAGC
CAGGATAAGA TGATGCAGAT GGTGATCGAT GAATTGCCTG AAGTCACCAT CCTGAGCGTC
GCGCATCGCG CTGAGCTGGA AGTCTTCCAT AGCCGCAAGA TCACGCTCGA GCGGCGCGAG
GGCGGCGCAA AGCTTGTCAG CGATATCGAC CTGATCAAGC GCAAGAGAAA ACGGAACTTG
CTGTCACGCG TTTTGGAGAA GCGGCGCTCC CCGCCGAAAG GCAGTACGAC CGCGAATGAA
GGCGGCACAG TCCCCGAATA G
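
The GC content field can be recomputed from the gene sequence. The sketch below is one plausible way to do it, taking the fraction of G and C over all non-whitespace characters; how the source database derived its 61% figure is not stated, so treat this as an assumption. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among all non-whitespace characters."""
    bases = "".join(seq.split()).upper()  # drop the 10-base block spacing
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content field of the record.
first_line = "ATGGCCGACG CTAAACTTAA ACCGAAATCG GTCGACGGCA CCGAACCGCA CGGTGCCGGG"
print(round(gc_content(first_line), 1))
```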
 
Protein sequence
MADAKLKPKS VDGTEPHGAG NSPQETASTV EVMPPPDAIE PDPELTPEEA EQARKRYLLK 
RFWISARRYW GRGGDKFAWP FSIGLLALIG MNVGFQYGIN LWNRGIFDAI ERHDAGTVYF
LTAVFVPLVL GTVAIVTIQV AVRMMIQRRW RSWLTTSVIA RWLANGRYYQ LNLIGGDHKN
PEARISEDLR IATEAPVDFI AGVISAFLAA STFIVVLWTI GGALTLPIAG FPVTIPGFLV
VTAVLYAAIT STSMAVIGRY FVHVSEAKNQ AEAEFRYTLT HVRENGESIA LLGGEEEERN
DLDKTFANVL RQWALLARQH MRTTLVSHGS MLIAPVVPVL LCAPKFLEGS MSLGQVMQAA
SAFAIVQGAF GWLVDNYPRL ADWNACARRI ASLMMSLDGL ERAEQSDSLG RIKHGETEGE
AMLSLNDLSV SLDDGTAVVT ETRVEIEPGE RVLVSGESGS GKSTLVRAIA GLWPWGGGSV
NFHADRRLFM LPQRPYIPSG TLRRAVAYPG AADSWPLDEI KAALDKVGLD YLNDKIEEDA
PWDQTLSGGE KQRLAFARLL LHQPDIIVLD EATAALDEKS QDKMMQMVID ELPEVTILSV
AHRAELEVFH SRKITLERRE GGAKLVSDID LIKRKRKRNL LSRVLEKRRS PPKGSTTANE
GGTVPE
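
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCCGACG CTAAACTTAA ACCGAAATCG"))  # MADAKLKPKS
```

The output matches the first ten residues of the protein sequence; the final codons GTC CCC GAA TAG likewise give the terminal residues VPE followed by a stop.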