Gene Rleg_5145 details


Gene Information

Locus tag: Rleg_5145
Symbol:
ID: 8007043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 547070
End bp: 548779
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 65%
IMG OID: 644822058
Product: hypothetical protein
Protein accession: YP_002973318
Protein GI: 241113483
COG category:
COG ID:
TIGRFAM ID:
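
The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(547070, 548779)
print(length, protein_length(length))  # prints: 1710 569
```

Under those assumptions the results match the Gene Length (1710 bp) and Protein Length (569 aa) fields.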


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCG GGGCACAGGC GGTCTGTGAG AGCGAGGCAA GGTCGGAGGC GGAGCGACTT 
CTGGCCGACC CGCGGCTTCA TGTCTCCGAT CGCCATAGAG CGTTTCTCAG ATACATCGTC
GACGCCGTTT TCGAAGGTCG CGGCGACGCG GTGAAGGCCT ATTCCATCGC GATCGACGTC
TTCAACCGGC CTGCGTCGTT CGATCCCTCG TCCGATCCGA TCGTGCGCAT AGAGGCGACG
CGTCTTCGTG AGACCTTGGC GAAATATTAC GAGCAGCTCG GCGACGAGCC TGGAGCGCGC
CTGGACATTC CCCGCGGACG ATATGTTCCT GTCTTCGTCG AGCGAGGGCA ACCGCCATGC
CCCGGCGAAG ATGTTTCCGA TGTCGAAGAG GACATCGTCC CGGCAGCCGA AGACCGGACG
CCTTCGTCAG GGTCCGCCGT CAAGCGCAAA GGTCACATTG CGATCGCGCT GGCGAGCTTC
GCCGCCATTG CCATGGCCGG AGGCTACGCT GTCTTCCGCG CCGCCATGCC CGCGCCTTTG
GATACGCAAA AGCCATTCGT TTCATTATCG CTGAATGCGG CGCAGAAGGA TACGGTGGCC
GGAGAGGCGG TCATCGAAGA TCTGGCCGTG TCGCTTGCGC GGTTCGGCAC CGTCCGCCTC
AAATCCGATG CAATGACGAG GCCCGGCGTC GAGCCGGCCG AGCAGACGAC GTACGACGTC
AGGATGCGCT ACGGTGAGGA TGCGACATCC GTCTCCTTGT GGTGGCAGCT TTCCGACGCG
GCCACGGGAG AGGCGGTCTG GACCGACCAG GATCGACGCC AAAAAGGAAC CGGGACGCGA
GATGACGCGA TGCGCGCGCT GGTCTACGGC GTCTCCCGCA GGATCGCGGG ACCTGTCGGC
GTCGTCAACA CCATGGAACT GCGGCGCAAT CTGCCGGCAT CCACGACCGG CAACATCTGC
GTGCTGAGGG GCGAATTTGC CGCCGAACAG CGCCGCCTGG CCGCTCTGAA AGCGGCCCGT
CCCTGCCTGG AAGCAACGAT CGCCGCGGAT CCCGCTGACG CCGACGCGAT GGCGACACTT
GCACGCGTCT TCATGTGGAC CGGCCGCACG ACCGGCGACG ACAGCTATTT CGGCCGCGGT
CTCGAACTCG CGAACCGGGC TGCAACGGTC TCGCCCGTCT CCACCAGGGC AGCGCTTGCC
CAGTTGGCGA CGCAATACCA GGTCGGCCAG AACGACGCCT CCATCGCAGC CGGCCGGCGC
GGTGTCGCCC TGAACCCGGA AAATGCAGAC CTCCTGGCAA AGCTCTCGAT GGCCCTCTTC
CTGAGCGGGC ACTGGGAAGA GGGTAGCCGC CTTGCGAAAG AGGCGACCGA CCTCGTCGGC
CAGACGATAC GTGACGCGAA TTTCGTCATG ATCCTCGACG CCTACCGCCA GGGGCACTAC
GCCGAAGCGG TTTTACTTGC CCGGCAGGTC CCGGCAGCCG ATACGCCGAC GACCGTGTTG
AAGCTGGCCG CCATCGCTCG CCTCGGTGAC CGGCCGGTGA CGGAGCAGGA GATAGCCGCA
GCCCGTCTGC AGCATCCCGA TCTCGACCGG ACAGTCGCAG CCATGTTCTC CGGAGCGCGT
TTTGACAGCA GTCTCAAAGC TGCCCTTCGA ACAGGCATTC TGGCAGCGGG CTTAAAATCG
CCGGAACTCG CCAGCAACGG GTCGATGTAA
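
The GC content field can be recomputed from the gene sequence as listed. The following is a minimal sketch, assuming the sequence is pasted with its display spacing (blocks of ten bases separated by spaces and line breaks). `gc_content` is a hypothetical helper name, and the example input is only the first row of the sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence; paste the full sequence to check the whole gene.
first_row = "ATGAGCAGCG GGGCACAGGC GGTCTGTGAG AGCGAGGCAA GGTCGGAGGC GGAGCGACTT"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```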
 
Protein sequence
MSSGAQAVCE SEARSEAERL LADPRLHVSD RHRAFLRYIV DAVFEGRGDA VKAYSIAIDV 
FNRPASFDPS SDPIVRIEAT RLRETLAKYY EQLGDEPGAR LDIPRGRYVP VFVERGQPPC
PGEDVSDVEE DIVPAAEDRT PSSGSAVKRK GHIAIALASF AAIAMAGGYA VFRAAMPAPL
DTQKPFVSLS LNAAQKDTVA GEAVIEDLAV SLARFGTVRL KSDAMTRPGV EPAEQTTYDV
RMRYGEDATS VSLWWQLSDA ATGEAVWTDQ DRRQKGTGTR DDAMRALVYG VSRRIAGPVG
VVNTMELRRN LPASTTGNIC VLRGEFAAEQ RRLAALKAAR PCLEATIAAD PADADAMATL
ARVFMWTGRT TGDDSYFGRG LELANRAATV SPVSTRAALA QLATQYQVGQ NDASIAAGRR
GVALNPENAD LLAKLSMALF LSGHWEEGSR LAKEATDLVG QTIRDANFVM ILDAYRQGHY
AEAVLLARQV PAADTPTTVL KLAAIARLGD RPVTEQEIAA ARLQHPDLDR TVAAMFSGAR
FDSSLKAALR TGILAAGLKS PELASNGSM
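
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style layout. Translation table 11 uses the same amino-acid assignments as the standard code and differs in the start codons it permits, so this sketch would read an alternative start codon such as GTG as V rather than M. That does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG x TCAG x TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 30 bases of the gene sequence.
print(translate("ATGAGCAGCG GGGCACAGGC GGTCTGTGAG"))  # MSSGAQAVCE
print(translate("CCGGAACTCG CCAGCAACGG GTCGATGTAA"))  # PELASNGSM
```

Both fragments agree with the first and last residues of the protein sequence listed above.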