Gene Rleg_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3041 
Symbol 
ID8013955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3036551 
End bp3037819 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content60% 
IMG OID644825609 
ProductCytochrome b/b6 domain protein 
Protein accessionYP_002976837 
Protein GI241205741 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.568197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.966888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGCC ATTCCAGCTA CGAGCCATCA ACCGGCCTCG AGAAATGGGT CGATGCGCGC 
CTTCCGCTGC CGCGCATGGT CTATGACAGC TTCGTCGCCT ATCCGGTTCC GCGTAACCTG
AACTACGCCT ACACCTTCGG CGCCATGCTC GCCGTTATGC TGGTGGTGCA GATCCTGACG
GGTGTTACGC TCGCCATGCA CTATGCGGCC GAATCGTCGG TTGCCTTCAA CTCCGTCGAA
AAGATCATGC GTGACGTGAA CCACGGTTGG CTGCTGCGCT ATATGCATGC CAACGGCGCA
TCCTTCTTCT TCGTCGCGGT CTACCTGCAC ATCGCCCGAG GCCTCTATTA CGGCTCCTAC
AAGGCGCCCC GCGAAATCCT CTGGATCCTC GGCGTGGTCA TCTACCTCCT GATGATGGCG
ACCGGCTTCA TGGGCTATGT TCTGCCCTGG GGCCAGATGT CCTTCTGGGG CGCCACCGTC
ATCACCGGCT TCTTCTCCGC CTTTCCGCTG GTCGGCGAAT GGGTGCAGCA GTTCCTACTC
GGCGGCTTTG CGGTCGAAAA CCCGACGCTG AACCGCTTCT TCTCGCTGCA CTACCTGCTG
CCCTTCATGA TCGCCGGCGT CGTCATCCTG CACATTTGGG CGCTGCACGT CGTCGGCCAG
ACGAACCCGA CGGGCGTCGA GGTCAAGACG AAGACGGACA CGGTGCGCTT CACGCCTTAT
GCGACGATGA AGGATGCGCT CGGCGTCTCA ATCTTCCTGA TGGTCTATGC CTATTTCGTC
TTCTACCTGC CGAACTTCCT AGGCCATGCC GACAACTACA TCCCGGCCGA CCCGCTGAAG
ACGCCGGCCC ACATCGTTCC GGAATGGTAC TTCCTGCCGT TCTACGCGAT GCTGCGCTCG
ATCACCTTCA ACGTCGGCCC GATCGACTCC AAGCTCGGCG GCGTGCTCGT GATGTTCGGC
GCGATCATCG TGCTGTTCTT CCTGCCCTGG CTCGATACCT CAAAGGTCCG TTCCGCCGTC
TACCGCCCCT GGTACAAGCT GTTCTACTGG CTGTTCGTGA TCAACGCGAT CATCCTCGGC
TGGCTCGGTT CGCAGCCGGC GGAAGGCTTG TTCACCACGA TCTCGCAGAT CTGCACGCTC
CTCTACTTCG CTTTCTTCCT GGTCGCGATG CCGGTTCTCG GCTTGGTGGA AACACCGCGT
CGCATCCCGA ACTCGATCAC CGAAGCGGTG CTCGAAAAGC GCAACAAGAC CGTAGCGGTC
AAGGCATAA
 
Protein sequence
MSGHSSYEPS TGLEKWVDAR LPLPRMVYDS FVAYPVPRNL NYAYTFGAML AVMLVVQILT 
GVTLAMHYAA ESSVAFNSVE KIMRDVNHGW LLRYMHANGA SFFFVAVYLH IARGLYYGSY
KAPREILWIL GVVIYLLMMA TGFMGYVLPW GQMSFWGATV ITGFFSAFPL VGEWVQQFLL
GGFAVENPTL NRFFSLHYLL PFMIAGVVIL HIWALHVVGQ TNPTGVEVKT KTDTVRFTPY
ATMKDALGVS IFLMVYAYFV FYLPNFLGHA DNYIPADPLK TPAHIVPEWY FLPFYAMLRS
ITFNVGPIDS KLGGVLVMFG AIIVLFFLPW LDTSKVRSAV YRPWYKLFYW LFVINAIILG
WLGSQPAEGL FTTISQICTL LYFAFFLVAM PVLGLVETPR RIPNSITEAV LEKRNKTVAV
KA