Gene Rleg2_5243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5243 
Symbol 
ID6978337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp873957 
End bp875870 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content60% 
IMG OID643394355 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_002279173 
Protein GI209547255 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.407043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.600413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGGC ACGGTAAAAT TCCCTCCCCT GCTGATCTGG CTGCCGGCGA TGAAAAGGCC 
GAAGCTTTCC TTAAGGCCAC ATGGTCGACG CCAGCAGGGA TCGTGGCGGC CCTATCGACC
GTCGATCACA AGATCATCGG CCGCCGCTAC ATTGCCACAG CCTTCGTCTT TCTGATGCTT
GGCGGCGTCC TGGCGATGGC CATGCGCCTC CAGCTCGCAT TCCCGGAAGC CCGCTTCATC
AGCGCTGACC GGTACAACCA GATCTTTACC GTGCATGGTA GCAACATGAT GTTCCTGTTC
GCCGTGCCGG TGATGGAGGC GATGGCGGTC TACCTGGTTC CCCTGATGGT CGGTACCCGC
AACATCGCCT TTCCCCGGCT GAATGCCTTC TCCTACTGGA TCTTTCTCGC CGGCGGGCTG
CTTCTGTGGG CGTCTCTCGC GCTCGACACT GCCCCCGATG TCGGATGGTT CGCCTACGTG
CCACTGGCAG GCCCGCAATA CGGCCCGGGC AAGCGGGCAG ACATTTGGGC TCAGATGATT
ACCTTTACGG AAGTCTCGGC CCTTGCCGTT GCCGTCGAAA TCGTCGTGAC CGTCTTCAAA
TTGCGGGCGC CTGGCATGTC GCTCGACCGG ATCCCGATCT TTGTCTGGTC GATGCTCGTC
ACAGCCTTTC TGGTCATCAT GGCGATGCCA GCGATCATGC TCGTCAGTAC GTCGCTCATC
CTTGATCGGC TGGTCGGCAC GCAGTTCTTC AATCCGGCAG AGGGCGGCGA TGCGCTCCTC
TGGCAGCATC TGTTCTGGTT TTTCGGCCAT CCGGAGGTCT ATATCATCTT CCTGCCTGCC
GTCGGCATGG TTTCGGCGAT GATCTCGACC TTCACGCAGC GCCCCGCCTT CGGCTATCTG
CCGCTGGTGC TGGCGATGAT TGCGACCGGT GTTCTCGCCT TCGGATTGTG GGTGCACCAT
ATGTTCGTGG CCGGCCTGCC GCGTGTAGGC AGCAGCTTCT TCACCGCCTC GAGCATGGCG
ATCGCCATTC CGGCAGGCAC CCAGATCTTT TGCTGGCTGG CAACTCTGTG GGACGGCCGT
CCCATCTTCA AGACGCCAAT GCTTTTCATC ATCGGCTTCC TCATCACATT CGTCATTGGC
GGTTTGACTG GCGTCATGGT GGCATCGGTG CCGTTCGACA CCCAGGTTCA CGACACTTAT
TTCGTCGTGG CGCATTTTCA CTATGTGCTG ATCGGCGGTT CCGTCTTTCC GCTGATCGGG
GCGATCTATT ATTGGTTTCC GAAAATGACC GGGCGAATGA TGAGCGAAAG GCTTGGGCGC
TGGGCCTTCG GCCTGATCTT CACCGGTTTT CATCTCACCT TCTTCCCGAT GCATATCCTC
GGCCTGCAAG GCATGCCGCG GCGCGTTTAT ACCTATCCGC CAGAATTACC CTGGACGGGC
CTGAACCTCT TCGTCAGCCT CAGTGCCGTC ATCCTGGCGG GTGGTTTCCT CGTCTTCTTC
ATCGACGTCC TGCGCAGTTT CCGGCATGGC CCTGCCGCGG GCTCCAATCC CTGGAATGCA
TCGACACTGG AATGGGCCAC GCCCTCGCCC CCGCCTCCTT ATAACTTCCG GCACATTCCC
GTGGTCGAAT CCCGGGAGCC GCTCTGGACG TCAGGCGATG CACTGGCGGT GGCGACAGGT
CTGCGGCTCG ACCGACGGGA ACTGATCGTC AGCAGTCTCA CAACTGCCGA TCCGGAAGCC
CGGGAATCCT CGCCGGCCAA TTCCATCTGG CCATTTCTGG CCGCCATTGC CACGAGCGTC
ATGCTGATTG CCTCGATCTT CAGTCCCTGG GCGGTCATCT GGGGCGCGAT CCCGGTAGCG
ATCACCTTGA CCGGTTGGTT CTGGCCGAAA CGAACGCCGG AGGACGAATC ATGA
 
Protein sequence
MTRHGKIPSP ADLAAGDEKA EAFLKATWST PAGIVAALST VDHKIIGRRY IATAFVFLML 
GGVLAMAMRL QLAFPEARFI SADRYNQIFT VHGSNMMFLF AVPVMEAMAV YLVPLMVGTR
NIAFPRLNAF SYWIFLAGGL LLWASLALDT APDVGWFAYV PLAGPQYGPG KRADIWAQMI
TFTEVSALAV AVEIVVTVFK LRAPGMSLDR IPIFVWSMLV TAFLVIMAMP AIMLVSTSLI
LDRLVGTQFF NPAEGGDALL WQHLFWFFGH PEVYIIFLPA VGMVSAMIST FTQRPAFGYL
PLVLAMIATG VLAFGLWVHH MFVAGLPRVG SSFFTASSMA IAIPAGTQIF CWLATLWDGR
PIFKTPMLFI IGFLITFVIG GLTGVMVASV PFDTQVHDTY FVVAHFHYVL IGGSVFPLIG
AIYYWFPKMT GRMMSERLGR WAFGLIFTGF HLTFFPMHIL GLQGMPRRVY TYPPELPWTG
LNLFVSLSAV ILAGGFLVFF IDVLRSFRHG PAAGSNPWNA STLEWATPSP PPPYNFRHIP
VVESREPLWT SGDALAVATG LRLDRRELIV SSLTTADPEA RESSPANSIW PFLAAIATSV
MLIASIFSPW AVIWGAIPVA ITLTGWFWPK RTPEDES