Gene Rleg_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0643 
Symbol 
ID8011822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp679297 
End bp681000 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content61% 
IMG OID644823233 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_002974486 
Protein GI241203390 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.365928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAC CTTCCGCTCA CGACGATCAT TCTCATGATC ACGCCGCCCA TCATGACCAC 
GCGCATGACG ATCACCATGA TCACGGGCAC AAGCCGAGCT TTGCCAATCG CTGGCTGTTC
TCGACCAACC ACAAGGACAT CGGCACGCTC TACCTGATCT TCGCGATCAT TGCCGGCATC
ATCGGCGGCG CGCTGTCGGT TGCCATGCGC ATGGAGCTGC AGGAGCCTGG CATCCAGATC
TTCCACGGCC TGGCCTCGAT GGTCTACGGC TATGAGGGTG ACGCTGCGAT CGACGGCGCC
AAGCAGATGT TCAACATGTT CACGACCGCG CACGCGCTGA TCATGATCTT CTTCATGGTC
ATGCCGGCGA TGATCGGCGG TTTCGCCAAC TGGATGGTGC CGATCATGAT CGGCGCGCCC
GACATGGCTT TCCCGCGCCT CAACAACATC TCCTTCTGGC TGATCGTTCC CGCCTTCGCG
CTGCTGCTGC TGTCGATGTT CGTCGAAGGC CCGGCAGGCG CTTATGGTAC GGGCGGTGGT
TGGACGATGT ATCCGCCGCT GGCGACAACC GGCACGCCGG GACCGGCGGT CGACCTTGCG
ATCTTCGCGC TCCACATTGC CGGCGCCTCG TCGATCCTCG GTGCGATCAA CTTCATCACC
ACGATCCTCA ACATGCGCGC TCCCGGCATG ACGCTGCACA AGATGCCGCT GTTTGCCTGG
TCCGTGCTGA TCACCGCCTT CCTGCTCTTG CTGTCGCTGC CGGTTCTGGC AGGCGGCATC
ACCATGCTGC TCACCGACCG TAACTTCGGC ACATCCTTCT TCTCGCCGGA AGGCGGCGGC
GACCCGATTC TTTACCAGCA CCTGTTCTGG TTCTTCGGTC ACCCCGAGGT CTACATCCTC
ATCCTGCCGG GCTTCGGCAT GGTCAGCCAC ATCATCTCGA CCTTCTCGAA GAAGCCGATC
TTCGGCTATC TCGGCATGGC CTACGCCATG GTCGCGATCG GCGCCGTCGG CTTCGTCGTC
TGGGCTCACC ACATGTACAC GGTCGGCCTG TCGCTCGACG CACAGCGCTA CTTCGTCTTC
GCGACGATGG TCATCGCCGT TCCGACGGGT GTGAAGATCT TCTCCTGGAT CGCGACGATG
TGGGGCGGCT CGATCTCGTT CCGCACGCCG ATGCTCTGGG CGATCGGCTT CATCTTCCTG
TTCACGGTCG GCGGCGTCAC CGGCGTCCAG CTCGCCAATG CCGGTCTCGA CCGCTCGCTG
CATGACACCT ATTACGTCGT GGCCCACTTC CACTACGTTC TGTCGCTCGG CGCCGTCTTT
GCGATCTTCG CCGGCTGGTA CTACTGGTTC CCGAAGATGA CCGGCTACAT GTACAACGAG
CTGGTCGGCA AGCTGCATTT CTGGATCATG TTCATTGGCG TCAACCTGGT GTTCTTCCCG
CAGCACTTCC TCGGTCTCGC CGGCATGCCG CGCCGCTACA TCGATTATCC GGATGCCTTT
GCCGGCTGGA ACTACGTTTC CTCGATCGGC TCCTACATCT CGGCCTTCGG TGTGCTGATC
TTCCTCTACG GCGTCTTCGA AGCCTTCGCC AAGAAGCGTG TGGCCGGCGA CAATCCGTGG
GGTGAGGGTG CAACGACGCT CGAATGGCAG CTGCCTTCGC CGCCGCCCTA TCACCAGTGG
GAACAGCTTC CGCGCATCAA GTAA
 
Protein sequence
MAGPSAHDDH SHDHAAHHDH AHDDHHDHGH KPSFANRWLF STNHKDIGTL YLIFAIIAGI 
IGGALSVAMR MELQEPGIQI FHGLASMVYG YEGDAAIDGA KQMFNMFTTA HALIMIFFMV
MPAMIGGFAN WMVPIMIGAP DMAFPRLNNI SFWLIVPAFA LLLLSMFVEG PAGAYGTGGG
WTMYPPLATT GTPGPAVDLA IFALHIAGAS SILGAINFIT TILNMRAPGM TLHKMPLFAW
SVLITAFLLL LSLPVLAGGI TMLLTDRNFG TSFFSPEGGG DPILYQHLFW FFGHPEVYIL
ILPGFGMVSH IISTFSKKPI FGYLGMAYAM VAIGAVGFVV WAHHMYTVGL SLDAQRYFVF
ATMVIAVPTG VKIFSWIATM WGGSISFRTP MLWAIGFIFL FTVGGVTGVQ LANAGLDRSL
HDTYYVVAHF HYVLSLGAVF AIFAGWYYWF PKMTGYMYNE LVGKLHFWIM FIGVNLVFFP
QHFLGLAGMP RRYIDYPDAF AGWNYVSSIG SYISAFGVLI FLYGVFEAFA KKRVAGDNPW
GEGATTLEWQ LPSPPPYHQW EQLPRIK