Gene Rleg_5947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5947 
Symbol 
ID8016367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp491383 
End bp493047 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content57% 
IMG OID644828060 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_002979260 
Protein GI241518632 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGCA ATGATGCAAC GCACTTAAAC ACCAATCTCC ATCGAGAGGG GCTTTCCCAG 
ACTCGGGAGG AACGCGTCAG CTATTTGAGG GCAGGCCACA GTCTGCGCTC ATGGCTTCTA
TCGACGGACC ATAAGCGTGT CGCGATTCTT TATCTCATCG CAATTACATT CTTCTTCTTC
ATCGGCGGCG TCGCCGCTGC TCTTGTGCGC GCGGATCTCC TGACACCGCA GGGAGATTTG
CTGACCAATG AGGGCTACAA TCGTGCCTTC ACCCTCCACG GCGTGATCAT GGTCTGGTTC
TTCCTCATTC CGTCTATTCC GAACACGTTT GGCAATTTCC TGATCCCGCT GATGATCGGC
GCGCGCGACC TTGCGTTCCC GCGCCTCAAT CTCCTGAGCT GGTATATCTT TGTATTGGCT
GGCTTGTTCA CATTGATTGT GGTGGTCACC GGCGGCGTCG ATACCGGATG GACATTCTAC
ACGCCGCTCT CCTCCATGTT TTCGAACGGA AACGTTGTTC TTGCAGCGAC GGCGGTTTTC
ATCGCCGGGT TCTCGTCCAT TCTAACCGGG CTGAACTTCA TCGTCACGAT CCACAAGCTT
CGATGCCCGG GTATGACCTG GGGTCGTCTG CCGCTCTTTG TCTGGTCGCA TTACGCGACA
TCGCTCGTTC TTGTCCTGGC GACGCCTGTT TTGTCGGTCA CGCTGGTGCT GATCGTCGCT
GAACGCTTTT TCCACCTCGG CGTCTTCGAT CCTGCTCTGG GTGGCGATCC TCTGCTTTAC
CAGCACCTGT TCTGGTTCTA CAGCCATCCT GCGGTCTATA TCATGGTGCT CCCGGCACTC
GGGGTGGTCA GCGAGCTGAT AGCTGCTGCC GCCCGCAAAC CCGTGTTTGG TTATCAGTTC
GTGGCTGGGT CCTCCATGGC GATCGCCGCA ATTGGCTTCC TTGTCTGGGG GCATCACATG
TTCGTTTCCG GCCAGTCGAT GTACGCCAGC GCCGCGTTCT CATTACTGAG CTTGGCCGTC
GCGGTTCCGT CAGGCATCAA GGTCTATAAT TGGACCGCAA CCCTCTATAA GGGCCACATT
GGCCTCGATC CGCCGTTTCT CTTTGCCATG GGCTTCATCG GTCTGTTCGT TGTCGGCGGA
TTGACTGGGC TCATGCTCGC CATGCTGGCT ATCGACCTCC ACGTCCACGA CACCTATTTC
GTGGTGGCGC ATTTTCACTA CATTATGGTT GGCGGTACCG TATCCGCCTT CTTCGGCGCC
CTGCACTATT GGTGGCCGAA GATCATCGGC CGCCGCTACA ACCACATCTG GGGCAGTATT
ACTGCCATTT TCATTTTTCT TGGATTCAAC ATGACTTTTT TCCCGCAGTT CCTGTTGGGT
TACTGGGGCA TGCCACGGCG CTACCATGTG TACCCGCCCG AGTTCCAGAC CCTGCACGTA
CTGTCGTCTG CCGGAGCGAC CATTCTTGGG TTCGCCTATC TAACGCCCTT CGTCTACCTG
TTCTATTCCA TGCGTTATGG CCAACCTGCG GGTGATAACC CTTGGGATGC ACGTGGACTG
GAGTGGACGG TGCCATCGCC GCCGCCGAAG CACAACTTCG ACCATCTGCC GGTCGTTAGC
GGCCCCCCTT ACGATTATCC GGTGGAGCGG GAGGGCGAAC AATGA
 
Protein sequence
MPRNDATHLN TNLHREGLSQ TREERVSYLR AGHSLRSWLL STDHKRVAIL YLIAITFFFF 
IGGVAAALVR ADLLTPQGDL LTNEGYNRAF TLHGVIMVWF FLIPSIPNTF GNFLIPLMIG
ARDLAFPRLN LLSWYIFVLA GLFTLIVVVT GGVDTGWTFY TPLSSMFSNG NVVLAATAVF
IAGFSSILTG LNFIVTIHKL RCPGMTWGRL PLFVWSHYAT SLVLVLATPV LSVTLVLIVA
ERFFHLGVFD PALGGDPLLY QHLFWFYSHP AVYIMVLPAL GVVSELIAAA ARKPVFGYQF
VAGSSMAIAA IGFLVWGHHM FVSGQSMYAS AAFSLLSLAV AVPSGIKVYN WTATLYKGHI
GLDPPFLFAM GFIGLFVVGG LTGLMLAMLA IDLHVHDTYF VVAHFHYIMV GGTVSAFFGA
LHYWWPKIIG RRYNHIWGSI TAIFIFLGFN MTFFPQFLLG YWGMPRRYHV YPPEFQTLHV
LSSAGATILG FAYLTPFVYL FYSMRYGQPA GDNPWDARGL EWTVPSPPPK HNFDHLPVVS
GPPYDYPVER EGEQ