Gene Smed_5935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5935 
Symbol 
ID5320237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp897729 
End bp899348 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content61% 
IMG OID640777628 
Productcbb3-type cytochrome c oxidase subunit I 
Protein accessionYP_001314560 
Protein GI150377965 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3278] Cbb3-type cytochrome oxidase, subunit 1 
TIGRFAM ID[TIGR00780] cytochrome c oxidase, cbb3-type, subunit I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.137652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGA CAGTCGAGAT GGTCGTACTT GCCGTCGGCG CCTTCCTGGC GCTGGTCGGA 
GCCGGTCTCG CCCAGGACCG CCTGTTCGGC GCGCACATGT GGATACTGTT CTTCGTGCTG
CTCGGCGGCA CATTGGTGCT CATGCGCCGC GTTGATTTCC GTGCGGCTTC CGCGGGTCGC
CGGGCCGGAG AGACGGAATA TTTCGACGAG GTCGTGAAGT ACGGCGTCAT CGCCACGGTA
TTCTGGGGTG TGGTCGGTTT TCTCGTAGGC GTCGTCGTTG CCCTGCAACT TGCCTTCCCC
GACCTCAATG TGGAGCCCTG GTTCAGCTTC GGCCGCATGC GGCCGCTCCA CACCTCGGCC
GTGATCTTCG CGTTTGGCGG CAACGCATTG ATCGCGACGT CCTTCTATGT GGTTCAGCGC
ACGAGTCGAG CGCGCCTCTT CGGCGGTGAT CTCGGCTGGT TCGTTTTCTG GGGTTATCAG
CTTTTCATCG TGCTTGCAGC GACCGGCTAC CTGCTCGGAA TTACCCAGAG CCGCGAATAC
GCGGAGCCGG AATGGTACGT CGATCTTTGG CTGACAATCG TATGGGTCGC TTATCTGGCC
GTCTTCCTCG GCACGATCTT GAAGCGCAAA GAACCGCACA TCTACGTGGC GAACTGGTTC
TATCTCGCCT TCATCGTCAC CATCGCGATG CTGCACATCG TCAACAACCT GGCGGTGCCG
GTATCGTTCC TGGGTTCCAA GAGCTATTCG GCCTTCGCCG GCGTCCAGGA CGCGCTGACG
CAATGGTGGT ACGGCCATAA CGCGGTCGGC TTCTTCCTGA CCGCCGGCTT TCTGGCAATG
ATGTACTACT TCATCCCGAA ACAGGTAAAT CGCCCCGTCT ATTCCTACCG GCTGTCGATC
ATCCACTTCT GGGCGCTGAT CTTCATGTAC ATCTGGGCAG GTCCCCACCA CCTGCACTAC
ACGGCGCTGC CCGACTGGGC TCAGACGCTC GGCATGGTCT TCTCCATCAT GCTCTGGATG
CCCTCCTGGG GCGGCATGAT CAACGGCTTG ATGACGCTCT CCGGTGCCTG GGACAAGATC
CGTACGGATC CGGTCGTCCG CATGATGGTC ATGGCCGTCG CCTTCTACGG CATGGCGACC
TTCGAGGGGC CGATGATGTC GATCAAGTCC GTCAATTCGC TGAGCCACTA TACCGACTGG
ACCATCGGTC ACGTCCATTC CGGCGCGCTC GGATGGAACG GCCTCATCAC CTTCGGTGCC
GTTTACTATC TGGTCCCGAA GCTCTGGAAC CGGGAGCGTC TTTACAGCCT GCAAATGGTC
AATTGGCACT TTTGGCTCGC CACCCTCGGC ATCGTCGTCT ATGCCGCCAC AATGTGGGTA
GCGGGCATCC AGCAGGGGCT GATGTGGCGC GAATACGATG ATCAGGGTTT CCTCGTCTAC
TCCTTCGCGG AGTCGGTGGC GGCGATGTTC CCGTACTACG TCATGCGTGC CGCGGGCGGT
GCCCTGTTCC TCGCCGGCGC GCTCGTAATG GCCTTCAACG TAACAATGAC CATCCTCGGC
CGCATGCGCG ATGAGGCCGC GGCCATGGAT GCCGCGCCGC TGCCGGCACC GGCAGAATAG
 
Protein sequence
MKQTVEMVVL AVGAFLALVG AGLAQDRLFG AHMWILFFVL LGGTLVLMRR VDFRAASAGR 
RAGETEYFDE VVKYGVIATV FWGVVGFLVG VVVALQLAFP DLNVEPWFSF GRMRPLHTSA
VIFAFGGNAL IATSFYVVQR TSRARLFGGD LGWFVFWGYQ LFIVLAATGY LLGITQSREY
AEPEWYVDLW LTIVWVAYLA VFLGTILKRK EPHIYVANWF YLAFIVTIAM LHIVNNLAVP
VSFLGSKSYS AFAGVQDALT QWWYGHNAVG FFLTAGFLAM MYYFIPKQVN RPVYSYRLSI
IHFWALIFMY IWAGPHHLHY TALPDWAQTL GMVFSIMLWM PSWGGMINGL MTLSGAWDKI
RTDPVVRMMV MAVAFYGMAT FEGPMMSIKS VNSLSHYTDW TIGHVHSGAL GWNGLITFGA
VYYLVPKLWN RERLYSLQMV NWHFWLATLG IVVYAATMWV AGIQQGLMWR EYDDQGFLVY
SFAESVAAMF PYYVMRAAGG ALFLAGALVM AFNVTMTILG RMRDEAAAMD AAPLPAPAE