Gene Smed_5931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5931 
Symbol 
ID5320233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp894148 
End bp895722 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content63% 
IMG OID640777624 
Productcytochrome c oxidase cbb3 type accessory protein FixG 
Protein accessionYP_001314556 
Protein GI150377961 
COG category[C] Energy production and conversion 
COG ID[COG0348] Polyferredoxin 
TIGRFAM ID[TIGR02745] cytochrome c oxidase accessory protein FixG 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.420299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCACC AGCCGATAAC CAAAGATCCA GTCGAACGCC TCGACGCCGA AGCGGTCAAT 
TCCGCCCGCG TGCGAGGGCC GCTCTACGAG AAGCGCCGGA AGATCTTCCC GAAGCGGGCC
GAGGGCCGCT TTCGCCGGTT CAAGTGGCTA TTGATGCTGA TGACGCTCGG CGTCTACTAT
CTGACGCCAT GGATCCGCTG GGACCGCGGG GCGCATGCGC CGGATCAGGC AGTCCTTATC
GACCTCGATT CCCGGCGCTT CTATTTCTTC TTCATAGAAA TTTGGCCGCA GGAATTCTTC
TTCGTCGCGG GGCTTCTGGT GATGGCAGGC TTCGGGCTTT TCCTCGTGAC CTCTGCGGTC
GGGCGGGCTT GGTGCGGCTA CGCTTGCCCG CAGACCGTCT GGGTCGATCT CTTCCTCGTC
GTCGAGCGCT TCATCGAGGG CGACCGCAAC GCGCGCATGC GCCTAGACGC CGCACCGTGG
AGCCTCGACA AGATCCGTAA GCGCGTGGCC AAGCATGCCA TATGGCTGGC GATCGGTGTT
GCGACCGGCG GCGCGTGGAT ATTCTATTTC GCCGACGCGC CGTCGCTGTT GGTGAGTTTG
GTCGCACTCG ACTCGCCGCC GGTCGCCTAT AGCACGATCG GCATCCTGAC TGCTACGACC
TACGTCTTCG GCGGACTGAT GCGGGAGCAG GTCTGCACCT ACATGTGCCC ATGGCCGCGC
ATCCAGGCGG CCATGCTGGA CGAGAACTCG CTCGTCGTCA CCTATAACGA CTGGCGGGGG
GAGCCGCGCT CGCGGCATGC GAAAAAATCG GCTGCGGCCG GCGAGGTCGT CGGAGATTGC
GTCGACTGCA ACGCCTGCGT CGCCGTCTGT CCAATGGGAA TCGACATCCG CGACGGCCAG
CAGCTCGAGT GCATCACCTG CGCGCTCTGC ATCGACGCCT GCGACGGCGT GATGGATAAG
CTCGGCCGCG AGCAGGGGCT AATCTCATAC GCGACGTTCA GCGACTATGC CGCCAACATA
GCTCTCGCGA CGAGCGGCAC GACCGCGGCG ATCGATCCGA GCCGCGTGCG CGACGCCGAT
GGCGCTTTCC GTGACAAGGT GAGGCATCTC AACTGCCGCA TCGTCTTCCG GCCCCGCGTT
CTCGTCTATT TCGGCATCTG GGCGATTGTC GGACTCGGTC TTCTTTTCGG CCTGCTGGCG
CGCGACCGGC TGGAGCTGAA CGTCCTGCAC GACCGCAACC CTCAGTTCGT CGTCGAATCC
GACGGCTCGG TGCGCAACGG CTACATGGTC AAGCTTCTCA ACATGATCCC AAAACAGCGC
ACCATCAGCC TGACGATCGA GGGTATGCCC GCCGCCACCA TACGCATGGC CGGACAGGCA
ACGGACGATG GACGCAGCAT CACCATCGGA GTCGAGCCGG ACAAGGTCAC CTCGCTCAAA
GTCTTCGTCA CTTTGCCGAA AGGCAGATTC GCCGAGGCTG AAGAGGGCTT CTCCCTCATC
GCGGAGGATC CGTCCAGCCA CGAACGCGAT GTGTATCAGG CCAATTTCAA TCTACCGGGA
GCAGCAAGAC GATGA
 
Protein sequence
MLHQPITKDP VERLDAEAVN SARVRGPLYE KRRKIFPKRA EGRFRRFKWL LMLMTLGVYY 
LTPWIRWDRG AHAPDQAVLI DLDSRRFYFF FIEIWPQEFF FVAGLLVMAG FGLFLVTSAV
GRAWCGYACP QTVWVDLFLV VERFIEGDRN ARMRLDAAPW SLDKIRKRVA KHAIWLAIGV
ATGGAWIFYF ADAPSLLVSL VALDSPPVAY STIGILTATT YVFGGLMREQ VCTYMCPWPR
IQAAMLDENS LVVTYNDWRG EPRSRHAKKS AAAGEVVGDC VDCNACVAVC PMGIDIRDGQ
QLECITCALC IDACDGVMDK LGREQGLISY ATFSDYAANI ALATSGTTAA IDPSRVRDAD
GAFRDKVRHL NCRIVFRPRV LVYFGIWAIV GLGLLFGLLA RDRLELNVLH DRNPQFVVES
DGSVRNGYMV KLLNMIPKQR TISLTIEGMP AATIRMAGQA TDDGRSITIG VEPDKVTSLK
VFVTLPKGRF AEAEEGFSLI AEDPSSHERD VYQANFNLPG AARR