Gene Smed_1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1540 
Symbol 
ID5322398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1631680 
End bp1632960 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID640790485 
Productcytochrome b/b6 domain-containing protein 
Protein accessionYP_001327217 
Protein GI150396750 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.309418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.652412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTG ATCATTCAAC CTACACGCCA ACGACAGGCA TCGAGAAGTG GGTTGATTCC 
CGCCTTCCGT TGCCGCGGCT CGTCCACGAC TCGTTCGTCT CCTATCCGGT TCCGCGCAAC
CTGAATTATG CTTACACCTT CGGTGCGATG CTTTCGGTGA TGTTGATCGT GCAGATCCTC
ACCGGCATCG TGCTGGCCAT GCACTATGCC GCAGAAACCT CCGTCGCCTT CAATTCGGTC
GAGAAGATCA TGCGCGACGT CAATCATGGC TGGCTGCTGC GCTACCTGCA TGCCAACGGT
GCGTCGTTCT TCTTCATTGC GGTCTACCTT CACATCGCCC GCGGCCTCTA TTACGGCTCC
TACAAGGCGC CGCGCGAGAT CCTCTGGATA CTCGGCGTGG TCATCTATCT CCTGATGATG
GCGACAGGCT TCATGGGCTA TGTGCTCCCC TGGGGGCAGA TGTCTTTCTG GGGTGCCACC
GTCATCACCG GGTTCTTCTC GGCCTTTCCG CTTATCGGAG AGTGGATCCA GCAGTTCCTG
CTCGGCGGCT TCGCCGTAGA CCAGCCGACG CTGAACCGGT TCTTCTCGCT GCATTACCTT
TTGCCGTTCA TGATCGCCGG CGTGGTCGTC CTGCACATCT GGGCGCTGCA CGTCACCGGT
CAAACGAATC CGACTGGGGT CGAGGTCAAG TCCAAGACCG ATACCGTGCC GTTCACGCCC
TATGCGACGC TGAAGGATGC ACTGGGCGTA TCGGTCTTCC TGATCGTCTA TGCATGGTTC
GTCTTCTATA TGCCGAACTT CCTCGGTCAC CCGGACAACT ACATCCCCGC TGATGCGTTG
AAGACGCCCG CACACATCGT TCCGGAATGG TACTACCTGC CGTTCTACGC GATGCTGCGC
GCCATCACCT TCAATGTCGG CCCGATCGAC TCCAAGCTCG GCGGCGTTCT GGTGATGTTC
GGCTCGATCA TCATCCTGTT CTTCCTGCCT TGGCTCGATA CGTCGAAGGT CCGCTCGGCC
GTGTACCGCC CCTGGTATAA GCTGTGCTTC TGGATCTTCG TTGCTAACTG CATCATGCTC
GGCTGGTTGG GCTCGCGCCC CGCGGAAGGC CTCTATGTCG TGATGTCGCA GCTCGGCACG
TTGTACTACT TCGCCTTCTT CCTCGTCATC ATGCCGGTCC TCGGTCTGAT CGAGACGCCG
AAGCGCATTC CGAATTCCAT CACCGAAGCG GTCTTGGAAA AACAGAATGC CAAGGCGCAG
TTGAAGCCCG CACGCGCCTG A
 
Protein sequence
MSADHSTYTP TTGIEKWVDS RLPLPRLVHD SFVSYPVPRN LNYAYTFGAM LSVMLIVQIL 
TGIVLAMHYA AETSVAFNSV EKIMRDVNHG WLLRYLHANG ASFFFIAVYL HIARGLYYGS
YKAPREILWI LGVVIYLLMM ATGFMGYVLP WGQMSFWGAT VITGFFSAFP LIGEWIQQFL
LGGFAVDQPT LNRFFSLHYL LPFMIAGVVV LHIWALHVTG QTNPTGVEVK SKTDTVPFTP
YATLKDALGV SVFLIVYAWF VFYMPNFLGH PDNYIPADAL KTPAHIVPEW YYLPFYAMLR
AITFNVGPID SKLGGVLVMF GSIIILFFLP WLDTSKVRSA VYRPWYKLCF WIFVANCIML
GWLGSRPAEG LYVVMSQLGT LYYFAFFLVI MPVLGLIETP KRIPNSITEA VLEKQNAKAQ
LKPARA