Gene Smed_4759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4759 
Symbol 
ID5318483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1279527 
End bp1280648 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content65% 
IMG OID640776557 
Productcytochrome c class I protein 
Protein accessionYP_001313489 
Protein GI150376893 
COG category[C] Energy production and conversion 
COG ID[COG2863] Cytochrome c553 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.880991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0494649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCTGA GAAACCTGCA CTGGACGAAC ATCGCGAAGG TTGCGGGGCT CGGCACCGGG 
CTCCTTATCG TCCTCGGCGC CGTGTTCGTC TGGTCCGGCA TCTACAACGT CGCCGCATCG
AAGGACCATC TGCAGATCAC GACCTGGATC TTGACGCTGA TCCGTGAACG ATCGATCGCC
ACCCACAGTT TCAAGATCGA GGTGCCGGCG CTCGATGACG AGAGCAAGAT CCGGCTCGGG
GCGTCTCACT ACGAGGGCGG ATGCGTGCCG TGCCATAACC GCCCCGGCGA AGAGATAAAC
TCCATAGTCA AAGGCATGCT GCCACCGCCA CCCAACCTGC TGGAAATCGG CAAGCATCGC
CCGCCCGAGG AGATCTTCTG GATCGTAAAG CACGGCCTCA AATACACGGG CATGCCGGCA
TGGACGAATG TGTTACGCGA CGATGAGGTT TGGGCCCTTA CCGCGTTTCT CGCGAGCCTG
CCGGCCACGG CTGGCGATTA CGGCGAGCTC GCAGGTCTTT CGCGCGGTCA GGGCAATGCG
CGTGAGGAAC CGGCGAACGG GCGTGCCCTC AACGTCTGCG TGCGCTGCCA TGAACGCGAT
GGCATGAGCA CCAACGGCGA CCGTGTGCCG CGGCTCGCGG GCATGCCGGA GGCTTATCTT
CTTCGCAGTC TCCAGGAATA TGCACAAGGG ACACGCGCAA GCGGTGTCAT GGAACCGGTC
GCCGACCTGC TCTCCGAGGA GGCAATGCGG GAGCTGGCGG CGCATTATCA GGCGCTTCCG
CCTGTCGCCG GAACGGCCGA ACCAGATCCG GAGCAGCTCC GGCGGGGCGA GGCCATCGCC
AGGCGCGGCA TAGTGGGCCA AGGCGTGCCG GCCTGTCTAA GCTGCCATTC CGGGCGTCAG
TCGCAGCAGT TCCCGGTGCT CGCCGGACAG AATGCCGCCT ACATCGAGGA GCAGATACGG
CTCTGGCGTC GCGGTGGGCG GATCGGAACC CCCTATGGAA GGATTATGGC GGCAGTCGCC
GGGGCTCTCG ACGAAGGACA GATCGAGGAT GTCGCCGCCT ACCTTGCCTC ACTTCCCGCG
GGACGCGCGC CGGACGCGCC GGTGGCGGAG GCTGGCCGAT GA
 
Protein sequence
MDLRNLHWTN IAKVAGLGTG LLIVLGAVFV WSGIYNVAAS KDHLQITTWI LTLIRERSIA 
THSFKIEVPA LDDESKIRLG ASHYEGGCVP CHNRPGEEIN SIVKGMLPPP PNLLEIGKHR
PPEEIFWIVK HGLKYTGMPA WTNVLRDDEV WALTAFLASL PATAGDYGEL AGLSRGQGNA
REEPANGRAL NVCVRCHERD GMSTNGDRVP RLAGMPEAYL LRSLQEYAQG TRASGVMEPV
ADLLSEEAMR ELAAHYQALP PVAGTAEPDP EQLRRGEAIA RRGIVGQGVP ACLSCHSGRQ
SQQFPVLAGQ NAAYIEEQIR LWRRGGRIGT PYGRIMAAVA GALDEGQIED VAAYLASLPA
GRAPDAPVAE AGR