Gene Smed_4588 details


Gene Information

Locus tag: Smed_4588
Symbol:
ID: 5319004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1083849
End bp: 1085048
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 64%
IMG OID: 640776389
Product: hypothetical protein
Protein accession: YP_001313321
Protein GI: 150376725
COG category:
COG ID:
TIGRFAM ID:
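
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1083849
end_bp = 1085048

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1200 bp) and Protein Length (399 aa).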


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.597366
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCAGC CGGCCGACAG GGAATTTTAC GCGAACCTTC CGCTTTTCGA GGCATTCGAA 
GGTGTCGCAG ACGAGGCCAA TTACCGGCCC CTGCCCGAAG GCTGGTGGCT TGCCGTTGCC
GACATCGTGG ATTCGACGGG AGCCATCGCG GAAGGGCGAT ACAAGAGCGT GAACATGGCC
GGCGCGAGCG TCATCTCCGC GCTCATGAAT GCGCTTGACG AGAAAAATCT CGCCTTCGTT
TTCGGTGGCG ACGGTGCGCT CGCCGTCGTG CCCGGCGGGC TGGCGGCAAA GGCAAAACAC
GCGCTCGCTG CGGCGAAAAC ATGGGTTGCG GAAGAGCTTG GACTGGAGCT TCGCGCTGCG
ATCGTCCCGG TCTCGGACGT GCGCGCCAAT GGCTTCGACA TGCGCGTTGC GCGCTTCAAG
GCGAGCGAGG TGGTCTCCTA TGCCATGTTC TCCGGCGGCG GCGCCAGCTG GGCGGAAGCG
GAAATGAAGG CAGGCCGTTA TCAGATTGCG GCCGCCCCGA CCGGCACACG GCCCGACCTG
ACCGGGTTGT CCTGCCGGTG GAACCCGATC GTCTCACATC ACGGGGCGAT CGTATCCATC
ATCGCAGTGC CGGGAGAGCG CGGCATCGGA CCTGAATTCC AGGCTTTGAT CGGCGACATC
GTGGAACTGG CCGAAGGGGA GGAGCGGGGT GGGCACCCCG TACCGGAAAA CGGTCCCGAG
CCGCATCTGT CGGTGCGCGG CATCACGGTG GAATCGCGCG CCGTCGCGCC GAGAGGCCGC
CGCTCCCTGG CTTGGTTCTT CGTCGCCGCG CAGAGCCTTG CTCTCTTTCT CTGCTTCAGG
CTCGGCATCA ATTTCGGCCC CTTCGACGTC AAGCGATATG CGCGCGACCT TGCCAGCAAT
TCGGACTTCC GCAAGTTTGA TGACGCTCTG AAGATGACGA TCGACGTCAG TCTCGATCGG
CTGCGCAGAA TCGAGGAGCG GCTGAAGCAA GGGGTCGCGG CGGGCATATG CCGCTACGGA
CTGCACCGGC AGGATGCGGC ACTGATGACG TGCATCGTAC CGACGCCGAT GAGCCGCGAC
CACATGCACT TCATCGACGG GGCGGCCGGC GGGTACGCGG TGGCCGCCCG GAACCTGAAG
GCCACCCTTG CCGGCAGCGT TTCACAGGCG GGAAGTCTAC CTTCGATGAT TAAGCCTTGA
 
Protein sequence
MVQPADREFY ANLPLFEAFE GVADEANYRP LPEGWWLAVA DIVDSTGAIA EGRYKSVNMA 
GASVISALMN ALDEKNLAFV FGGDGALAVV PGGLAAKAKH ALAAAKTWVA EELGLELRAA
IVPVSDVRAN GFDMRVARFK ASEVVSYAMF SGGGASWAEA EMKAGRYQIA AAPTGTRPDL
TGLSCRWNPI VSHHGAIVSI IAVPGERGIG PEFQALIGDI VELAEGEERG GHPVPENGPE
PHLSVRGITV ESRAVAPRGR RSLAWFFVAA QSLALFLCFR LGINFGPFDV KRYARDLASN
SDFRKFDDAL KMTIDVSLDR LRRIEERLKQ GVAAGICRYG LHRQDAALMT CIVPTPMSRD
HMHFIDGAAG GYAVAARNLK ATLAGSVSQA GSLPSMIKP
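
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the 10-base blocks are separated only by whitespace, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons, which this sketch does not model). The helper names `clean` and `translate` are hypothetical and chosen for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence listed above.
first_blocks = "ATGGTGCAGC CGGCCGACAG GGAATTTTAC"
print(translate(first_blocks))  # prints MVQPADREFY
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` should reproduce the full protein, since the final codon (TGA) is a stop codon.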