Gene Smed_4718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4718 
Symbol 
ID5318898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1240917 
End bp1242095 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content60% 
IMG OID640776516 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001313448 
Protein GI150376852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.440204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00516951 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCCAC GGATCCTCGT GACGGGAATT CCTGGTCACT ACACGCGCCT CGCCAACGGC 
GCGCAGGGAT TATCCGTCTC CTATTCGGAG CGGCAGAAAC AGCCCGAAAC GAAGGAGGAG
TTCCTGCAGG AGCTTCGCAA TATCAGCAAT ACCGGCAATT ATCTGATCGG CGAGGGGGCA
CTGCGGGCGA TTGCCCCGCA TGCGAAGCAG GTGCCGTTTT GGCACCTTTA CAATTGCAGC
CAGAACGGCG TCGGCCTCGA GGAGTTCAAT GCCAATTTCG ACATCTGCGT GTTCACCTGC
GCGAACCTTT TGCGAAAGGG CCTGTCAGCG GATGCCGAGG CGGAAGTGCT GGGCAAGCTC
AAAATGCCGA TCGTCATGCT CGGCATCGGG CTGCAGAACC GGCGGGACCT GGAAAACAGC
CTTCCGGAAG GCACGAAGCG GCTTCTCGAC GTTCTTAAGG AGCGCGAGCA CTATTTCCTG
ACGCGTGGCT TCGAGACGGC AGGCTTCCTC AAGGATCAGG GTTTCTCCTA CGTCCAGCCG
ACCGGATGCC CTTCCATCTA TCTGATGCCG CACAATATGC GCGCCTCCCT GAAGAAGCTG
CCAAAGGTAC CGGTGGGCAA GGCGCGGACG ATCTTTTCCG GTTATCTAGG TGCCAATCAC
GACTGCATCG TCGATGCCGC GGCACTGGCG CCGGAGGGTT CGCGTCCCCA ATACGTTATT
CAGGACGAAT TCCTTCACTT CGACATGAAC GTGGAAGCGA ACGGCGATGG ACGGGTGTAC
GATTCCGCCT CGGGAGTGAT GCTCGGTGAG CTGAGTTATC CGGGCACGGA ACGGCTGAAG
ACGCCCTTCG ACGTCCGTAC CTTCTTCGAC ACGAACCAGT GGCGCGCCTG GGCTTCTTCC
ATGGATTTCA ATTTCGGCCG ACGCTTCCAC GGCTCGATTA TCGCCATGCA GGCAGCCGTG
CCTAGCCTGA TGGTGGCAGT AGATGACCGG ATGCGCGAGA TGCTCGGCTA TACCGGGCTG
CCGGCGATCG ACGCCGTCGA GGTCGACAAG GCGGATAACC GGGCTGAATT CGTCGCCGAC
CACCTGGCCG GACTGAACGC ATCCGAACTG GTCGACAGAT ATTCCGATCG CGAGCGCACG
TTCCGCTCGG CGCTCAGAGA GATCGGAATA GGTCAATAG
 
Protein sequence
MRPRILVTGI PGHYTRLANG AQGLSVSYSE RQKQPETKEE FLQELRNISN TGNYLIGEGA 
LRAIAPHAKQ VPFWHLYNCS QNGVGLEEFN ANFDICVFTC ANLLRKGLSA DAEAEVLGKL
KMPIVMLGIG LQNRRDLENS LPEGTKRLLD VLKEREHYFL TRGFETAGFL KDQGFSYVQP
TGCPSIYLMP HNMRASLKKL PKVPVGKART IFSGYLGANH DCIVDAAALA PEGSRPQYVI
QDEFLHFDMN VEANGDGRVY DSASGVMLGE LSYPGTERLK TPFDVRTFFD TNQWRAWASS
MDFNFGRRFH GSIIAMQAAV PSLMVAVDDR MREMLGYTGL PAIDAVEVDK ADNRAEFVAD
HLAGLNASEL VDRYSDRERT FRSALREIGI GQ