Gene Smed_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1474 
Symbol 
ID5322332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1558074 
End bp1561460 
Gene Length3387 bp 
Protein Length1128 aa 
Translation table11 
GC content63% 
IMG OID640790422 
Producthypothetical protein 
Protein accessionYP_001327154 
Protein GI150396687 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.105081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0154551 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA TCCGCGGCGA AAGAGTGGAC TTTCGCAGGG AAGACATCGT CGCGCTGCAC 
GCTTTGCCCT CGGCTCAAGC TCACGACCCG GTCATCGTGC ACACGCCGCG GCCTGGCGGT
GCCTGGCGCC TGTGCGGAAG GATTCTGCTC TGCTGCTCGC TGCTCGTCTT CATCGCCGTT
GCTTCGCTTA TCGCCATAAT AGAAAGCGGA ATTGTAGACG GGCCGCTGAA TGCCAGGGCC
AGGACGGCGC TCAACACCGC ACTCGGACAG GATTATAGCG CCGATGTCGA GAGCACGGTG
ATCCGGCTGA CCGGCGGCGG CGCACTTGCG CTCAAGGCGC GGGGCGTGAC GCTGAAGGAG
CGCGGATCCG GCCGGCATCT CGCCAAGCTC GGTGCAATTT CGATCGCCCT CGATCCATTT
GCCCTGGCGA CCGGCCGCAT CAATGTCTCG AGGCTTGAAG CGGAGGGTGG CGAGCTCGAC
ACCGGGCTCC TGCCGCGCGG CGAGCCCATC GATCTTACAG CCATCCGCAT AGCGGACGTA
GGCACTGCCC TCGAAGAATT GTTCGCACAG GGCGATCGGA TGTCGCGCTT GACCGCCGGA
CGGTCGACCC AGACCGTCGT CCTCTCTGAC TTCAGCCTAA CGGTCAGCGG CACGCGTGGG
CGGGCCGTTC CCGTCGAAAT CAAGACACTG CAATTCAGTC ACGACCCCGA CAGTTCGATG
CGAGTTGAGG GCACGATTGC GGTTGACGGC ATCGAATCTC AGCTTACGGC AAAGGCTCTC
GGCGACAGGG GGCGCATCGC CGCCTTCGAG GCCGGGCTCG ACGCGCTGCC GCTTTCGCCG
TTCCTCCATC ACGGCAAGTC GGGCAATGAG GAGGCCTTCG GCATCGAAGC CACCGCCAAT
GTGACGCTCA AAGCCGCCCG CGCTGCGGAT GGCACGAAGC CGGCACTCAC GGCGGCCGTG
AAGACGTCCA GGGGCTCCTT CCACGCCGGC GGCCTCGCCT CCAAACTCAA CTCCGCGGAA
CTGAACGTCT CCTACGATTT CGAGCGAGCC TCGGTCGAGA TATTGTCGTC GATGGTCAGG
ATCGGCCGGT CGAGCTTCCC CTTCACGGGA GCCCTGATCG ATCTCGACAA GATTGCCGGC
GCAGACCGGA AAGGATTCGC GGTCGATCTC CTGTTCAAGA ACGCCAGCTC CGACCCTGAG
GACATGCAGG CGCCCCCGCT TGCCTTCGAT GCCAAGGCAA GCGGGCGTTT CGAGTCCGAC
ACCCACCGGC TGATCTTCGA TCAACTCGCG ATATCGAGTC CGCTTGGCTC CATGGCGGGT
TCGCTTTCGG TCGCCTTCGG CAAGACGTCG CCGCGCATCA GTTTCGCTGC GGTCAGCGAC
AGGATGCACT CCAGCGCCGT CAAGCAGCTG TGGCCCTGGT GGCTGGCGAA GGGGGCTCGT
CGCTGGGCGT TAGGAAACCT CTTCGGAGGC ATGGTAAGCG ACGCACGCAT CGAGGTCTCG
ATTCCGGAGG GGCGTATTGC CAGTAGCGGC GGAGAATTGA GGCTCAACGA AAAGGAGCTC
AACATCAACT TTGCCGTCGA CGAGACGCGC ATCAACATCG CGGGAGAGAT ACCGCCGTTG
CGTGACACGG CCGGACGCTT CAGCCTGAGC GGCGAGCGGA TGTCCGTCGC CGTCGAGAAG
GGCGCCGCCT TCTTCCCGTC CGGTCGCTCC GTCGCCTTGA ATGGCGGGGA TTTCATCATT
GCCGACGTCT ATAAAAAGCC GCTGATGGCG GAAATGAAGA TCAAGGTCGC GGGCGAGGCG
GACGCGATAG CCGAGCTCGT CCGCTACAAA CCGATAGAGG CGCTTCGGAA GACGCCTTTC
ACTCCCGAGG ATTTTACCGG CCCGATGACG GCGCTGGTGG GTGCGCGGTT TGGTCTCATT
TCCGACCAGA AGCCGCCACG TCCACTGTGG CAGGTGGAAA TGGAGCTTGA GGACGTGACG
ATAAAGCGGC CCGTTGCAGG ACGTTCGATC GCCGATCTCG ACGGGACGAT GAGGATCGAC
AACGAGCGTG CAGTGCTCCA GGCGAACGCC CTAATCGACG GTGCGAAGAT GCGCGTCGCG
CTCACCGAGC CGGTTGGCAC CTCGGCCAAT GTGGCGAGAA CGCGCGAAAT CTCCGGAACG
CTCGACGACG CGGCGCGGGC GAAAATCGCT CCGGCCCTGT CCGGAATCGT CAGCGGGCCC
GTGGGCATAG ACGTTTCGCT TGCGGAAGAC GGCAGCCAGT CGGTCAAGGT TGACCTCGGA
AAGGCCGTGC TGTCGCTTCC CTGGATTGGC TGGAGCAAGG GCTCGGGCAT CCCAGCAAAG
GCGCAATTCA CAATTCGGGC GGCCGGTGGC ATCACGGAGA TAAACGACCT GCGGCTCACC
GGAGAAGGTT TTGGCGGCAA TGGCGAATTG CGCGTAGACG AATCCGGCCT TGCAGCGGCG
CGGCTGAGCG GCGTGCGCCT GGCAAGCGGC GACGATTTTT CCGTCACAGT GGGACGCAGC
AAGGGAGGCT ATTCGGTAAA CCTAACCGGC ACCGCAGCGG ATATCCGACC GGCGCTCGCT
CGCGTCAAGG GCGGCGCGAG TTCAAAGGAT GGCGGCAATG TGAAGATCAA GGCGCGGCTC
GACCGGGTCA CCGGTTTCAA CGGCGAAGTC CTGTCGAATG TTGATCTCAC CTATTCGAGC
CGCGGCCAGC AGATCGACGA TGTCAACCTT TCAGCGATAA CGGCAAGCGG CCAGGCAGTC
GTTGCCAGGT TGGTCAAGGC CGGTGCGGAC AACACGCTGG AACTGACGAC AAGCGATGCC
GGAGCCTTTG CACGCTTCAT CGACATCTAC CGCAATATGC GGGGCGGACT CCTCAATTTG
CGCCTGCGTG ATCGCGGTGC CAACTCGTGG CGCGGAACCG TGGACATCCG CAAGTTCTCG
CTGGTCGGCG AACAAAGGCT GCAATCGATG GTCTCGACCC GGGCGGGCCA GGACGGTCGC
AGTCTCAACG AAGCGGTTCG ACGCGATATC GACGTGAGCA CGGCTCAGTT CGAGCGTGGC
TTCGCGCAAC TCCTGCTGGA TCAGGGCGCA ATTCGGGTCG GCAGCGGAGT GGTCCGCGGT
ATCGATGTGG GCGCGACCTT CCAGGGAACT GTCCGCGACG CCAATGGTCG TATGGACATG
ACGGGTACGT TCATGCCGGC TTACGGATTA AACCGGCTCT TCGGGGAATT GCCGCTGATC
GGTGTCCTGC TCGGAAATGG GCGCGACCGG GGCCTGTTGG GGATCACGTT CAAACTCGCC
GGACCGTTCA GCCAGCCAAG TCTGACGATC AACCCGCTGT CGATCATAGC GCCGGGCGTC
TTCCGCAATA TCTTCGAGTT TCAATGA
 
Protein sequence
MSDIRGERVD FRREDIVALH ALPSAQAHDP VIVHTPRPGG AWRLCGRILL CCSLLVFIAV 
ASLIAIIESG IVDGPLNARA RTALNTALGQ DYSADVESTV IRLTGGGALA LKARGVTLKE
RGSGRHLAKL GAISIALDPF ALATGRINVS RLEAEGGELD TGLLPRGEPI DLTAIRIADV
GTALEELFAQ GDRMSRLTAG RSTQTVVLSD FSLTVSGTRG RAVPVEIKTL QFSHDPDSSM
RVEGTIAVDG IESQLTAKAL GDRGRIAAFE AGLDALPLSP FLHHGKSGNE EAFGIEATAN
VTLKAARAAD GTKPALTAAV KTSRGSFHAG GLASKLNSAE LNVSYDFERA SVEILSSMVR
IGRSSFPFTG ALIDLDKIAG ADRKGFAVDL LFKNASSDPE DMQAPPLAFD AKASGRFESD
THRLIFDQLA ISSPLGSMAG SLSVAFGKTS PRISFAAVSD RMHSSAVKQL WPWWLAKGAR
RWALGNLFGG MVSDARIEVS IPEGRIASSG GELRLNEKEL NINFAVDETR INIAGEIPPL
RDTAGRFSLS GERMSVAVEK GAAFFPSGRS VALNGGDFII ADVYKKPLMA EMKIKVAGEA
DAIAELVRYK PIEALRKTPF TPEDFTGPMT ALVGARFGLI SDQKPPRPLW QVEMELEDVT
IKRPVAGRSI ADLDGTMRID NERAVLQANA LIDGAKMRVA LTEPVGTSAN VARTREISGT
LDDAARAKIA PALSGIVSGP VGIDVSLAED GSQSVKVDLG KAVLSLPWIG WSKGSGIPAK
AQFTIRAAGG ITEINDLRLT GEGFGGNGEL RVDESGLAAA RLSGVRLASG DDFSVTVGRS
KGGYSVNLTG TAADIRPALA RVKGGASSKD GGNVKIKARL DRVTGFNGEV LSNVDLTYSS
RGQQIDDVNL SAITASGQAV VARLVKAGAD NTLELTTSDA GAFARFIDIY RNMRGGLLNL
RLRDRGANSW RGTVDIRKFS LVGEQRLQSM VSTRAGQDGR SLNEAVRRDI DVSTAQFERG
FAQLLLDQGA IRVGSGVVRG IDVGATFQGT VRDANGRMDM TGTFMPAYGL NRLFGELPLI
GVLLGNGRDR GLLGITFKLA GPFSQPSLTI NPLSIIAPGV FRNIFEFQ