Gene Smed_1880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1880 
Symbol 
ID5322738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1950684 
End bp1952780 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content61% 
IMG OID640790817 
Producthypothetical protein 
Protein accessionYP_001327549 
Protein GI150397082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.384763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATG TCGCTACGCT CGGACTGCAG GTTGAAAGTG GCTCCGTCGA GAAGGGGGCC 
GACGCCCTCA ATCAGCTGAC GGGAGCGGCC GCCCGCGCAG AAGCAGCCGC AAACGGGCTG
TCTGGGGCAA ATCGTGGAGC AACTGGTGCG GCTTCTGCTG CCGCAAAGGC TTACGCCGCC
GAGGGGGCGG CCGCCGCGTC GGCATCGAAA CAGATCGAGA TGATGAACCG CGCGGCCAAT
CAGAACCGCG CATCGTCGCG CGGCAATCTT GGAAATATAG CCGCTCAATT CCAAGACATT
GCCGTCAGTG CGCAAATGGG GATGGGTCCG CTGCAAATTG CCCTTCAGCA AGGCACGCAG
TTGGCCGCGG TGCTTTCATC CATGGAGAGG CCGGTCCAGG GACTAGGTGC GGCCTTTTTG
TCGGTGCTCT CGCCTGTCAG TCTCCTTACG ATCGGCATAA TCGCGCTGGC AGCCGCTGGC
CTGCAGACGG TTGATTGGGC AAAGCTCGCT CAATCGGCGC TGATCGCCTT GGCGGACGTT
CTCGAAACGA TCGCACCGTA CGCCGTCGCA GCTGCGGCGG CGCTAGCGTT GATCTATGCG
CCCGCGATCG TTGGCGGCAT CATCTCGTTG ATCGCGCTGC TCGGTCGATT GGTAGTCCAG
CTTGGTATTG TCGCGGGAGC TTTCATTCTG GCGAACCCTG CCGTCGCATT CGTCGCCGGT
ATCACGGCGG CGATAGCGGC GGCCAACATC TTCCGCGACG AACTCGCACA GATCTTCGGG
CGAGATATCG TGCAAGATGC CAAGAACGGC GTGAACTTTG TCATTGGTGC ATTCGTTGGC
GCATACGAAG CGATCAAAGC CACGTGGTCG CTGCTGCCGA GCGCACTTGG CGACATCGTG
TATTCGACGG CTCAGAACGT GATTGATGGC ATCGAGAGCA TGGTACAGAC CGCTATCGAC
GCTTTGAACA ATTTGACTAA TAAATACGCA CTATGGACCG CGTCCATCGG CAAGCCACTC
AGTCCAGAAG CTTACAACAA TATGATCCTT GGACCGGTTG AGTTCGGCAG CATCAGCAAT
CCTTATAAAG GTTCGGCGGG TGCAGCGGCG AAAGCCGCCA AAGCGGCCTT TGCGGACGCC
CAAGGCACTG ACTTCGCCGG CGAGGGTCTC CGCATCATCG GCGAGTACGC GTCGACGGCG
GCCGGAAAAA TCAAGGAGCT CGTCAAGGGC CTCATCGAAG TCGACGAGAA ATCGAAGAAG
CGCACCGGCG GCAAGAGCGA GCAGGAGAAG TACGCCGACA TCGTTGCAGG AGCCGAGCGC
CAGATCGCAG CGCTTGAGGC GGAGCGTGAT GCTATCGGGC TCACGGAGCA GGCGGCAGCC
GCGCTTCGCT ACGAGACGCA GCTTCTCAAC GAGGCCCAGC AGCGCGGCAT CTCCCTCACA
GATGCACAGA AGAGCGAGCT TTCGTCCCTT GCACGGGTCA TGGCCTCGAT CGAGGAAGAG
ACCCGCCAGA TGGGTGTCGC GCTCGATTTT GCTAAAGAAG TAACCGGAGG CTTCTTCGAT
GACTTCTTCG CGGGAATTGA GAACGGCAAA TCGGTATGGG AGTCTTTCGG CGACGCGGCT
TTGGGGGTGC TCGACCGCAT CGCCGACAAA CTGCTGAACG ACGTCCTCGA TGCCGTGTTT
CAGGTCAGCG GCGCCGGCGC AGGGGCGGGC GGAGGAGGAC TCCTCAGTTG GCTCTTCGGC
GGGGGCCCAA AGGTGGACCC GTGGGCTGGG CTGCGCGGGT ATGCGAACGG AACGAGCTCC
GCTCGACCTG GCGTCGCATG GGTTGGTGAA AAGGGGCCGG AGCTCGTCCG TTTCAAGGGT
GGCGAGGAGG TCATTCCGAA CCATCACCTT CAACGACCGG CCAATGGGAA CGTGGCGCCA
TCGGGCGGTC AGCTAAACCA GAATGGGCCG CGCGAGATCA TCCTTCGGGT GATTGCTGAG
GAGGGGCCGA TGTTCAGGCC CGTCATTCGG TCGGAGAGCC GAGGCGTCTC CGTCGAGACC
GTAAAACAGT ATGACGTGGC GAAGGCAAAC ATCTACCAAA ACGGCGAAGA CCGCTAA
 
Protein sequence
MADVATLGLQ VESGSVEKGA DALNQLTGAA ARAEAAANGL SGANRGATGA ASAAAKAYAA 
EGAAAASASK QIEMMNRAAN QNRASSRGNL GNIAAQFQDI AVSAQMGMGP LQIALQQGTQ
LAAVLSSMER PVQGLGAAFL SVLSPVSLLT IGIIALAAAG LQTVDWAKLA QSALIALADV
LETIAPYAVA AAAALALIYA PAIVGGIISL IALLGRLVVQ LGIVAGAFIL ANPAVAFVAG
ITAAIAAANI FRDELAQIFG RDIVQDAKNG VNFVIGAFVG AYEAIKATWS LLPSALGDIV
YSTAQNVIDG IESMVQTAID ALNNLTNKYA LWTASIGKPL SPEAYNNMIL GPVEFGSISN
PYKGSAGAAA KAAKAAFADA QGTDFAGEGL RIIGEYASTA AGKIKELVKG LIEVDEKSKK
RTGGKSEQEK YADIVAGAER QIAALEAERD AIGLTEQAAA ALRYETQLLN EAQQRGISLT
DAQKSELSSL ARVMASIEEE TRQMGVALDF AKEVTGGFFD DFFAGIENGK SVWESFGDAA
LGVLDRIADK LLNDVLDAVF QVSGAGAGAG GGGLLSWLFG GGPKVDPWAG LRGYANGTSS
ARPGVAWVGE KGPELVRFKG GEEVIPNHHL QRPANGNVAP SGGQLNQNGP REIILRVIAE
EGPMFRPVIR SESRGVSVET VKQYDVAKAN IYQNGEDR