Gene Smed_5009 details

Gene Information

Locus tag: Smed_5009
Symbol:
ID: 5318658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1527046
End bp: 1530087
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 63%
IMG OID: 640776791
Product: AsmA family protein
Protein accession: YP_001313723
Protein GI: 150377127
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
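
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1527046, 1530087)
print(length_bp)                  # 3042
print(protein_length(length_bp))  # 1013
```

Both results agree with the Gene Length (3042 bp) and Protein Length (1013 aa) fields.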


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0536968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCGCG TCATACTTTG TGTACTGGGT GCTGCCGCGC TCGCGGTCAC CGCAATTGCC 
ATTCTGCCCA GCCTGATTTC CAGCGATTGG ATGCGTGCCG AACTCGGCAG GCAATTGTCG
TCCGCGACCG GCAGCTCGAT CGCGTTCAAC GGCCCCGTCA AGTTCTCCGC CTTCCCCTCT
CTGGCGGTCG TGGCCAAAGA TGTGACGCTT TCCGCGGAGG CTCAAGGGGT GACTGCGGAG
TTTGCGGAGG TCACCGGCTC GGTGGCATTT TCCTCCCTTT GGGCGGATCG GCTGCGCATC
GAGGAGATCA GTGTGGATCG GCCGGTCATA ATTCTTGACG AAAAGGCGGC GGATGAAACG
ACGCCGGTAC AAGAAGGCAG CGGAGAGGCC CGCTCGAGCG ACCCGCTTCC CGATCTCGCC
GCGCTGCTCG AACGAAGCGC CATCGACTCG GTCTCGATCA CCTCCGGCAC GTTCATACGG
CGAAGCGCAG CACGCCGCGA AGAGGTCGTT TCCGATGTCG AGCTGACGCT ATCTATTCCC
GATATCGATG ACACCTTCTC TCTTTCCGCG TCGGCCCGCA TGGGGGAACT GACCTACGAG
GCCAGCTTCA CAATATCGAC CTTGCGTTCC TTGCTGGGAC GCCAGCCTGC GGACCTCGAC
CTGACGGTCG AGGCAGACCC CGCTCCCGCG CCGGGGCTGA CCAGCCTTTC GGCCAGCGGA
CAAGTGACCC TGAACGCGAA TGGCAGCTAT CAGATCCGCG GCGGTAAAGT CGAAACCGGG
GAACAGGCTT TCGGGCTGAA CGCGCTCTTC GTGCCCGGCA AGCGTGCGCG TTTCCTTGCC
GATCTCGACG CGGACAGGCT CGACCTGACG CCGTTCGTCG ATACCACGCC GGCAGCGTCG
CCGTCGAAAG TGCTCTCTAC AGGCACGAAA ACGGACGCCG GCAATCGGGT AGGCTTGCAG
TTCCTGGCCA GCTTCGATGC CGATGTGAGC ATCAACGCGG CTGCTATCAC CTTCGGCGAG
ATGTCCGCGT CGGATGTTTC CGTCTCGGCC GAACTCAAGG ACGGAAACTT GGCAGTGGAA
CTCGGACAGT TGGGCATCGA TGCCGGTGTC GTGACCGCAG ACGTCGCTAC GGACGTGCGC
TCCGGCGATC CTGTCTTCCG GGGCCGTCTC TCCAGCGAGG GTCTCGACAT AGGCAGTCTT
TCAGCCCTCG CGGATCGATC CGTCCCGGTT TCCGGAGCCC TGACCATCGA CGCCGATTTC
GCCTTTCGCG GTCTTGACGC GGACTCCATC CGAAAGACGA TCGATCTGAC CGGGACGGTC
GGCCTGCGCG ATGCAAGCTT CGCACTGCCC CTTGCCGATG AAGCGATGCA GGAGGTGAAG
GCGAAGAAGA TCTCGGCCCG GATCGAAAGC CTTCGCAAAC CGGTGCGCTT GGAGGGCCGG
CTGGAATGGC GCAACAAGCC GGTTGACGTG GATCTGCAAA TTCCCGCTGC AGATACTCTG
CTGACGGGCG ATCTCGCCTC CCAGGGCATC CCGCTCAAGC TCCAAATGCA GATGCCGGAC
GCACGCCTGT CTCTCGATGG CAGAGCGGCA CTGCCGGGGA CCTATGCCGG AAGGCTCGAT
TTTTCGGTTG CCGACCTCTC TGCCTTCCTT TCGGGCTTCG GGCAGAGCGG TGCGGAGAGC
ATCGGCCCGC TGGCGTTCAG CGGCAACGTC GTTTCAGGCG CGCACGGCGT TTCTTTCGAC
AAGGCCTCCG TTTCCGTCAA CGGCATCGAA GCCAAGGGCA ACGGCTCCGT GGAATTCGCG
ACGCCGCTGA AGATCGAGAC GTCGCTAGAC TTCGCGGAAC TGGATCTTGC GCGGCTCGCG
GGAGCGGGCG CACCCGCGCC TCAGCCGAAA GCCAAAGCAA AAGCGAAGGC GGTTATAGGC
GCCGACGTTC CGCTGGATCT TTCATCTCTC CGATCGGTCG ATGCGACGAT CGGGATCAAT
GCCGAAAAGC TCGGTTATGG CAGAGTCTTT GCCGGGCCGG TCACCACCAT GCTGGTCGTC
TCCGAAGGCG TCGCAAGCCT GACGCTGCCT GAAAGTCCGT TTTACGGCGG ACATGTTGTC
GCGAGCCTGA AGGCAGACGG CTCGCAGCCG GAGCCATCTA TCGCCTTCCA GGCCTCCATC
TCGCGCGCTT CGTCCGCGCC GTTGCTGACC GACATGGCGG GTTTCAGATA TCTCGAGGGC
GCATTGAACG CCCGGTTCGA CGTGACCGGT GCGGGCGGCA CGACCAAGAC GCTCGCGAAA
TCGCTGCAGG GCACCGCGGA GGTAACCTTC GCGGATGGCG CCATTCGCGG CATCGACATC
GCCGATGTCT ACAACAACCT CGTGCGGCTG ATGTCGTCCG GCTTCAAACA GGACGACAGC
AAAGCGACCA CCTTCACCGA ACTTGGGGCC TCCTTTGCCA TCGAGGGTGG CGTCGCGCGC
ACCGAGGATA TTAAGCTTGT CGGCCCGCTG GTCCGCATGA CCGGGAAGGG CGCGGCCGAC
CTCGCGCAAA GCACCCTGAA CTTCCGGCTG GAACCCCGCA TCGTCGCCTC GCTTCAAGGC
CAGGGCGCCG AAATCTCCAC CGACGGCGTC GGCGTCCCCG TCGTGGTCGA AGGCAGCTTC
GCCGCACCTC GCATCTACCC GGACCTTTCG GACTTGCTGA AGAATCCCGA TGCCGCACTG
GCAAAGCTCA AGGATTTCGG ACTGCCGATC GACAAGTTGC CGATTGGCGA CCTCCTTAAC
GGAGACGGCG CCGGCGCAGC CGTTAAGGAT TTGCTCGGCG GCACGCTGGA CGAGGCGCTA
AAAGACAAGG CTTTCAAACA AGAAGAACAG CAGCTCTCGA TCGAGGAGAT CATCGGTGGC
GATCCTGCAC CCAAGCCCGA CGAGGCGGCT GCAGAAGTCC CCCCAAACGA GGAAGCGGCC
AAAGCCGACG CACCAACCGA ACAACCGGCA AACGGAACGG AAGAGCCAAG TGAGGAGGCC
GATGGTCCAA TGGAGGGTTT TTTCAAGCAG CTCCTGCGGT AG
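
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch of that clean-up; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGCGGCGCG TCATACTTTG TGTACTGGGT GCTGCCGCGC TCGCGGTCAC CGCAATTGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full sequence, the length should match the Gene Length field and the GC fraction should be close to the reported GC content.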
 
Protein sequence
MRRVILCVLG AAALAVTAIA ILPSLISSDW MRAELGRQLS SATGSSIAFN GPVKFSAFPS 
LAVVAKDVTL SAEAQGVTAE FAEVTGSVAF SSLWADRLRI EEISVDRPVI ILDEKAADET
TPVQEGSGEA RSSDPLPDLA ALLERSAIDS VSITSGTFIR RSAARREEVV SDVELTLSIP
DIDDTFSLSA SARMGELTYE ASFTISTLRS LLGRQPADLD LTVEADPAPA PGLTSLSASG
QVTLNANGSY QIRGGKVETG EQAFGLNALF VPGKRARFLA DLDADRLDLT PFVDTTPAAS
PSKVLSTGTK TDAGNRVGLQ FLASFDADVS INAAAITFGE MSASDVSVSA ELKDGNLAVE
LGQLGIDAGV VTADVATDVR SGDPVFRGRL SSEGLDIGSL SALADRSVPV SGALTIDADF
AFRGLDADSI RKTIDLTGTV GLRDASFALP LADEAMQEVK AKKISARIES LRKPVRLEGR
LEWRNKPVDV DLQIPAADTL LTGDLASQGI PLKLQMQMPD ARLSLDGRAA LPGTYAGRLD
FSVADLSAFL SGFGQSGAES IGPLAFSGNV VSGAHGVSFD KASVSVNGIE AKGNGSVEFA
TPLKIETSLD FAELDLARLA GAGAPAPQPK AKAKAKAVIG ADVPLDLSSL RSVDATIGIN
AEKLGYGRVF AGPVTTMLVV SEGVASLTLP ESPFYGGHVV ASLKADGSQP EPSIAFQASI
SRASSAPLLT DMAGFRYLEG ALNARFDVTG AGGTTKTLAK SLQGTAEVTF ADGAIRGIDI
ADVYNNLVRL MSSGFKQDDS KATTFTELGA SFAIEGGVAR TEDIKLVGPL VRMTGKGAAD
LAQSTLNFRL EPRIVASLQG QGAEISTDGV GVPVVVEGSF AAPRIYPDLS DLLKNPDAAL
AKLKDFGLPI DKLPIGDLLN GDGAGAAVKD LLGGTLDEAL KDKAFKQEEQ QLSIEEIIGG
DPAPKPDEAA AEVPPNEEAA KADAPTEQPA NGTEEPSEEA DGPMEGFFKQ LLR
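
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene begins with ATG). The `translate` helper is illustrative and not part of any database tool.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace allowed) and drop one trailing stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence (30 bp, 10 codons).
print(translate("ATGCGGCGCG TCATACTTTG TGTACTGGGT"))  # MRRVILCVLG
```

The output matches the first block of the protein sequence listed above.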