Gene Information |
Locus tag | Smed_5009 |
Symbol | |
ID | 5318658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1527046 |
End bp | 1530087 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776791 |
Product | AsmA family protein |
Protein accession | YP_001313723 |
Protein GI | 150377127 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
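The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, though both agree with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values copied from the record for Smed_5009.
start_bp, end_bp = 1527046, 1530087
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the reported gene length (3042 bp) and protein length (1013 aa).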
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0536968 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGCGCG TCATACTTTG TGTACTGGGT GCTGCCGCGC TCGCGGTCAC CGCAATTGCC ATTCTGCCCA GCCTGATTTC CAGCGATTGG ATGCGTGCCG AACTCGGCAG GCAATTGTCG TCCGCGACCG GCAGCTCGAT CGCGTTCAAC GGCCCCGTCA AGTTCTCCGC CTTCCCCTCT CTGGCGGTCG TGGCCAAAGA TGTGACGCTT TCCGCGGAGG CTCAAGGGGT GACTGCGGAG TTTGCGGAGG TCACCGGCTC GGTGGCATTT TCCTCCCTTT GGGCGGATCG GCTGCGCATC GAGGAGATCA GTGTGGATCG GCCGGTCATA ATTCTTGACG AAAAGGCGGC GGATGAAACG ACGCCGGTAC AAGAAGGCAG CGGAGAGGCC CGCTCGAGCG ACCCGCTTCC CGATCTCGCC GCGCTGCTCG AACGAAGCGC CATCGACTCG GTCTCGATCA CCTCCGGCAC GTTCATACGG CGAAGCGCAG CACGCCGCGA AGAGGTCGTT TCCGATGTCG AGCTGACGCT ATCTATTCCC GATATCGATG ACACCTTCTC TCTTTCCGCG TCGGCCCGCA TGGGGGAACT GACCTACGAG GCCAGCTTCA CAATATCGAC CTTGCGTTCC TTGCTGGGAC GCCAGCCTGC GGACCTCGAC CTGACGGTCG AGGCAGACCC CGCTCCCGCG CCGGGGCTGA CCAGCCTTTC GGCCAGCGGA CAAGTGACCC TGAACGCGAA TGGCAGCTAT CAGATCCGCG GCGGTAAAGT CGAAACCGGG GAACAGGCTT TCGGGCTGAA CGCGCTCTTC GTGCCCGGCA AGCGTGCGCG TTTCCTTGCC GATCTCGACG CGGACAGGCT CGACCTGACG CCGTTCGTCG ATACCACGCC GGCAGCGTCG CCGTCGAAAG TGCTCTCTAC AGGCACGAAA ACGGACGCCG GCAATCGGGT AGGCTTGCAG TTCCTGGCCA GCTTCGATGC CGATGTGAGC ATCAACGCGG CTGCTATCAC CTTCGGCGAG ATGTCCGCGT CGGATGTTTC CGTCTCGGCC GAACTCAAGG ACGGAAACTT GGCAGTGGAA CTCGGACAGT TGGGCATCGA TGCCGGTGTC GTGACCGCAG ACGTCGCTAC GGACGTGCGC TCCGGCGATC CTGTCTTCCG GGGCCGTCTC TCCAGCGAGG GTCTCGACAT AGGCAGTCTT TCAGCCCTCG CGGATCGATC CGTCCCGGTT TCCGGAGCCC TGACCATCGA CGCCGATTTC GCCTTTCGCG GTCTTGACGC GGACTCCATC CGAAAGACGA TCGATCTGAC CGGGACGGTC GGCCTGCGCG ATGCAAGCTT CGCACTGCCC CTTGCCGATG AAGCGATGCA GGAGGTGAAG GCGAAGAAGA TCTCGGCCCG GATCGAAAGC CTTCGCAAAC CGGTGCGCTT GGAGGGCCGG CTGGAATGGC GCAACAAGCC GGTTGACGTG GATCTGCAAA TTCCCGCTGC AGATACTCTG CTGACGGGCG ATCTCGCCTC CCAGGGCATC CCGCTCAAGC TCCAAATGCA GATGCCGGAC GCACGCCTGT CTCTCGATGG CAGAGCGGCA CTGCCGGGGA CCTATGCCGG AAGGCTCGAT TTTTCGGTTG CCGACCTCTC TGCCTTCCTT TCGGGCTTCG GGCAGAGCGG TGCGGAGAGC ATCGGCCCGC TGGCGTTCAG CGGCAACGTC GTTTCAGGCG CGCACGGCGT TTCTTTCGAC AAGGCCTCCG TTTCCGTCAA CGGCATCGAA GCCAAGGGCA ACGGCTCCGT GGAATTCGCG 
ACGCCGCTGA AGATCGAGAC GTCGCTAGAC TTCGCGGAAC TGGATCTTGC GCGGCTCGCG GGAGCGGGCG CACCCGCGCC TCAGCCGAAA GCCAAAGCAA AAGCGAAGGC GGTTATAGGC GCCGACGTTC CGCTGGATCT TTCATCTCTC CGATCGGTCG ATGCGACGAT CGGGATCAAT GCCGAAAAGC TCGGTTATGG CAGAGTCTTT GCCGGGCCGG TCACCACCAT GCTGGTCGTC TCCGAAGGCG TCGCAAGCCT GACGCTGCCT GAAAGTCCGT TTTACGGCGG ACATGTTGTC GCGAGCCTGA AGGCAGACGG CTCGCAGCCG GAGCCATCTA TCGCCTTCCA GGCCTCCATC TCGCGCGCTT CGTCCGCGCC GTTGCTGACC GACATGGCGG GTTTCAGATA TCTCGAGGGC GCATTGAACG CCCGGTTCGA CGTGACCGGT GCGGGCGGCA CGACCAAGAC GCTCGCGAAA TCGCTGCAGG GCACCGCGGA GGTAACCTTC GCGGATGGCG CCATTCGCGG CATCGACATC GCCGATGTCT ACAACAACCT CGTGCGGCTG ATGTCGTCCG GCTTCAAACA GGACGACAGC AAAGCGACCA CCTTCACCGA ACTTGGGGCC TCCTTTGCCA TCGAGGGTGG CGTCGCGCGC ACCGAGGATA TTAAGCTTGT CGGCCCGCTG GTCCGCATGA CCGGGAAGGG CGCGGCCGAC CTCGCGCAAA GCACCCTGAA CTTCCGGCTG GAACCCCGCA TCGTCGCCTC GCTTCAAGGC CAGGGCGCCG AAATCTCCAC CGACGGCGTC GGCGTCCCCG TCGTGGTCGA AGGCAGCTTC GCCGCACCTC GCATCTACCC GGACCTTTCG GACTTGCTGA AGAATCCCGA TGCCGCACTG GCAAAGCTCA AGGATTTCGG ACTGCCGATC GACAAGTTGC CGATTGGCGA CCTCCTTAAC GGAGACGGCG CCGGCGCAGC CGTTAAGGAT TTGCTCGGCG GCACGCTGGA CGAGGCGCTA AAAGACAAGG CTTTCAAACA AGAAGAACAG CAGCTCTCGA TCGAGGAGAT CATCGGTGGC GATCCTGCAC CCAAGCCCGA CGAGGCGGCT GCAGAAGTCC CCCCAAACGA GGAAGCGGCC AAAGCCGACG CACCAACCGA ACAACCGGCA AACGGAACGG AAGAGCCAAG TGAGGAGGCC GATGGTCCAA TGGAGGGTTT TTTCAAGCAG CTCCTGCGGT AG
Protein sequence | MRRVILCVLG AAALAVTAIA ILPSLISSDW MRAELGRQLS SATGSSIAFN GPVKFSAFPS LAVVAKDVTL SAEAQGVTAE FAEVTGSVAF SSLWADRLRI EEISVDRPVI ILDEKAADET TPVQEGSGEA RSSDPLPDLA ALLERSAIDS VSITSGTFIR RSAARREEVV SDVELTLSIP DIDDTFSLSA SARMGELTYE ASFTISTLRS LLGRQPADLD LTVEADPAPA PGLTSLSASG QVTLNANGSY QIRGGKVETG EQAFGLNALF VPGKRARFLA DLDADRLDLT PFVDTTPAAS PSKVLSTGTK TDAGNRVGLQ FLASFDADVS INAAAITFGE MSASDVSVSA ELKDGNLAVE LGQLGIDAGV VTADVATDVR SGDPVFRGRL SSEGLDIGSL SALADRSVPV SGALTIDADF AFRGLDADSI RKTIDLTGTV GLRDASFALP LADEAMQEVK AKKISARIES LRKPVRLEGR LEWRNKPVDV DLQIPAADTL LTGDLASQGI PLKLQMQMPD ARLSLDGRAA LPGTYAGRLD FSVADLSAFL SGFGQSGAES IGPLAFSGNV VSGAHGVSFD KASVSVNGIE AKGNGSVEFA TPLKIETSLD FAELDLARLA GAGAPAPQPK AKAKAKAVIG ADVPLDLSSL RSVDATIGIN AEKLGYGRVF AGPVTTMLVV SEGVASLTLP ESPFYGGHVV ASLKADGSQP EPSIAFQASI SRASSAPLLT DMAGFRYLEG ALNARFDVTG AGGTTKTLAK SLQGTAEVTF ADGAIRGIDI ADVYNNLVRL MSSGFKQDDS KATTFTELGA SFAIEGGVAR TEDIKLVGPL VRMTGKGAAD LAQSTLNFRL EPRIVASLQG QGAEISTDGV GVPVVVEGSF AAPRIYPDLS DLLKNPDAAL AKLKDFGLPI DKLPIGDLLN GDGAGAAVKD LLGGTLDEAL KDKAFKQEEQ QLSIEEIIGG DPAPKPDEAA AEVPPNEEAA KADAPTEQPA NGTEEPSEEA DGPMEGFFKQ LLR
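The sequences are printed in space-separated blocks of ten characters, so they need cleaning before any computation. As a minimal sketch, the helpers below strip that whitespace and compute a GC fraction of the kind reported in the GC content field (63%). The function names are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, for illustration only.
example = "ATGCGGCGCG TCATACTTTG"
print(len(clean_sequence(example)), round(gc_fraction(example), 2))
```

Passing the full gene sequence field to `gc_fraction` and rounding to a whole percentage is one way to compare against the reported value.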