Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4786 |
Symbol | |
ID | 5318408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1303585 |
End bp | 1305144 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640776582 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001313514 |
Protein GI | 150376918 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.419568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG GTGCGGATGC ACGCTTTTAT TTTTCAATCC TTCTGAGGAG ATTGCCGTAT CTGGTGGCGA TCGTCGGTTC CGTGATTGCG CTTACCGTCA TTGTCGCTAG CATTTTACCG CCCCGTTATC GTGCGAGCGC CAAAATTCTC GTAGAGGCTC CGCAAATTCC TGTGGAACTG GCGCGATCGA CTGTCCCAAT ACAGGCGGCC CAGCAATTGC AGATTATTCG GCAGCAGATC ACGACGCGCG ACGACCTTCT CGCGCTTGCC GACATGCTCG ACATTTACGG CAAAGAAGAA GACGAACTGT CCAAGGATGA CATCGTAGAT AATATGCGCT CCCGCATCAC ATTCGAGGAA CTCGCGTTGA GCGCGCCATA CGGTGATACT GGTGCCTCTG TGGCAAGCGT GAGCTTCACC GCGGCAGATC CTGATCTTGC GGCCAGGGTC GCTAACGAGC TTGTCGACTT TATTCTGCTG AAGCAGCAGC AACAACGCAC CAGTCGTGCC GCGGACACGG TCAAGTTCTT TGATCAAGCG GTCGCGAGAC TTGGCACGGA TCTGAGTAGG GCCGAACTCG AAATTCTAAG ATACAAGAAC GAACATGCAG ACACTCTTCC CGAAAGTCTT GATTTTCGCC GCAGTCAACA AACAGGCCAG CAACAGAGAT GGATCACGCT CGAGCGCGAA GAGTCGGATC TCCGGGCTAA GCGGAGCACC CTTGTCGAGA GTTACGTCCT TGGTGGCCAA GCTCCCGATG GTAAGGCGGC GACACCGGAA CAGCTGGCCC TGCAAGAGCT GACGCGTGCG CTTGCCGAGC AGCGTGCGAT TTTCTCGGAG AACAGCCCCA ATATAATGGC CCTTCGCGGC CGCATTGCTT CATTGCAGGC CACATTGCGC ACGACGCAGA CAAGCGAAGC TAGTTCGGCC CAGGACAGGG TCGCGCGTTC CCCGCTGGAC CAACAGTTGG CGTATATCGA CGAGCGCTTG CGCGCGATTG TCGGGGAGAA GGCCGCAATC ACCGATCGCA TAGACGAACT GAGCAAATCA ATCAGTGCGA CGCCGGAAAG TGAAACCGTT CTTTATTCGT TCGAGCGCGA CCGGGCAAAT CTTCAATCAC AATACAATAC TGCGATAGCC CGACGCGCCG AGGCGATCAT CGGCCAGCAG ATCGAGAGGC GGTCCGACGG GAGTAGTTTC TCAGTGCTTG AGCGCGCGAC CGCACCTGAG ATGGCGGAGA GCCCAAATCG CCGCCGTATC GTGCTCCTTG GCGCGCTGGC CGGAACGGCT CTCTCCGTGG CCTTCATCGG GCTGCTTGAG TTTTTCAACG CGGCCATACG CAGACCCAAT GAACTTGCGC GGCTGCTCGA CCGTCAGCCG CTTGCCACTA TACCGTATAT TTCGACTGTA GCCGAGGTAC GTAGTCGGAT CAGACGAACG GTCGCCGCGG TACTTGCTGC TGCGGCTGCT CCCGCGGCAT TGATCGTAGT CCACCAGTTC TATATGCCGC TGCCGATAGC ATTTCAGAAT CTCTATCGGT GGCTCACCAC GCTGGCCTAG
|
Protein sequence | MMNGADARFY FSILLRRLPY LVAIVGSVIA LTVIVASILP PRYRASAKIL VEAPQIPVEL ARSTVPIQAA QQLQIIRQQI TTRDDLLALA DMLDIYGKEE DELSKDDIVD NMRSRITFEE LALSAPYGDT GASVASVSFT AADPDLAARV ANELVDFILL KQQQQRTSRA ADTVKFFDQA VARLGTDLSR AELEILRYKN EHADTLPESL DFRRSQQTGQ QQRWITLERE ESDLRAKRST LVESYVLGGQ APDGKAATPE QLALQELTRA LAEQRAIFSE NSPNIMALRG RIASLQATLR TTQTSEASSA QDRVARSPLD QQLAYIDERL RAIVGEKAAI TDRIDELSKS ISATPESETV LYSFERDRAN LQSQYNTAIA RRAEAIIGQQ IERRSDGSSF SVLERATAPE MAESPNRRRI VLLGALAGTA LSVAFIGLLE FFNAAIRRPN ELARLLDRQP LATIPYISTV AEVRSRIRRT VAAVLAAAAA PAALIVVHQF YMPLPIAFQN LYRWLTTLA
|
| |