Gene Information |
Locus tag | Smed_2277 |
Symbol | |
ID | 5323138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2352475 |
End bp | 2355288 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640791215 |
Product | hypothetical protein |
Protein accession | YP_001327944 |
Protein GI | 150397477 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02226] N-terminal double-transmembrane domain |
|
|
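The coordinate and length fields in the table above are related by simple arithmetic, which the sketch below checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both are assumptions, although they are consistent with the figures listed. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2352475
END_BP = 2355288
STOP_CODON_BASES = 3  # assumption: the CDS ends with one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_BASES) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2814 937
```

Under these assumptions the result agrees with the Gene Length (2814 bp) and Protein Length (937 aa) fields.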
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.30304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.933566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGCA GCCTCTCCTT CATCTTTGCG AACCCGGCAA TATTGGCGGC ACTCATCGCA CTGCCGGTGA TCTGGTGGCT GCTGCGCATG ACGCCGCCAC GGCCGGCCGC CGAGATCTTT CCGCCGCTGC GCATCCTTGC TTCGGTGATG AAGCGGGAGG AGACGCCCTC GAAAAGCCCC TGGTGGCTGA CGCTTCTCAG AATGCTGATG GCGGCGGCCG TCATTTTCGC GATAGCCGAC CCCGTTTTCA ATCCGCGCAG CAATACGCTC GCAACCTCCG GACCTCTGGC ACTGCTGATC GACAACGGTT GGGCGACGGC GCCCGACTGG GAGCGGCGTG TCGAAACGGC CTCGGCGCTG ATCGACGACG CCGAATCGAG GGACGTCGCG ATTTCGGTCG TCTTCACCGG AGAGCGTCAG CACGATGCGA CCCCCGGCTC GGCGAGCGCA GCGCGCAACA GGCTTGCTGC GGCTAGGCCC GAGCCTCTGC CTGCCGACCG GGCATCCGCG ATCGCAGCCT TGCAAACGGC CTTCGAGGAC GCCCCTCCAG CTACCCTGGC CTTCATCACA GACGGCATCG CAAGTGGCGA CGGTAAAACG ATGGAGAGGC TGGCGGCGAT GGGCGCGTCC AATCTCAAGG TGATCGAGGC GGACGGCAGC GAGGCGGTGG CGCTTACCGC GACGAGCAAC GAGTCCGGCG GCATGGCTGT GACGGCGAGC CGGCTGAAGA CAGACGACGG CCGATCTCTG CCCATCGTCG CCTTCGATAC GCGTGGCCGC GCGATTGCCA GCGGAACGAT CGACTTCGCA CCCGGTGCGG CGACGGCCAG GGGGACGATC GAAGCCCCGT TCGAACTGCG CAACGATTTC GCCCGAATCG GCATCGAAGG AGCAGCAACG GCCGGCGCCC AGCATCTCCT CGATGACGGC TTCCGACGCC GGCGCGTGGC GCTGTTGAGC GGCGAGGCAC GCGACCTGTC GCAGCCCTTG CTCTCGCCGC TTTACTATAT CAACCGGGCG CTCGCCCCTT ATGCGGACCT GATCGAGCCC CGCCAGAACG ACCTCGCCGT TGCGATCCCG GAACTCCTGG AGCAGCGCCC GTCGGTCCTG ATCATGGCCG ACATTGGCAG GCTGCCGGAA GAGACGTATC CCCTGCTTGC AAAATGGATC GAAAACGGCG GCACGCTGAT CCGCTTCGCC GGCCCGCGCC TCGCCGCCGC ACCAGCCGAC GATCCGCTCG TCCCGGTCAC GCTTCGGCAG GGTGAGCGTG CGCTCGGCGG CGCGCTCTCC TGGTCAGAAC CGCAGCCGCT CGCGGATTAT CCGGGCAACA GCCCGTTTGC CGGCATGGCA CGCCCGACCG ACATTCTCGT CAAGCGGCAG GTTCTCGCCG AGCCTACGGC CGATCTGGCC GAGCGCACAT GGGCAAATCT CGCCGACGGC ACACCGCTCG TCACGACGGC GGTGCGCGGC ACGGGCCGGA TCGTGCTCTT TCACGTCAGC GCCGAGGCCA CATGGTCGAA CCTGCCGATA TCAGGGCATT TCGTTGAGAT GCTCCGGCGC AGCGTCCAGC TTTCGCGGGG CGGCGGCATT GCAAGCGACG GGAAGGCCGT AAAGGCCCAG ACCTTGCCGC CCTACCGGCT GCTGAACGCG GACGGAGTGC TCATCGCCGA GACCGGCAAT GTCCGGCCGC TTGAGATCGT GCCGGGGAAG CCACCGGTCG CAACAGCCGA TAACCCTCCG GGGCTTTACG GCTCGGAAGA CGGCTTTACG GCACTGAACC TCCTGCCGCG CGACGCGGAG 
CTCAAGCGGA TCGACATGAC CGCCGCCGGC CCTCACACCT CCGAGCGCTT GACTGGTGTC GAAAGCTGGT CGCTGAAGCC GGCCCTGATG ACGATCGCGC TGCTTCTGTT GCTTGCGGAC GCACTGATCG TCCTCTTCAT GGGCGGAGTC TTTTCGCGCG CGCGGATAGC AAGGGCCGCT TCGGCAATCA TCGTCGTTCT CGCCGGGGCC TTCGTCCTCG CCCCGCCGCA TGCCTTTGCC GACGACGCAC GCCCGGACGA TCAACGTATC CTCGAGCGGC TGGACACCAC ACATCTCGCT TATGTCGTCA CCGGCGAAGA GGACGTAGAC CGCCTCTCCG AAAGCGGATT GAGAGGGCTT ACCGATTTCC TGACCTATCG CACCACGCTC GAGCCCGGGG CACCCGTCGG CCTCGATATA ACCAAGGACG AGTTGAGCCT GTACCCGATC ATCTATTGGC CGATTTCCGC CACTGCTCCA ATGCCGTCCC CGCAAGCCGT CAGCCGCGTC GACGCCTATA TGCGCGCGGG CGGCACGGTT CTGTTCGACA CGCGAGATCA GTTTTCCTCG CTCGGCTCCT CATCGAACGG CACCAGCCAG AACACCGAAC GCCTCCAGGC TATCCTCGGC AATCTCGACA TACCGCCGCT GGAGCCGGTT CCAGCCGATC ATGTCCTGAC CAAGGCCTTT TATCTGCTGA CGAACTTTCC CGGCCGCTAT ACAGGCAGCC CACTCTGGAT CGAGGCGCAG CTGGACAATG GCGAGGAGAC GAGCAACCGG CCGGCGCGGC CTGGCGACGG TGTGACGCCC ATCATGATCA CCGGCAACGA CCTTGCCGGC GCATGGGCGG TCGACTCGAA CGGAGTCGCT CTGCTGCCGA CCGTGCCGCC GGACGAAATG CAGCGTGAAT ACGCCTTCCG CTCCGGCGTC AACATCATGA TGTATATGCT GACCGGCAAC TACAAGGCCG ATCAGGTGCA CGTGCCCGCG CTGCTCGAGC GGCTCGGACA ATGA
|
Protein sequence | MIGSLSFIFA NPAILAALIA LPVIWWLLRM TPPRPAAEIF PPLRILASVM KREETPSKSP WWLTLLRMLM AAAVIFAIAD PVFNPRSNTL ATSGPLALLI DNGWATAPDW ERRVETASAL IDDAESRDVA ISVVFTGERQ HDATPGSASA ARNRLAAARP EPLPADRASA IAALQTAFED APPATLAFIT DGIASGDGKT MERLAAMGAS NLKVIEADGS EAVALTATSN ESGGMAVTAS RLKTDDGRSL PIVAFDTRGR AIASGTIDFA PGAATARGTI EAPFELRNDF ARIGIEGAAT AGAQHLLDDG FRRRRVALLS GEARDLSQPL LSPLYYINRA LAPYADLIEP RQNDLAVAIP ELLEQRPSVL IMADIGRLPE ETYPLLAKWI ENGGTLIRFA GPRLAAAPAD DPLVPVTLRQ GERALGGALS WSEPQPLADY PGNSPFAGMA RPTDILVKRQ VLAEPTADLA ERTWANLADG TPLVTTAVRG TGRIVLFHVS AEATWSNLPI SGHFVEMLRR SVQLSRGGGI ASDGKAVKAQ TLPPYRLLNA DGVLIAETGN VRPLEIVPGK PPVATADNPP GLYGSEDGFT ALNLLPRDAE LKRIDMTAAG PHTSERLTGV ESWSLKPALM TIALLLLLAD ALIVLFMGGV FSRARIARAA SAIIVVLAGA FVLAPPHAFA DDARPDDQRI LERLDTTHLA YVVTGEEDVD RLSESGLRGL TDFLTYRTTL EPGAPVGLDI TKDELSLYPI IYWPISATAP MPSPQAVSRV DAYMRAGGTV LFDTRDQFSS LGSSSNGTSQ NTERLQAILG NLDIPPLEPV PADHVLTKAF YLLTNFPGRY TGSPLWIEAQ LDNGEETSNR PARPGDGVTP IMITGNDLAG AWAVDSNGVA LLPTVPPDEM QREYAFRSGV NIMMYMLTGN YKADQVHVPA LLERLGQ
|
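The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch, using only the Python standard library, of how the blocks could be joined, a GC fraction computed, and codons translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons); the sketch does not handle alternative start codons, which is harmless here because the gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first three and last three blocks of the sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three and last three blocks of the gene sequence listed above.
first_blocks = "ATGATCGGCA GCCTCTCCTT CATCTTTGCG"
last_blocks = "CTGCTCGAGC GGCTCGGACA ATGA"

print(translate(first_blocks))  # prints: MIGSLSFIFA
print(translate(last_blocks))   # prints: LLERLGQ
```

The two outputs match the first ten and last seven residues of the protein sequence listed above, and the final codon TGA is a stop. Applying `gc_fraction` to the full gene sequence would be one way to cross-check the GC content field.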