Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1474 |
Symbol | |
ID | 5322332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1558074 |
End bp | 1561460 |
Gene Length | 3387 bp |
Protein Length | 1128 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640790422 |
Product | hypothetical protein |
Protein accession | YP_001327154 |
Protein GI | 150396687 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.105081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0154551 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA TCCGCGGCGA AAGAGTGGAC TTTCGCAGGG AAGACATCGT CGCGCTGCAC GCTTTGCCCT CGGCTCAAGC TCACGACCCG GTCATCGTGC ACACGCCGCG GCCTGGCGGT GCCTGGCGCC TGTGCGGAAG GATTCTGCTC TGCTGCTCGC TGCTCGTCTT CATCGCCGTT GCTTCGCTTA TCGCCATAAT AGAAAGCGGA ATTGTAGACG GGCCGCTGAA TGCCAGGGCC AGGACGGCGC TCAACACCGC ACTCGGACAG GATTATAGCG CCGATGTCGA GAGCACGGTG ATCCGGCTGA CCGGCGGCGG CGCACTTGCG CTCAAGGCGC GGGGCGTGAC GCTGAAGGAG CGCGGATCCG GCCGGCATCT CGCCAAGCTC GGTGCAATTT CGATCGCCCT CGATCCATTT GCCCTGGCGA CCGGCCGCAT CAATGTCTCG AGGCTTGAAG CGGAGGGTGG CGAGCTCGAC ACCGGGCTCC TGCCGCGCGG CGAGCCCATC GATCTTACAG CCATCCGCAT AGCGGACGTA GGCACTGCCC TCGAAGAATT GTTCGCACAG GGCGATCGGA TGTCGCGCTT GACCGCCGGA CGGTCGACCC AGACCGTCGT CCTCTCTGAC TTCAGCCTAA CGGTCAGCGG CACGCGTGGG CGGGCCGTTC CCGTCGAAAT CAAGACACTG CAATTCAGTC ACGACCCCGA CAGTTCGATG CGAGTTGAGG GCACGATTGC GGTTGACGGC ATCGAATCTC AGCTTACGGC AAAGGCTCTC GGCGACAGGG GGCGCATCGC CGCCTTCGAG GCCGGGCTCG ACGCGCTGCC GCTTTCGCCG TTCCTCCATC ACGGCAAGTC GGGCAATGAG GAGGCCTTCG GCATCGAAGC CACCGCCAAT GTGACGCTCA AAGCCGCCCG CGCTGCGGAT GGCACGAAGC CGGCACTCAC GGCGGCCGTG AAGACGTCCA GGGGCTCCTT CCACGCCGGC GGCCTCGCCT CCAAACTCAA CTCCGCGGAA CTGAACGTCT CCTACGATTT CGAGCGAGCC TCGGTCGAGA TATTGTCGTC GATGGTCAGG ATCGGCCGGT CGAGCTTCCC CTTCACGGGA GCCCTGATCG ATCTCGACAA GATTGCCGGC GCAGACCGGA AAGGATTCGC GGTCGATCTC CTGTTCAAGA ACGCCAGCTC CGACCCTGAG GACATGCAGG CGCCCCCGCT TGCCTTCGAT GCCAAGGCAA GCGGGCGTTT CGAGTCCGAC ACCCACCGGC TGATCTTCGA TCAACTCGCG ATATCGAGTC CGCTTGGCTC CATGGCGGGT TCGCTTTCGG TCGCCTTCGG CAAGACGTCG CCGCGCATCA GTTTCGCTGC GGTCAGCGAC AGGATGCACT CCAGCGCCGT CAAGCAGCTG TGGCCCTGGT GGCTGGCGAA GGGGGCTCGT CGCTGGGCGT TAGGAAACCT CTTCGGAGGC ATGGTAAGCG ACGCACGCAT CGAGGTCTCG ATTCCGGAGG GGCGTATTGC CAGTAGCGGC GGAGAATTGA GGCTCAACGA AAAGGAGCTC AACATCAACT TTGCCGTCGA CGAGACGCGC ATCAACATCG CGGGAGAGAT ACCGCCGTTG CGTGACACGG CCGGACGCTT CAGCCTGAGC GGCGAGCGGA TGTCCGTCGC CGTCGAGAAG GGCGCCGCCT TCTTCCCGTC CGGTCGCTCC GTCGCCTTGA ATGGCGGGGA TTTCATCATT GCCGACGTCT ATAAAAAGCC GCTGATGGCG GAAATGAAGA TCAAGGTCGC GGGCGAGGCG GACGCGATAG CCGAGCTCGT CCGCTACAAA CCGATAGAGG CGCTTCGGAA GACGCCTTTC ACTCCCGAGG ATTTTACCGG CCCGATGACG GCGCTGGTGG GTGCGCGGTT TGGTCTCATT TCCGACCAGA AGCCGCCACG TCCACTGTGG CAGGTGGAAA TGGAGCTTGA GGACGTGACG ATAAAGCGGC CCGTTGCAGG ACGTTCGATC GCCGATCTCG ACGGGACGAT GAGGATCGAC AACGAGCGTG CAGTGCTCCA GGCGAACGCC CTAATCGACG GTGCGAAGAT GCGCGTCGCG CTCACCGAGC CGGTTGGCAC CTCGGCCAAT GTGGCGAGAA CGCGCGAAAT CTCCGGAACG CTCGACGACG CGGCGCGGGC GAAAATCGCT CCGGCCCTGT CCGGAATCGT CAGCGGGCCC GTGGGCATAG ACGTTTCGCT TGCGGAAGAC GGCAGCCAGT CGGTCAAGGT TGACCTCGGA AAGGCCGTGC TGTCGCTTCC CTGGATTGGC TGGAGCAAGG GCTCGGGCAT CCCAGCAAAG GCGCAATTCA CAATTCGGGC GGCCGGTGGC ATCACGGAGA TAAACGACCT GCGGCTCACC GGAGAAGGTT TTGGCGGCAA TGGCGAATTG CGCGTAGACG AATCCGGCCT TGCAGCGGCG CGGCTGAGCG GCGTGCGCCT GGCAAGCGGC GACGATTTTT CCGTCACAGT GGGACGCAGC AAGGGAGGCT ATTCGGTAAA CCTAACCGGC ACCGCAGCGG ATATCCGACC GGCGCTCGCT CGCGTCAAGG GCGGCGCGAG TTCAAAGGAT GGCGGCAATG TGAAGATCAA GGCGCGGCTC GACCGGGTCA CCGGTTTCAA CGGCGAAGTC CTGTCGAATG TTGATCTCAC CTATTCGAGC CGCGGCCAGC AGATCGACGA TGTCAACCTT TCAGCGATAA CGGCAAGCGG CCAGGCAGTC GTTGCCAGGT TGGTCAAGGC CGGTGCGGAC AACACGCTGG AACTGACGAC AAGCGATGCC GGAGCCTTTG CACGCTTCAT CGACATCTAC CGCAATATGC GGGGCGGACT CCTCAATTTG CGCCTGCGTG ATCGCGGTGC CAACTCGTGG CGCGGAACCG TGGACATCCG CAAGTTCTCG CTGGTCGGCG AACAAAGGCT GCAATCGATG GTCTCGACCC GGGCGGGCCA GGACGGTCGC AGTCTCAACG AAGCGGTTCG ACGCGATATC GACGTGAGCA CGGCTCAGTT CGAGCGTGGC TTCGCGCAAC TCCTGCTGGA TCAGGGCGCA ATTCGGGTCG GCAGCGGAGT GGTCCGCGGT ATCGATGTGG GCGCGACCTT CCAGGGAACT GTCCGCGACG CCAATGGTCG TATGGACATG ACGGGTACGT TCATGCCGGC TTACGGATTA AACCGGCTCT TCGGGGAATT GCCGCTGATC GGTGTCCTGC TCGGAAATGG GCGCGACCGG GGCCTGTTGG GGATCACGTT CAAACTCGCC GGACCGTTCA GCCAGCCAAG TCTGACGATC AACCCGCTGT CGATCATAGC GCCGGGCGTC TTCCGCAATA TCTTCGAGTT TCAATGA
|
Protein sequence | MSDIRGERVD FRREDIVALH ALPSAQAHDP VIVHTPRPGG AWRLCGRILL CCSLLVFIAV ASLIAIIESG IVDGPLNARA RTALNTALGQ DYSADVESTV IRLTGGGALA LKARGVTLKE RGSGRHLAKL GAISIALDPF ALATGRINVS RLEAEGGELD TGLLPRGEPI DLTAIRIADV GTALEELFAQ GDRMSRLTAG RSTQTVVLSD FSLTVSGTRG RAVPVEIKTL QFSHDPDSSM RVEGTIAVDG IESQLTAKAL GDRGRIAAFE AGLDALPLSP FLHHGKSGNE EAFGIEATAN VTLKAARAAD GTKPALTAAV KTSRGSFHAG GLASKLNSAE LNVSYDFERA SVEILSSMVR IGRSSFPFTG ALIDLDKIAG ADRKGFAVDL LFKNASSDPE DMQAPPLAFD AKASGRFESD THRLIFDQLA ISSPLGSMAG SLSVAFGKTS PRISFAAVSD RMHSSAVKQL WPWWLAKGAR RWALGNLFGG MVSDARIEVS IPEGRIASSG GELRLNEKEL NINFAVDETR INIAGEIPPL RDTAGRFSLS GERMSVAVEK GAAFFPSGRS VALNGGDFII ADVYKKPLMA EMKIKVAGEA DAIAELVRYK PIEALRKTPF TPEDFTGPMT ALVGARFGLI SDQKPPRPLW QVEMELEDVT IKRPVAGRSI ADLDGTMRID NERAVLQANA LIDGAKMRVA LTEPVGTSAN VARTREISGT LDDAARAKIA PALSGIVSGP VGIDVSLAED GSQSVKVDLG KAVLSLPWIG WSKGSGIPAK AQFTIRAAGG ITEINDLRLT GEGFGGNGEL RVDESGLAAA RLSGVRLASG DDFSVTVGRS KGGYSVNLTG TAADIRPALA RVKGGASSKD GGNVKIKARL DRVTGFNGEV LSNVDLTYSS RGQQIDDVNL SAITASGQAV VARLVKAGAD NTLELTTSDA GAFARFIDIY RNMRGGLLNL RLRDRGANSW RGTVDIRKFS LVGEQRLQSM VSTRAGQDGR SLNEAVRRDI DVSTAQFERG FAQLLLDQGA IRVGSGVVRG IDVGATFQGT VRDANGRMDM TGTFMPAYGL NRLFGELPLI GVLLGNGRDR GLLGITFKLA GPFSQPSLTI NPLSIIAPGV FRNIFEFQ
|
| |