Gene Information |
Locus tag | Smed_4058 |
Symbol | |
ID | 5318881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 517850 |
End bp | 521068 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775865 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_001312798 |
Protein GI | 150376202 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.402749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCA TCAACGGCAC TTCTGGAAAC AATATTCTCA TCGGCACGGG TTCGGACGAC AGCCTCAATG GTTTAGCCGG CGACGACCTG CTTCAGGGAC TTGGAGGCGC AGACATCATC AATGGCGGCG CGGGTGTGGA CACGGCCGAC TATCGCGAGA AGGCGGCCCC AATCGTAGTC ACATTGACCG GTGCGACGGC GGCGACCGTC TTTATCAATG GCGTCGCGGA GGACACGCTT GCCAACGTAG AGAACGTTTA CGGGGGCTCC GGCGACGACT TTATCACCGG CGACGCGCTG AACAATCTCT TCCGGGGAGG CGGCGGAAAC GACGTGCTCG ACGGCGGCGG CGGGAATGAT ACGGCCGACT ATACCGACAA GACAGCCTCC GTCGTGGTCA CGCTGGCCGG GGCGACACCT GTGACAGTTT TCGTGAACGG CATCGCCGAG GACATGATCA GCAACTTCGA AAACGTCTAT GGCGGTTCGG GCGACGACAT CCTAACGGGC GATGATCGGG ACAACATTCT TCGCGGTGAG GTAGGAAACG ACATTCTCAA TGGCGGCATT GGCGCCGATT CCCTGTTCGG TGGCGAGGGC AACGATACCG TCGATGGCGG CAACGGGGTC GACACCTTCG ACCTGCGCGA AAAGACATCC TCCGTCCTCG TGCAGCTCAA TGGTGCGAAT GGAGCTACAG TCTTCGTCGG CGGCGTTGCG GAGGATACGA TCCGCAACGT CGAGAACATC GTAGGTGGAT CGGCAGGCGA TACGCTTACC GGCGATGCCG CCGCCAACAA GCTCTCGGGT GCGCGCGGCA ACGACTGGCT CATGGGCGGC AGCGGTGCCG ACATACTGGA CGGCGGCGAG GACAGCGATA CGGCGGACTA CAGCGACAAG CAGGCTGCCA TCGTCGTGGC GCTGAATGGC GGCAATCCCG TCACAGTGAC AGTCGGCGGC CTTGCCGAGG ACTCGATCGC CAAGATCGAA AACATCGTCG GGGGTTCGGG AGACGATACG CTCAGCGGCG ATGCCGCTGC CAACACGTTT CGCGGTGGTC TTGGCGCCGA CGCGCTTGAC GGCGGCTCCG GCAGCGATAC CGCGGACTTC AGCGACAAGG CGCAATCCGT CGTGCTTGCC CTCAACGGAG CGATCGATGC CGTTGCGACC GTCGGCGGAA CGGCGGAGGA CTCGGTCCGA AACATCGAAA ACATCATCGG CGGTGCGGGA AACGATCAGC TCACGGGCGA TGCCGCCGCC AATATGTTCC GTGGCGGCCT CGGCGCCGAT GTGCTGGATG GCGCTGCCGG CAGCGACACG GCGGACTTCA GCGACAAGAC GTCATCGCTT GTCGTAACTC TGGCCGGCGC GAGCCCGGCA ACGGTCCTCG TCGGCGGCAT CGCGGAGGAT ACGCTGCGCA ACATCGAGAA CATCATCGGC GGTTCGGGCA ACGACGTGTT CGTTGGCGAC GGCTTGCAGA ACGTGTTCGA TGGCGGGGCG GGTACCGACA CGGCCGACTA TTCGGCCTCG GCGAAGGCGA TCGCTGTAAC GCTCAACGGC GCCATTGACG CGAGGGTGTT CATAGGCAAC GCGGCCGAGG ACACGTTGCG CAACGTCGAA AGCATCACTG GAAGCGCCCT GGCGGATGTG ATCACCGGCG ATGCGCTGGC AAACAGCCTC CTCGGAGGCG GCGGCGCCGA CCTGCTAAAG GGGGGCGGCG GCCAGGACGT TGTGGATGGC GGCAGCGGAT CAGACACCAT AGACTTCGGC GACAAGACCG CTGCCGTCGT CCTGACCCTT 
GCGGGTGTGG CTAATACGAC AGCCACTGTC GGCGGGGTGG CCGAGGACAC GGTCCGCAAT ATCGAGAACA TCTTCGGAGG CGCCGGCGCC GACGTGCTGA CAGGCGACGG CAACAGCAAC ACAATCCGCG GCGGGGCCGG GGCAGACGGC CTGGACGGCG GCGCGGGGGT CGATACGGCC GATTATCGGG ACAAGGTGAC GTCGGTTGTC GTCACGCTCA GTGGCGCGAA TGCCGCGCTT GTCAAGGTGG GCGGCCTTAA CGAAGATACG ATCCGCAACT TCGAAAACGT TGCAGGCGGT TCGGCCGGCG ACACGCTTGT CGGCGACGAC CTGGCCAATG TGCTGCTTGG CAATGACGGT GCCGATACGC TGAAAGGCGG CTTGGGCAGT GACGTGCTGG ATGGCGGCAA TGGCATCGAC ACTGCGGATT ACCTCGAGAA AGCCGACGCG ATCTCCGTCA CGCTCAACGG GACCACGAAC GCGACGGTCT TGGTCGGCGG AGTTGCGGAA GACGTCATTC GCGGTGTAGA AAACATATTG GCGGGGTCCG GTGCCGACAC ACTTGTCGGC GATGGCGCGA GCAACACGTT TCGTGGAGCG CTCGGCGCTG ACTTCATCGA TGGCGGGGCG GGGTCCGATA CGGCCGATTA TCGCGAGAAG GCGGCGGCTG TCGATGTGAC CCTCTTCGGC GCCGGTGACA GCTTCGTCTT CGTCGGCGGG GTCGCGGAAG ACACAATCCG AAACATTGAA AACGTATTCG GCGGCAAGGG CAACGATACG CTGACGGGCG ACGACTCCGC CAACACCCTC AATGGCAATG ACGGCAAGGA TTTGCTTACC GGCGGCGGCG GAGCGGACAT TCTTGACGGT GGGGCGGCCT CCGACACGGC GAACTACCGC GACAAGAGTG CATCGGTTTC CGTCACCCTC AACGGTGCCG CCTCCACGGC TGTCATGGTC GGCGGGGTGG CCGAGGATAC GGTCCGCAAT ATCGAGAACG TCTGGGGCGG GACCGGCAAT GACAGCCTGA GCGGCGACGG CAATGCAAAT CTGCTGTCGG GCGGCGGCGG AAGCGACATG CTGTCCGGCG GCGCGGGGGC GGATATTTTC CAGTTCGACT TCGCTTTGGG AGCGAGCAAT GTCGACATGG TTCTGGATTT TACCGCCGGA GACCGGCTCT TTCTGTCGAA GAGCGTTTTC ACCAGCCTGA GCGGCGGTTC GCTGGCAAGC AACCAGTTCT ACGCGGCGGC CGGCGCAACG GAGGCGCAAG GCGTGAACCA AAGAGTCGTT TACGATACGA CGACGGGCGC GCTCTACTAT GATGCTGACG GCAACCTTTC GGGGCATACG GCCGTGCAGT TTGCAGTCCT CTCTACACAG CCGGGACTGA CTGCAGGAGA TTTCGTGCTC GTCGTGTGA
|
Protein sequence | MAVINGTSGN NILIGTGSDD SLNGLAGDDL LQGLGGADII NGGAGVDTAD YREKAAPIVV TLTGATAATV FINGVAEDTL ANVENVYGGS GDDFITGDAL NNLFRGGGGN DVLDGGGGND TADYTDKTAS VVVTLAGATP VTVFVNGIAE DMISNFENVY GGSGDDILTG DDRDNILRGE VGNDILNGGI GADSLFGGEG NDTVDGGNGV DTFDLREKTS SVLVQLNGAN GATVFVGGVA EDTIRNVENI VGGSAGDTLT GDAAANKLSG ARGNDWLMGG SGADILDGGE DSDTADYSDK QAAIVVALNG GNPVTVTVGG LAEDSIAKIE NIVGGSGDDT LSGDAAANTF RGGLGADALD GGSGSDTADF SDKAQSVVLA LNGAIDAVAT VGGTAEDSVR NIENIIGGAG NDQLTGDAAA NMFRGGLGAD VLDGAAGSDT ADFSDKTSSL VVTLAGASPA TVLVGGIAED TLRNIENIIG GSGNDVFVGD GLQNVFDGGA GTDTADYSAS AKAIAVTLNG AIDARVFIGN AAEDTLRNVE SITGSALADV ITGDALANSL LGGGGADLLK GGGGQDVVDG GSGSDTIDFG DKTAAVVLTL AGVANTTATV GGVAEDTVRN IENIFGGAGA DVLTGDGNSN TIRGGAGADG LDGGAGVDTA DYRDKVTSVV VTLSGANAAL VKVGGLNEDT IRNFENVAGG SAGDTLVGDD LANVLLGNDG ADTLKGGLGS DVLDGGNGID TADYLEKADA ISVTLNGTTN ATVLVGGVAE DVIRGVENIL AGSGADTLVG DGASNTFRGA LGADFIDGGA GSDTADYREK AAAVDVTLFG AGDSFVFVGG VAEDTIRNIE NVFGGKGNDT LTGDDSANTL NGNDGKDLLT GGGGADILDG GAASDTANYR DKSASVSVTL NGAASTAVMV GGVAEDTVRN IENVWGGTGN DSLSGDGNAN LLSGGGGSDM LSGGAGADIF QFDFALGASN VDMVLDFTAG DRLFLSKSVF TSLSGGSLAS NQFYAAAGAT EAQGVNQRVV YDTTTGALYY DADGNLSGHT AVQFAVLSTQ PGLTAGDFVL VV
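Consistency check | The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below is illustrative and not part of the source record. It assumes that the coordinates are 1-based and inclusive, and that the CDS includes its terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
# Minimal sketch: cross-check the length fields of the Smed_4058 record.
# Assumptions (not stated in the record itself): coordinates are 1-based and
# inclusive, and the CDS includes one terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
gene_len = gene_length_bp(517850, 521068)
protein_len = protein_length_aa(gene_len)

print(gene_len)     # 3219, matching "Gene Length | 3219 bp"
print(protein_len)  # 1072, matching "Protein Length | 1072 aa"
```

Under those assumptions, both computed values agree with the lengths listed in the record.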