Gene Information |
Locus tag | Smed_3525 |
Symbol | |
ID | 5324413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3728785 |
End bp | 3733803 |
Gene Length | 5019 bp |
Protein Length | 1672 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640792475 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001329176 |
Protein GI | 150398709 |
COG category | |
COG ID | |
TIGRFAM ID | |
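The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3728785
end_bp = 3733803
gene_length_bp = 5019
protein_length_aa = 1672

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 5019 0 1672
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.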
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGATCG GAGAGGGCGG GCATCTGCTG CTCGACCCCA AAAACATCAT CATCGGCACC CCGGCAACCG TCTCGGGATG GGCCTATCAG GCGATCATTG GCGCAGGATA TGGCAAGAAC CGCAATGTCA TCGCGCTCGG TGCAAATGAT GGATTCGGTC TCTCTGTTTC GCTGAACGCG GCGGGGGACC GTTTGGCGGT CGGGGCGTAT CAGGACGACG GGTCCAGCGG CAACGTGTCC AATTCGGGCG CGGTGTATCT GTTCAGCTTC ACCGACACGA CGTTTTCGGG CGGCATGCTC GAGGCGGTGA TCGGCAAGGA CTATACGGGT GGCAAGAATG TCGATGTCGG CGCGCTCGGT GCGGATGATG GGTTCGGTGC TTCTGTTTCG CTGAACGCCG CGGGGGACCG ACTGGCGGCC GGGGCGTATC AGGACGACGG GTCCGGCGGC AACGTGTCCA ATTCGGGCGC GGTGTATCTG TTCAGCTTCA CCGACACGAC GTTTTCGAAT GGCTCGCTCG AGGCGGTGCT CGGCAAGGGC TATACGGGCG GCAAGAATGT CGATGTCGCC GCGCTCGCTC GGGAGGATCA GTTCGGTGTT TCTGTCTCGC TGAACGCGTC GGGGGACCGC CTGGCGGTCG CGGCGGATCT GGACGACGGG TCCGGCAAGA ACGTCTCCAA ATCGGGAGCG GTGTATCTGT TCAGCTTTAC GGATGCAGCG TTTTCGGGCG GCACGATCGA GGCGGTGCTC GGCAAGGGCT ATACGGACGG CAAGAATGTC GATGTCGCCG CGCTCGCTCC GGATGATCAG TTCGGTATTT CTGTCTCGCT GAACGCGGCG GGGAACCGGC TGGCGGTTGG AGCGATCGGC GACAACGGGT CTGGTGGCAC CTCCGTGAGC AGAGCCGGGG CGGTGTATCT GTTCAGCTTT ACGGATGCAG CGTTTTCGGG CGGAACGCTC GAAGCGGTGC TCGGCAAGGG CTATACGGGC GGCAAGAATG TCGATGTTGC CACGCTTGAG GTACTTGATG CGTTCGGCTC GTCGGTCTCG CTGAACGCGG ATGGGGACCT TCTGGCGGTC GGCCCATTTC TGGACGACGG GTCCGGCAAC GGCGTGAGAG ATTCGGGCGC GGTGTATCTG TTCAGCTTTA CCGACACGAA GTTTTCAGGT GGCCTGCTCG AGGCGGTGAT CGGCAAGGGT TATTCTAACG GTAAGAATGT CAATGTCGAC GCGCTCGGTG TAAACGATGG ATTCGGTGGT TCCGTCTCCC TGAACGGGGC GGGGGACCGT CTTGCGGTCG GGGCGAACCT GGACGACGGG TTCGGCAACC GCGTGAAAGA TTCGGGCGCG GTCTATCTGT TCAGTTTTAC GGATGCAGCG TTTTCGGGTG GCACGCTCGA GGCGGTGATC GGCAAGGGCT ATACGGGCGG CAAGAATGTC GATGTCGCTG CACTCGAGAA CGATGACTGG TTTGGTCATT CCGTCTCGCT GAACGCGTCG GGGGATCGTC TTGCGGTTGG GGCGAACCTG GACGACGGGT TCGGCAACGG CGTGAAAGAC TCGGGCGCGG TGTATCTGTT CAGCTTTACG GATGCAGCGT TTTCGGGTGG CGGGCTCAAG GCGGTGATTG GCAGGGGCTA TGATAATACT ACAGGCGATA AGAATGTGGA TGTCGCTGCG CTTGAGAGCG ACGATCGGTT CGGCTCGTCC GTCTCGCTGA ACGCGGCGGG GGATCGTCTG GCGGTCGGGG CGCCTGGCGA CGGGTCCGTG GTCGAAGCGG GCGCGGCATA TCTGTTCAGC 
TTCACCGATA GGGCATTTTC CGATGGGGCG CCCGAGCTGG TGATCGACAA GGACTATCCG GGCTTTAATG GTGCCGCGCT CGAGGAAGCT GATCGGTTCG GCTCGTCGGT CTCGCTGAAC GCGGCGGGGG ATCGGCTGGC GGTCGGGGCA CCTGGCGACG ACGGGCACGA CACCAGCCCG ATCATTGAAG ATAATAATTT TGGAGCGGTG TATCTGTTCA GCTTCACCGA TACGGCGTTT TCGGACAGTA CTCACGAGGC GGTGATCGGC AATGGCTATT CGGGCGGCAA GAATGTCGAT ATTGCCACGC TCGAGGACGG TGATACGTTC GGTTCGTCCG TCTCGCTGAA CGCGTCGGGG AACCGCCTGG CGGTCGGAGC GCTTGGCGGT AACGGGGCCA ATAACATCGA CGATTCGGGT GCAGCCTATC TGTTTCGCTT CACCGACACG GCGTTTTCGG GTGGTACGCA GGAGGCGGTG ATCGGCAATG GCTATTCGGG CGCCAACAAT GTCGATGTCT CGCTCGAGGA GGATGATAAG TTCGGTTCAT CCGTCTCGTT GAACGCGCTG GGGAACCTTC TGGCGATCGG GGCGCCTCTT GACGACGGAG CCGGCAACAG GGTGTCCGAT TCGGGAGCGG TTTATCTGTT TGCGGGCATC CTCGACGGCG ATTCGGTTTC GTCTGCCAAC TATGGCGATG ATCCCTCGGC TGACAGCTAT ATCCTGCCCT CCGACATCGT GTCGCTTCTT TCGGCGGGCA CGAATGTGAC CCTGCAGGCC AATAACGACA TCACTGTTGC CGAGGCGGTC GCCGTCACGG ACGGTTCGGC GAGGCTGACC TTGCAGGCAG GGCGTTCCGT TCTGATCGAT GCCGGCATAA CCAGCAATGG CGGGGATGTG ACGCTGATCG CCAATGATCT TTTGGAGAAC GGTGTGGTCG ACGCGCATCG GGACTCCGGC GCGGCGGTGA TCACCATGGC CCCGGGCACG GTGCTCGACG CGGGCACGGG GGCTGTGATC TTCGACCTGC GTGCCGGTAC GGGCAAGACC AAAAGGCAAG GCGGCGACAT CACGGTGGGC ACGGTGAATG CGGGGTCGAT CCTGGCGGTA AACGCCGGGC CGAACGGGAA GTCGGGGATC GTGCTCGGCA GCGGATCGGT GCTGACGGCT TCGGCCACGG GCAATGCGAT CGTTCTGGCC GGTGATCGCT TTACCAACAG GTCAGGCGCC TCGCCGCTGC AGGCGTCAGG CGGGCGCTGG CTGGTGTGGT CCGGGAACCC GGCGGACGAT ACGCGCGGCG GTCTTTCCTA CGGGTTCAAG CAGTATAATG CGAAATATGG TGAGACGGCA GTCGCCCAGG GCGCGGGCAA CGGTTTTCTC TATAGCCTTG CCCCCAAGAT CACGGTGGGT CTGACCGGCA CGGTGTCGAA GGCCTATGAC GGGACGACCG GTGCGGTTCT GGCCGGGGGC AATTACACGG TCTCGGGCGC GGTGGATGGC GATACGGTGA GTATTACGCA GACCGCGGGC AGCTACGACA CAAAGCATGT CGGCACGGGC AAGACCGTGA CGGCGAGCCT GGCCGACAGC CACCTGAGCG CGGTCAATGG CACGGTGAAG GTCTATGGCT ACAAAACGGT CAATATGAGC GCAGCGGGCC CGGTGGGCGA GATCACGGCG CGTGCGCTGA CGGTTTCGAC AGAGGCCGTG AGCAAGGTCT ATGACGGGAC CGTTTCGGCG TCGGGCACGG CGATCGTGAC GTCGGGCGCG CTGCAGGGTA GCGACACGCT CTCCGGCGGC AGCTTTGCGT 
TCGCCGACAA ACATGCGGGC GCCGGCAAGA CGGTGACGGT CTCGGATGTG ACGATCGACG ACGGCAATTC GGGCGGCAAC TACATCCTGA CCTATGCCGA TAATACCGCC AGCGAGATCA CGGCGCGTGC GCTGACGGTT TCGACAGAGG CCGTGAGCAA GGTCTATGAC GGGACCGTTT CGGCGTCGGG CACGGCGATC GTGACGTCGG GTGCGCTGAA GGGCAGCGAC ACGCTCTCCG GCGGTAGCTT TGCGTTCGCC GACAAACATG CGGGCGCCGG CAAGACGGTG ACGGTCTCGG ATGTGACGAT CGACGACGGC AATTCGGGCG GCAACTACAT CCTGACCTAT GCCGATAATA CCGCCAGCGA GATCACGGCG CGTGCGCTGA CGGTTTCGAC AAAGGCCGTG AGCAAGGTCT ATGACGGGAC CGTTTCGGCG TCGGGCACGG CGATCGTGAC GTCGGGTGCG CTTCAGGGCA GCGACACGCT CTCCGGCGGT AGCTTCGCGT TCGCCGACAA GCATGCGGGC GCCGGCAAGA CGGTGACGGT TTCGGATGTG ACGCTCAATG ACGGCAATTC CGGCGGCAAC TACATCCTGA CCTATGCCGA TAATACCGCC AGCGAGATCA CGGCGCGTGC GCTGACGGTT TCGACAAAGG CCGTGAGCAG GGTCTATGAC GGGACCGTTT CGGCGTCGGG CACGGCGATC GTGACGTCGG GTGCGCTTCA GGGCAGCGAC ACGCTCTCCG GCGGTAGCTT CGCGTTCGCC GACAAGCATG CGGGCGCCGG CAAGACGGTG ACGGTTTCGG ATGTGACGCT CAATGACGGC AATTCCGGCG GCAACTACAT CCTGACCTAT GCCGATAACA CCGCCAGCGA GATCACGGCG CGTGTGCTGA CGGTTTCGCT CAGCGGTACG GTGTCGAAGG TCTATGACGG CGCGACGGCA GCGACGCTGT CTCCCGGCAA TTACAGCCTG TCGGGCCTTG TGCCGGGCGA CGTCGTTTCG ATTGTGCTCC TGTCGAGCAA TTACGATACC GCGGATATCG GCACAGGCAA GACGGTGAGC GTTGCCGGGC TTAGTCTGTC GGGTGTGGAT AAGGCCAACT ATCTGCTCGG CTCGAGTGCG GCGAGCGCGG CGATCGGGGA GATCACGTCG GCCGTCACCC CGTGGGATGA CAGCGTCAAG CAGGTCGTAG AGCCCTTGTT CGATCAGGAA GAGTCGGGCA AGCCGGACCG CGTGAGCTTG GATGAGACGC TGGGAATCAG AACCGGTAAC CGCCTCGACT CGGGCGCTGG TTTGCTGGTG AACTGCATGG AGCCGGAGGG GCGGGTGTTG AAACTGGTCG GCTCGCCCGT CGATGTCACG GGGTGGCAGG TCGCAACCTG TATGAGCGGT AGCCTATAG
|
Protein sequence | MKIGEGGHLL LDPKNIIIGT PATVSGWAYQ AIIGAGYGKN RNVIALGAND GFGLSVSLNA AGDRLAVGAY QDDGSSGNVS NSGAVYLFSF TDTTFSGGML EAVIGKDYTG GKNVDVGALG ADDGFGASVS LNAAGDRLAA GAYQDDGSGG NVSNSGAVYL FSFTDTTFSN GSLEAVLGKG YTGGKNVDVA ALAREDQFGV SVSLNASGDR LAVAADLDDG SGKNVSKSGA VYLFSFTDAA FSGGTIEAVL GKGYTDGKNV DVAALAPDDQ FGISVSLNAA GNRLAVGAIG DNGSGGTSVS RAGAVYLFSF TDAAFSGGTL EAVLGKGYTG GKNVDVATLE VLDAFGSSVS LNADGDLLAV GPFLDDGSGN GVRDSGAVYL FSFTDTKFSG GLLEAVIGKG YSNGKNVNVD ALGVNDGFGG SVSLNGAGDR LAVGANLDDG FGNRVKDSGA VYLFSFTDAA FSGGTLEAVI GKGYTGGKNV DVAALENDDW FGHSVSLNAS GDRLAVGANL DDGFGNGVKD SGAVYLFSFT DAAFSGGGLK AVIGRGYDNT TGDKNVDVAA LESDDRFGSS VSLNAAGDRL AVGAPGDGSV VEAGAAYLFS FTDRAFSDGA PELVIDKDYP GFNGAALEEA DRFGSSVSLN AAGDRLAVGA PGDDGHDTSP IIEDNNFGAV YLFSFTDTAF SDSTHEAVIG NGYSGGKNVD IATLEDGDTF GSSVSLNASG NRLAVGALGG NGANNIDDSG AAYLFRFTDT AFSGGTQEAV IGNGYSGANN VDVSLEEDDK FGSSVSLNAL GNLLAIGAPL DDGAGNRVSD SGAVYLFAGI LDGDSVSSAN YGDDPSADSY ILPSDIVSLL SAGTNVTLQA NNDITVAEAV AVTDGSARLT LQAGRSVLID AGITSNGGDV TLIANDLLEN GVVDAHRDSG AAVITMAPGT VLDAGTGAVI FDLRAGTGKT KRQGGDITVG TVNAGSILAV NAGPNGKSGI VLGSGSVLTA SATGNAIVLA GDRFTNRSGA SPLQASGGRW LVWSGNPADD TRGGLSYGFK QYNAKYGETA VAQGAGNGFL YSLAPKITVG LTGTVSKAYD GTTGAVLAGG NYTVSGAVDG DTVSITQTAG SYDTKHVGTG KTVTASLADS HLSAVNGTVK VYGYKTVNMS AAGPVGEITA RALTVSTEAV SKVYDGTVSA SGTAIVTSGA LQGSDTLSGG SFAFADKHAG AGKTVTVSDV TIDDGNSGGN YILTYADNTA SEITARALTV STEAVSKVYD GTVSASGTAI VTSGALKGSD TLSGGSFAFA DKHAGAGKTV TVSDVTIDDG NSGGNYILTY ADNTASEITA RALTVSTKAV SKVYDGTVSA SGTAIVTSGA LQGSDTLSGG SFAFADKHAG AGKTVTVSDV TLNDGNSGGN YILTYADNTA SEITARALTV STKAVSRVYD GTVSASGTAI VTSGALQGSD TLSGGSFAFA DKHAGAGKTV TVSDVTLNDG NSGGNYILTY ADNTASEITA RVLTVSLSGT VSKVYDGATA ATLSPGNYSL SGLVPGDVVS IVLLSSNYDT ADIGTGKTVS VAGLSLSGVD KANYLLGSSA ASAAIGEITS AVTPWDDSVK QVVEPLFDQE ESGKPDRVSL DETLGIRTGN RLDSGAGLLV NCMEPEGRVL KLVGSPVDVT GWQVATCMSG SL