Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6522 |
Symbol | |
ID | 5320825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009622 |
Strand | - |
Start bp | 211609 |
End bp | 215082 |
Gene Length | 3474 bp |
Protein Length | 1157 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640778071 |
Product | hypothetical protein |
Protein accession | YP_001315003 |
Protein GI | 150378409 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0162687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.104781 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTTCAA AAGCGACTAA AGCGAGGCGC GCCCCGGCAG CGTCCGACGA GTTTTCAATT GCCAGAGCCC AAGTTCGAAA GAAGAACAAC CCCGCTATGG GGTTGAGGAC GCTTAGGAGA AATGAAGGAC CTGCTAAACT GTACAGGCTA GTGGGACGAT TTGCTCGACA TCCCTCCTAC CGTGTTCTGA CGCTTGCCCA TCCCTTTCCC AAGAACATGT CGGAGCTGTC GAGGAACGTC ACTTACAAGA CAGAGGGTTT CGCCACGGAA TTCGCCTGGT TGGCAAGCAT AGTGAAAGCC TACGCTGCTG AAATCCAAGT TTTCGTGGGA ATGAGGCAGG AGCTTGAGAC AGCTTTCCTG TCAGGAAATC ATCAAAGGAT TGAAGAACTC CACCAGGAAG CGGAATCGAC TTTCGGACAA TCCCTTTGGC TTTCAGAAAG CCGCATCCTT TATTTCGATC TGCCTGGCGT AACCGGGAAC AGAGCCCATT ATACACAGTC GTTTCGGGAG ACAGACGCTC CACCGCTGTT CAATTTCCTA GTCAGTTGGA TATCTTACAG GGCTAGCGCC GGAGTCTCTG CCGGTGAAGT GGACCGATTG CTGGGCCAGG TCTTGAAGGA AGCCGCAGGT CTCGGTGGCG TAGCGTTTGC CCTAAATGGT CGATATCCAG AGGTTGATCA TGAGGCTGCG GCAGACATGC TGGGTTACAG TGATACGCTG CCGCTGATCG ACCGCTATCA AGTGATGTTG CTGGCGTTGC AGTGCCTCTT CGCCGACGGG GACCTTTCCG TCGAGGATGA TGATGCGATC AAAGTTGTGC TTTCCGATCT CTGCAGGCAC ATCGCGGACC CGCAGCTTTA TCGGCTGGCG CGAGCGCTGG GAATTCAAGT CTCCATTCCC TTGGATGAGC GGTTTCTTTC GCTGCAGGAC GACTTCAATG CTGGACTTTT CACCAAAGTC ATCGCTGAGA TCGGTGCCGC CGATGAAACG AAAGATATTG AGACTCTTGA GCTGCTGCTT CGCTCTAAAA TGCAGGTCGG GGAGCGCGAG GAGGCTGTCC CGGCTGAGAC TCAAGTTCGA ACGCTTTACA AGGAAATCGA GAACGATCTG GCGGCTGTTC TCTCATGTGC AGATGATGGT GTTGAAGCGC GCCTTAGACT TCAGAAAGTT ATTCTGTCGC ACTGCAATTC GGCTTGGGCG GCATCGCTCC GCCTCATCCT AGAACGGCAG ACGCACGATG AGCGGGTTTT CGAACCTAAT CGCATCCAAA CCGCGCTTGG CCTTCGATCG GCGATTGAGC AGCCCTATCT CATTCTTACG CTGCCCACAG GTAAAGTTAG GCACGAATTG GCAGCGCGGA TTGCGAATGA CCGCCCAAAC AGCGTCACCA TCGCTATGCT GGAGCAGCTT TGTGGGGATG AGAGCCCCTC TGCCGCAAGC TTCATCGGGA TGGATCAGGC CATAGCCGTC TTAAGGCATG GAGAGTCAGT CGAGGCGATC GAAATTTTAA AGAAGTTGAC CGCTTTAACT GAGCCTCAGA TCGTTCAGTT TGAAGCCTTT CGATTGCTTG CTGCCGCTTA TTGGCGCGAT GGTCAGATTG CGGAAATGGC CGATCTGACG GCTCGATTGT TCGTGCAATC ACGATATTTC GGTGCCATTT TGCCGATACG AGATCTGGTC GACGAGCTCC TTAAAAGACA CAACGAGCCG TTGGTCGGTT CGGCTACACG CGGTCGGCTC AGTGTCGCGA TCGTGTTCGA TATATTTTCT CGATATATCT CATCGGAGTA CGATGCTGAG CGGGCTGATG CCTACAAGGA CGTCCTAAAA GCCAATTGCA TTCGAAAAGC GTCTGAACTC ACCCAGCATC TTGACCGCTT CCCAGTCAAT GAGCTGGTCT ACTTCCTGAG ATACGTGTGC GTTTCCGACG TCCTTGATCA ATCGCTCGCA TTGGACACTT CGCGAGCAGT TGAAGACGAG CGTGTCGCGA TTCTGCTTCT CATCAGCGAA CTGATAGCAG GCAAAACCCC AGCAGCAATC AAAGACGAAC TCCGCGAGAT CAGGACAAGA CAGGTAGTTA GAGAAACCAC GCTCAGGCTG GATGAGAGTA AAATCTACGT CAACATTGAA GGGATCAGAC GCTCGATCGA TGTCTCGATG CGTGATGACT GGAATCGTTA CCGCCTTCTG AACGATGGTC TCGAGACTGC GCTATTCGAG ATTATTGAAA AGGTGAGAGG CGAAAAGGCC GATCCTCTAC GCATTATTAT CGCGAACCCA AGCGAGAGGT TTGCGCTCTT GCGGAAAATA ATTACCGTTC TACGTGACGA ATTCACAATG AATAAGGAGT TTGGGCTGAA TTCTAATCTC AGCACTAATA TTCGTCACGG TTATGTTGAG CGAGAGATAC GTGCCCCGCT TCTCGCCCGC AACCTGGTGA CGAATACCGA CAAGGGCTTC TATCTTCCAA ACAAGTTTTG GCTTGAACGG CTGGATAGCG GTGACCCGGA AGAGGAAGCT CGACTTTCGG CGCTTTTCAA CAAGTTCTCT ATGAGGATCG ATGACGAGAT CGATCGCGTG AATCGCAAAC TGCTGCGCGT AAATTCCGAA GAAACGCCTG AGGGTTTGTT CAAGTACGGG ATATCCGACA CCGCTTTAAT GGCTGCCGAT GCAACTTGGT CATCCAAGGA GTCGTTCGAC GAATTTATCG ACGCAGTCTT CTCGATCTTC TGGGAGGGAA CTCAGCGCAA TCTGGTTATG GTTCGCGCTG CGCTTACGAG CTATGTTCTC GCGGAGTTCA TCGACTCTTT GAACGCTTTG GCGCAAGACA TAGCGACCGA TGGATTCAGT CAGCGCCTAC CGGACTTGGA ACATGCCATA ACAATGGTGC AGACGGAAAC ACGAGCCGCA GTGGAACGTG TCGCGAGTTG GTTCACGCTG TCGAGCAATA ACGAATACCA AGATTTCGAT CTCGCGATTG CATTCCAGGC GGGAATGAAC ACCGTTAGGA CCTACTTCCG CAATCTTTCG ATACAGGAAG ACTACGCCTC GAACGGCGAA ATAATAATGA ACGGGTGGTG CTTGCCGATT TTCGCGCGCC TGTTCTTCCT CATTCTAGAT AACGCCGCGA CACATGGAGC CCGCAATCGA TCGCATCTCA AGATATCGAT GTTCGTCGAA GTTCGCGAAA ACCATCTGTT CATCAAGGCA ACCAATGATC TCCCTGTCGA CCATAACAAT GCCGAGCTTG CCGAGCGTGC AGCGGAGATC AATCGCGACT ACGGTCAGGC GAGAGCCATG GAACTGCTCT CCGAGGAGGG TGGCTCGGGT TATCCCAAGA TATGGAAGCT TCTGAAGACC GACCTAAGGA AAAACCACGA CCTGTTCGTG AGCGTTTCAA AAAATGAATT TGGCGTTGAG ATACTGCTGC CAACAGCGGG GATAGTAAAT GAGACTGTTG ATAGTCGAGG ATGA
|
Protein sequence | MRSKATKARR APAASDEFSI ARAQVRKKNN PAMGLRTLRR NEGPAKLYRL VGRFARHPSY RVLTLAHPFP KNMSELSRNV TYKTEGFATE FAWLASIVKA YAAEIQVFVG MRQELETAFL SGNHQRIEEL HQEAESTFGQ SLWLSESRIL YFDLPGVTGN RAHYTQSFRE TDAPPLFNFL VSWISYRASA GVSAGEVDRL LGQVLKEAAG LGGVAFALNG RYPEVDHEAA ADMLGYSDTL PLIDRYQVML LALQCLFADG DLSVEDDDAI KVVLSDLCRH IADPQLYRLA RALGIQVSIP LDERFLSLQD DFNAGLFTKV IAEIGAADET KDIETLELLL RSKMQVGERE EAVPAETQVR TLYKEIENDL AAVLSCADDG VEARLRLQKV ILSHCNSAWA ASLRLILERQ THDERVFEPN RIQTALGLRS AIEQPYLILT LPTGKVRHEL AARIANDRPN SVTIAMLEQL CGDESPSAAS FIGMDQAIAV LRHGESVEAI EILKKLTALT EPQIVQFEAF RLLAAAYWRD GQIAEMADLT ARLFVQSRYF GAILPIRDLV DELLKRHNEP LVGSATRGRL SVAIVFDIFS RYISSEYDAE RADAYKDVLK ANCIRKASEL TQHLDRFPVN ELVYFLRYVC VSDVLDQSLA LDTSRAVEDE RVAILLLISE LIAGKTPAAI KDELREIRTR QVVRETTLRL DESKIYVNIE GIRRSIDVSM RDDWNRYRLL NDGLETALFE IIEKVRGEKA DPLRIIIANP SERFALLRKI ITVLRDEFTM NKEFGLNSNL STNIRHGYVE REIRAPLLAR NLVTNTDKGF YLPNKFWLER LDSGDPEEEA RLSALFNKFS MRIDDEIDRV NRKLLRVNSE ETPEGLFKYG ISDTALMAAD ATWSSKESFD EFIDAVFSIF WEGTQRNLVM VRAALTSYVL AEFIDSLNAL AQDIATDGFS QRLPDLEHAI TMVQTETRAA VERVASWFTL SSNNEYQDFD LAIAFQAGMN TVRTYFRNLS IQEDYASNGE IIMNGWCLPI FARLFFLILD NAATHGARNR SHLKISMFVE VRENHLFIKA TNDLPVDHNN AELAERAAEI NRDYGQARAM ELLSEEGGSG YPKIWKLLKT DLRKNHDLFV SVSKNEFGVE ILLPTAGIVN ETVDSRG
|
| |