Gene Information |
Locus tag | Htur_4084 |
Symbol | |
ID | 8744712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 336912 |
End bp | 341153 |
Gene Length | 4242 bp |
Protein Length | 1413 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514644 |
Product | Ricin B lectin |
Protein accession | YP_003405591 |
Protein GI | 284167313 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
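The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information table for Htur_4084.
length = gene_length_bp(336912, 341153)
print(length)                     # 4242
print(protein_length_aa(length))  # 1413
```

Both results match the Gene Length (4242 bp) and Protein Length (1413 aa) fields in the record.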
Plasmid Coverage Information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTAATGG TGATAGACTC TCGAACCGTG ACAGAGAACC ACAAGCGGTC CGCTAGTACC GACAGCATCG GCCACGTCAA TCGGCGCGAT TGGTTGCAGA TCTTCGGGGA GGTCGCCGTT TCGGCTAGCA TCGGCGGTGC GCTCACCGAG TCCGTAACGG CGTCGCACAC CGAATCCAAG ATCGGTGACT GGCCTCTCGA CGAGGGAAGC GGTGATTCCA CCGTCGAAAG CGTCGCAGGA TACCGAAAAC ACGTCGCTTA TAAGGGTGAC CGAGATCCGC TGTGGGTCGA CGGGAAAGGC CAGAACGGCC TGCTGTTCGA CGGCTACACC ACGTGGGCGG CCTGGGAGGC CGACGAACTC GCGCCGGAGT TCGGTCAGGA GCTCTCGGAG CTAACGATCG ACACCTGGGT CGCTCCGCGT TCTCACGGCT CCGAGTCGAA TTACATCGAC CCGATCATCG AGAAGCGCGA CCGGAGCGCC CAGCAGGGGT TCACGTTTGG CGTCGACAAC TACGGCCACT GGACGTTCCA GGTGGGTCTC GGCGACAGCT GGGAGGAGAT TTGGGTCGAG AGCGGCGACC TGATCGACGT CTACCAGTGG AACCACCTCA CAGCCGTCTT CGACGGAAAC GCGGGGTCAT TGCGGCTCTA CAAGGACGGG AGCCTCGTCG CCGAGAACTC GACTCCAACC GGGACGATCG GCCCCGCCGA TGTTGCCCTC AAGCTCGGGA GGAACTCCCA GATGGACTAC ATTGGCGAGG GCAAAGACGT CTGGAAGCAG AACATGTTCA ACGGCGCCGT CAGCCAGCTG GAAACCTACG ACGCTGCACT CTCCGGCTCC GAGATCCGGA GCAAGCACGA CAACGAAGTC AGCGGCCTCC CCGCGACTGG CTACCAGGAG TTGACGGTCA ATCCAAAGCA GTACGACGGC GACAATCACC GGCCGGAGTA CCACGCGATC GCGCCGACAC ACTGGATGAA CGAACCCCAC GGTCCGCTGT ACTTCGACGG GAAGTATCAT CTGTTCTACC AGCACAACCC GAAGGGATCC TACTGGCGGC AGATCCACTG GGGTCACTGG GTCAGCGACG ACATGGTCCA CTGGAACCCC GTCGAGGAGG CGCTCCGACC GGAGGAGGGA ATCGACCCGG CGGGTTGCTG GTCGGGCGAC GCCGTCGTCG ACGTCGACGG CTGTCCGAAG CTCCTCTACA CCGCGGGCCT CACGGAAGGA ACGGACGATC AAGCCATCGC CGAGGCAACG TCGACGTTCC CCGAGGACGA CGACGTAGCG CTCACTGACT GGAACAAACA GGGGGTCGTG ATGCGTCAGC CCGACGACCC CGGCCTGATG AAAAACGAGT TTCGCGACCC CCACGTCTGG CAGGAGAACG GTGAATGGTA CTGTCTCGTC GGCTCGGGTC TCCAGAACGG CGGCGGGGGT ACCGTGCTCG CGTTTCACTC CACGGATTGC GTCAACTGGA ACTACGAGGG ACGGGCGCTC CAGCTGGACA GTCCCGACGA CTACCCGCAC CTCGGCGACA ACTGGGAACT GCCGGTTCTG CTGCCGATCG GCACGGACGC CAACTGCAAC GAGAAGCACG TCCTTTGCAT CAGCCCTCAG GGCGGCGACA CAGAGGTGTG GTACTGGATC GGGACTTGGG ACACCGCTAA CTTCGAGTTC GTCCGTGATC ACCAAGATCC GCTGCTCATC GATGAGGGCA ACTTCCACTT CACCGGGCCC AGCGGCTTCG TCGATCCGCA GACGGGCCGC TCAATCCTGT TCACGATCGC GCAGGATCAC 
CGCAAGGAAC AGCTCTGGCA CGACTCCGGG TGGGCGCACA ACGCCGGGAC GCCGATAGAG CTGTCGCTGC GCGACGACGG CCGCCTCGGC ATCGCGCCGA TCGGGGAGAT GGAGAGCGCC CGATCAGAGA AGCTCCTAGA GATGAACGAC GCTGATCCAG CGCTCGTCGA CGACGCGCTA GAGAACCTCA CCGCCGATAC CGTCGAACTG CAGTTAGAGA TCGAGTCGAA CGGCGCGACG GAGTACGGCT TCTACTCCCA CTACTCGCCC GGCGGCGAGG AAAAGACGCT CGTCTACTAC GACGAATCGA CCGGCGAGAT CAAGACCGAC CGGAACCAGA CCAGTCAGGA CTCGACGCTG ATGGAAGCGG AGCAGGACAA GAGCAGTCTC ACGACGCAGG GGCCGGTCGA CGTCGGCAGC GACAACCTGC GTCTGCGGGC GTTCATCGAC AAGTCGCTGA TCGAGTGCTA CGTCAACTCG CTCAAGAGCG TCACGACGCG TGCGTACCCG TCGCGCGATG ACTCGACGGA GCTGCGACTG TACCGAGACG GCGACATCAC CGTCCGATCA ATCGAGATCT GGGAGATGGA GGACATCCAG AACGGACGGC CTAGTTCGGA ACGGTACCGA CCCGGGTACC ACTTCGAACG CGAGAGCGGC TGGATGAACG ATCCAAACGG TCTGGTGTAC CACGACGGTA CGTACCACAT GTTCTACCAG GCCGGCGAGT CCCGCCGACG GTGGGATCAC GCCACCAGCA CGGACCTGGT CAACTGGACC GAGCAGGGAA CCAAGATCCC GGACACGGAC AGCGTTCAGG CGTACTCCGG TGGCGCGGTG GTCGACGTGA ACGACACGGC CGGGTTCGGT GAGAACACAA TCGTGACCAT GTACACGGGC CACCACGACG GCGGAGAAGA GGACCAGCGC ATCGCCTACA GCACCGACGG CGGCGACACC TTCACCAAGT ACGGCGGGAA CCCGGTGCTC GACGAGGACA CCGGCAACTG GCGCGATCCG AACCCGTTCT GGTACGATTC AACCGGCAAC TGGCGGATGG TCGTCGCGCG CGTCGAGGGG AACGGCCCCG ACCGGCCGGC GGGGATCGAG ATCTACGAGT CCGACGACCT GAAGAACTGG ACGTACCTGA GCACCTACGA GTCCGACGGC GTCGCCTGGG AGTGTCCCGA TCTGTTCGAG CTCCCGGTCG AGAACGCCGA CGAGCGGCGT TGGGTGATGA CCGTCTCCGT CGACGCCGAC CATGTCGAAC ACCACGTCGG CCAGTTCGAC GGCACGACGT TCGTCGCCGA TAACGAGGTG TACGCGGACT CGGGACGGGA CTTCTACGCC GCCCAGAGCT GGACCAACGA ACCGGCGACG GACTCTCGCC TGGGACTGGC GTGGACGAGC CACTGGGATT ACGCAGCTGA CACGCCCGAG GACGGTTGGA AAGGCGCCCA GTCGTTCCCT CGCCGGATCA CCCTCAGGGA CACCGGGAGC GGGATCGTAC CGATCCAGCG CCTCGACGGG GCGGTCGAGT CGAACCGCAA GGGCGTACTC GCGGATCTGA GCGACGAACC CCTTTCGCCT ACCCACGACC CGTTGGCGGG CACCAACGTG AAAGGTGAGA TGCTCGAGTT GCTCGCGACG ATCGATCCGG GCACTGCTGA CGCCGTGGTC CTCGAGCTTC GCAAGGGCAT CAGTCAGGAG ACGCGCGTCA CCTACGACGT CGGCGCGGAG GAGTTGTTCG TCGATCGAGG CGACGCGGGC GCCTTCTTCG GCGACACCGA CAAGGACGTC GCGAGCCAAC 
CGGTGGCGTT GCGCGACGAC GGGACGCTCA CGGTGCGCGT GTTCGTCGAT CGGAGTATCA TCTCCACGTT CGCCAACGAC GGCAAGAAGA CGATGACGAA TCGGATCTAC CCGGACGAAA CGAGCGTCCA CGCGACGATG ACCGCGAGCG GCGGTACGGC GACCATCGAA AGCCTCACCG CTTGGGACTA CTCCGAAGGA CTTGTAGACG GCGCGACCTA CCGAATCGAG AACCGCAATA GCGGGAAAGT CCTTGAGATA CAGGACGGAG GAACCGGAGA CGGCGACACC GTTCAGCAGT GGGAGTGGTG GGGCGGTGAT AATCAGAAAT GGAGCGCTCA CGAGGTTGAT GGCGGTGTCT TCCGGTTCGA GAATGCCAAT AGCGGAAAAG TACTGGATAT CAAAGATGCA GCCACTGACG ACGGCGCCTA CGCCATGCAA TGGGAGTGGT GGGGCGGCGA CAACCAGCAG TGGACTGTCG ATAGGACACC GGATGGGTAC TACCGAGTCC GGAATGTCAA CAGCGGGAAG GTGCTCGATA TCGAGGACAC TTTGACGCGC GACGGCGCCT ACGCTATGCA GTGGGAGTGG TGGGGCGGCA ACAACCAGCA GTGGAAATTT GAGCGACTCT AA
Protein sequence | MLMVIDSRTV TENHKRSAST DSIGHVNRRD WLQIFGEVAV SASIGGALTE SVTASHTESK IGDWPLDEGS GDSTVESVAG YRKHVAYKGD RDPLWVDGKG QNGLLFDGYT TWAAWEADEL APEFGQELSE LTIDTWVAPR SHGSESNYID PIIEKRDRSA QQGFTFGVDN YGHWTFQVGL GDSWEEIWVE SGDLIDVYQW NHLTAVFDGN AGSLRLYKDG SLVAENSTPT GTIGPADVAL KLGRNSQMDY IGEGKDVWKQ NMFNGAVSQL ETYDAALSGS EIRSKHDNEV SGLPATGYQE LTVNPKQYDG DNHRPEYHAI APTHWMNEPH GPLYFDGKYH LFYQHNPKGS YWRQIHWGHW VSDDMVHWNP VEEALRPEEG IDPAGCWSGD AVVDVDGCPK LLYTAGLTEG TDDQAIAEAT STFPEDDDVA LTDWNKQGVV MRQPDDPGLM KNEFRDPHVW QENGEWYCLV GSGLQNGGGG TVLAFHSTDC VNWNYEGRAL QLDSPDDYPH LGDNWELPVL LPIGTDANCN EKHVLCISPQ GGDTEVWYWI GTWDTANFEF VRDHQDPLLI DEGNFHFTGP SGFVDPQTGR SILFTIAQDH RKEQLWHDSG WAHNAGTPIE LSLRDDGRLG IAPIGEMESA RSEKLLEMND ADPALVDDAL ENLTADTVEL QLEIESNGAT EYGFYSHYSP GGEEKTLVYY DESTGEIKTD RNQTSQDSTL MEAEQDKSSL TTQGPVDVGS DNLRLRAFID KSLIECYVNS LKSVTTRAYP SRDDSTELRL YRDGDITVRS IEIWEMEDIQ NGRPSSERYR PGYHFERESG WMNDPNGLVY HDGTYHMFYQ AGESRRRWDH ATSTDLVNWT EQGTKIPDTD SVQAYSGGAV VDVNDTAGFG ENTIVTMYTG HHDGGEEDQR IAYSTDGGDT FTKYGGNPVL DEDTGNWRDP NPFWYDSTGN WRMVVARVEG NGPDRPAGIE IYESDDLKNW TYLSTYESDG VAWECPDLFE LPVENADERR WVMTVSVDAD HVEHHVGQFD GTTFVADNEV YADSGRDFYA AQSWTNEPAT DSRLGLAWTS HWDYAADTPE DGWKGAQSFP RRITLRDTGS GIVPIQRLDG AVESNRKGVL ADLSDEPLSP THDPLAGTNV KGEMLELLAT IDPGTADAVV LELRKGISQE TRVTYDVGAE ELFVDRGDAG AFFGDTDKDV ASQPVALRDD GTLTVRVFVD RSIISTFAND GKKTMTNRIY PDETSVHATM TASGGTATIE SLTAWDYSEG LVDGATYRIE NRNSGKVLEI QDGGTGDGDT VQQWEWWGGD NQKWSAHEVD GGVFRFENAN SGKVLDIKDA ATDDGAYAMQ WEWWGGDNQQ WTVDRTPDGY YRVRNVNSGK VLDIEDTLTR DGAYAMQWEW WGGNNQQWKF ERL
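The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the example uses only the first two and last two blocks of the gene sequence rather than the full 4242 bp.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence shown above.
head = clean_sequence("ATGTTAATGG TGATAGACTC")
tail = clean_sequence("GAGCGACTCT AA")

print(len(head), round(gc_fraction(head), 2))  # 20 0.35
print(ends_with_stop(tail))                    # True
```

Applying `clean_sequence` and `gc_fraction` to the complete gene sequence should give a value comparable to the GC content field, assuming that field was computed over the gene itself.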