Gene Htur_4084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4084 
Symbol 
ID8744712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp336912 
End bp341153 
Gene Length4242 bp 
Protein Length1413 aa 
Translation table11 
GC content63% 
IMG OID646514644 
ProductRicin B lectin 
Protein accessionYP_003405591 
Protein GI284167313 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAATGG TGATAGACTC TCGAACCGTG ACAGAGAACC ACAAGCGGTC CGCTAGTACC 
GACAGCATCG GCCACGTCAA TCGGCGCGAT TGGTTGCAGA TCTTCGGGGA GGTCGCCGTT
TCGGCTAGCA TCGGCGGTGC GCTCACCGAG TCCGTAACGG CGTCGCACAC CGAATCCAAG
ATCGGTGACT GGCCTCTCGA CGAGGGAAGC GGTGATTCCA CCGTCGAAAG CGTCGCAGGA
TACCGAAAAC ACGTCGCTTA TAAGGGTGAC CGAGATCCGC TGTGGGTCGA CGGGAAAGGC
CAGAACGGCC TGCTGTTCGA CGGCTACACC ACGTGGGCGG CCTGGGAGGC CGACGAACTC
GCGCCGGAGT TCGGTCAGGA GCTCTCGGAG CTAACGATCG ACACCTGGGT CGCTCCGCGT
TCTCACGGCT CCGAGTCGAA TTACATCGAC CCGATCATCG AGAAGCGCGA CCGGAGCGCC
CAGCAGGGGT TCACGTTTGG CGTCGACAAC TACGGCCACT GGACGTTCCA GGTGGGTCTC
GGCGACAGCT GGGAGGAGAT TTGGGTCGAG AGCGGCGACC TGATCGACGT CTACCAGTGG
AACCACCTCA CAGCCGTCTT CGACGGAAAC GCGGGGTCAT TGCGGCTCTA CAAGGACGGG
AGCCTCGTCG CCGAGAACTC GACTCCAACC GGGACGATCG GCCCCGCCGA TGTTGCCCTC
AAGCTCGGGA GGAACTCCCA GATGGACTAC ATTGGCGAGG GCAAAGACGT CTGGAAGCAG
AACATGTTCA ACGGCGCCGT CAGCCAGCTG GAAACCTACG ACGCTGCACT CTCCGGCTCC
GAGATCCGGA GCAAGCACGA CAACGAAGTC AGCGGCCTCC CCGCGACTGG CTACCAGGAG
TTGACGGTCA ATCCAAAGCA GTACGACGGC GACAATCACC GGCCGGAGTA CCACGCGATC
GCGCCGACAC ACTGGATGAA CGAACCCCAC GGTCCGCTGT ACTTCGACGG GAAGTATCAT
CTGTTCTACC AGCACAACCC GAAGGGATCC TACTGGCGGC AGATCCACTG GGGTCACTGG
GTCAGCGACG ACATGGTCCA CTGGAACCCC GTCGAGGAGG CGCTCCGACC GGAGGAGGGA
ATCGACCCGG CGGGTTGCTG GTCGGGCGAC GCCGTCGTCG ACGTCGACGG CTGTCCGAAG
CTCCTCTACA CCGCGGGCCT CACGGAAGGA ACGGACGATC AAGCCATCGC CGAGGCAACG
TCGACGTTCC CCGAGGACGA CGACGTAGCG CTCACTGACT GGAACAAACA GGGGGTCGTG
ATGCGTCAGC CCGACGACCC CGGCCTGATG AAAAACGAGT TTCGCGACCC CCACGTCTGG
CAGGAGAACG GTGAATGGTA CTGTCTCGTC GGCTCGGGTC TCCAGAACGG CGGCGGGGGT
ACCGTGCTCG CGTTTCACTC CACGGATTGC GTCAACTGGA ACTACGAGGG ACGGGCGCTC
CAGCTGGACA GTCCCGACGA CTACCCGCAC CTCGGCGACA ACTGGGAACT GCCGGTTCTG
CTGCCGATCG GCACGGACGC CAACTGCAAC GAGAAGCACG TCCTTTGCAT CAGCCCTCAG
GGCGGCGACA CAGAGGTGTG GTACTGGATC GGGACTTGGG ACACCGCTAA CTTCGAGTTC
GTCCGTGATC ACCAAGATCC GCTGCTCATC GATGAGGGCA ACTTCCACTT CACCGGGCCC
AGCGGCTTCG TCGATCCGCA GACGGGCCGC TCAATCCTGT TCACGATCGC GCAGGATCAC
CGCAAGGAAC AGCTCTGGCA CGACTCCGGG TGGGCGCACA ACGCCGGGAC GCCGATAGAG
CTGTCGCTGC GCGACGACGG CCGCCTCGGC ATCGCGCCGA TCGGGGAGAT GGAGAGCGCC
CGATCAGAGA AGCTCCTAGA GATGAACGAC GCTGATCCAG CGCTCGTCGA CGACGCGCTA
GAGAACCTCA CCGCCGATAC CGTCGAACTG CAGTTAGAGA TCGAGTCGAA CGGCGCGACG
GAGTACGGCT TCTACTCCCA CTACTCGCCC GGCGGCGAGG AAAAGACGCT CGTCTACTAC
GACGAATCGA CCGGCGAGAT CAAGACCGAC CGGAACCAGA CCAGTCAGGA CTCGACGCTG
ATGGAAGCGG AGCAGGACAA GAGCAGTCTC ACGACGCAGG GGCCGGTCGA CGTCGGCAGC
GACAACCTGC GTCTGCGGGC GTTCATCGAC AAGTCGCTGA TCGAGTGCTA CGTCAACTCG
CTCAAGAGCG TCACGACGCG TGCGTACCCG TCGCGCGATG ACTCGACGGA GCTGCGACTG
TACCGAGACG GCGACATCAC CGTCCGATCA ATCGAGATCT GGGAGATGGA GGACATCCAG
AACGGACGGC CTAGTTCGGA ACGGTACCGA CCCGGGTACC ACTTCGAACG CGAGAGCGGC
TGGATGAACG ATCCAAACGG TCTGGTGTAC CACGACGGTA CGTACCACAT GTTCTACCAG
GCCGGCGAGT CCCGCCGACG GTGGGATCAC GCCACCAGCA CGGACCTGGT CAACTGGACC
GAGCAGGGAA CCAAGATCCC GGACACGGAC AGCGTTCAGG CGTACTCCGG TGGCGCGGTG
GTCGACGTGA ACGACACGGC CGGGTTCGGT GAGAACACAA TCGTGACCAT GTACACGGGC
CACCACGACG GCGGAGAAGA GGACCAGCGC ATCGCCTACA GCACCGACGG CGGCGACACC
TTCACCAAGT ACGGCGGGAA CCCGGTGCTC GACGAGGACA CCGGCAACTG GCGCGATCCG
AACCCGTTCT GGTACGATTC AACCGGCAAC TGGCGGATGG TCGTCGCGCG CGTCGAGGGG
AACGGCCCCG ACCGGCCGGC GGGGATCGAG ATCTACGAGT CCGACGACCT GAAGAACTGG
ACGTACCTGA GCACCTACGA GTCCGACGGC GTCGCCTGGG AGTGTCCCGA TCTGTTCGAG
CTCCCGGTCG AGAACGCCGA CGAGCGGCGT TGGGTGATGA CCGTCTCCGT CGACGCCGAC
CATGTCGAAC ACCACGTCGG CCAGTTCGAC GGCACGACGT TCGTCGCCGA TAACGAGGTG
TACGCGGACT CGGGACGGGA CTTCTACGCC GCCCAGAGCT GGACCAACGA ACCGGCGACG
GACTCTCGCC TGGGACTGGC GTGGACGAGC CACTGGGATT ACGCAGCTGA CACGCCCGAG
GACGGTTGGA AAGGCGCCCA GTCGTTCCCT CGCCGGATCA CCCTCAGGGA CACCGGGAGC
GGGATCGTAC CGATCCAGCG CCTCGACGGG GCGGTCGAGT CGAACCGCAA GGGCGTACTC
GCGGATCTGA GCGACGAACC CCTTTCGCCT ACCCACGACC CGTTGGCGGG CACCAACGTG
AAAGGTGAGA TGCTCGAGTT GCTCGCGACG ATCGATCCGG GCACTGCTGA CGCCGTGGTC
CTCGAGCTTC GCAAGGGCAT CAGTCAGGAG ACGCGCGTCA CCTACGACGT CGGCGCGGAG
GAGTTGTTCG TCGATCGAGG CGACGCGGGC GCCTTCTTCG GCGACACCGA CAAGGACGTC
GCGAGCCAAC CGGTGGCGTT GCGCGACGAC GGGACGCTCA CGGTGCGCGT GTTCGTCGAT
CGGAGTATCA TCTCCACGTT CGCCAACGAC GGCAAGAAGA CGATGACGAA TCGGATCTAC
CCGGACGAAA CGAGCGTCCA CGCGACGATG ACCGCGAGCG GCGGTACGGC GACCATCGAA
AGCCTCACCG CTTGGGACTA CTCCGAAGGA CTTGTAGACG GCGCGACCTA CCGAATCGAG
AACCGCAATA GCGGGAAAGT CCTTGAGATA CAGGACGGAG GAACCGGAGA CGGCGACACC
GTTCAGCAGT GGGAGTGGTG GGGCGGTGAT AATCAGAAAT GGAGCGCTCA CGAGGTTGAT
GGCGGTGTCT TCCGGTTCGA GAATGCCAAT AGCGGAAAAG TACTGGATAT CAAAGATGCA
GCCACTGACG ACGGCGCCTA CGCCATGCAA TGGGAGTGGT GGGGCGGCGA CAACCAGCAG
TGGACTGTCG ATAGGACACC GGATGGGTAC TACCGAGTCC GGAATGTCAA CAGCGGGAAG
GTGCTCGATA TCGAGGACAC TTTGACGCGC GACGGCGCCT ACGCTATGCA GTGGGAGTGG
TGGGGCGGCA ACAACCAGCA GTGGAAATTT GAGCGACTCT AA
 
Protein sequence
MLMVIDSRTV TENHKRSAST DSIGHVNRRD WLQIFGEVAV SASIGGALTE SVTASHTESK 
IGDWPLDEGS GDSTVESVAG YRKHVAYKGD RDPLWVDGKG QNGLLFDGYT TWAAWEADEL
APEFGQELSE LTIDTWVAPR SHGSESNYID PIIEKRDRSA QQGFTFGVDN YGHWTFQVGL
GDSWEEIWVE SGDLIDVYQW NHLTAVFDGN AGSLRLYKDG SLVAENSTPT GTIGPADVAL
KLGRNSQMDY IGEGKDVWKQ NMFNGAVSQL ETYDAALSGS EIRSKHDNEV SGLPATGYQE
LTVNPKQYDG DNHRPEYHAI APTHWMNEPH GPLYFDGKYH LFYQHNPKGS YWRQIHWGHW
VSDDMVHWNP VEEALRPEEG IDPAGCWSGD AVVDVDGCPK LLYTAGLTEG TDDQAIAEAT
STFPEDDDVA LTDWNKQGVV MRQPDDPGLM KNEFRDPHVW QENGEWYCLV GSGLQNGGGG
TVLAFHSTDC VNWNYEGRAL QLDSPDDYPH LGDNWELPVL LPIGTDANCN EKHVLCISPQ
GGDTEVWYWI GTWDTANFEF VRDHQDPLLI DEGNFHFTGP SGFVDPQTGR SILFTIAQDH
RKEQLWHDSG WAHNAGTPIE LSLRDDGRLG IAPIGEMESA RSEKLLEMND ADPALVDDAL
ENLTADTVEL QLEIESNGAT EYGFYSHYSP GGEEKTLVYY DESTGEIKTD RNQTSQDSTL
MEAEQDKSSL TTQGPVDVGS DNLRLRAFID KSLIECYVNS LKSVTTRAYP SRDDSTELRL
YRDGDITVRS IEIWEMEDIQ NGRPSSERYR PGYHFERESG WMNDPNGLVY HDGTYHMFYQ
AGESRRRWDH ATSTDLVNWT EQGTKIPDTD SVQAYSGGAV VDVNDTAGFG ENTIVTMYTG
HHDGGEEDQR IAYSTDGGDT FTKYGGNPVL DEDTGNWRDP NPFWYDSTGN WRMVVARVEG
NGPDRPAGIE IYESDDLKNW TYLSTYESDG VAWECPDLFE LPVENADERR WVMTVSVDAD
HVEHHVGQFD GTTFVADNEV YADSGRDFYA AQSWTNEPAT DSRLGLAWTS HWDYAADTPE
DGWKGAQSFP RRITLRDTGS GIVPIQRLDG AVESNRKGVL ADLSDEPLSP THDPLAGTNV
KGEMLELLAT IDPGTADAVV LELRKGISQE TRVTYDVGAE ELFVDRGDAG AFFGDTDKDV
ASQPVALRDD GTLTVRVFVD RSIISTFAND GKKTMTNRIY PDETSVHATM TASGGTATIE
SLTAWDYSEG LVDGATYRIE NRNSGKVLEI QDGGTGDGDT VQQWEWWGGD NQKWSAHEVD
GGVFRFENAN SGKVLDIKDA ATDDGAYAMQ WEWWGGDNQQ WTVDRTPDGY YRVRNVNSGK
VLDIEDTLTR DGAYAMQWEW WGGNNQQWKF ERL