Gene Information |
Locus tag | Htur_2678 |
Symbol | |
ID | 8743292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2748463 |
End bp | 2751372 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646513265 |
Product | hypothetical protein |
Protein accession | YP_003404225 |
Protein GI | 284165946 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAT CAAGTGAAAT TCACCAAAAC TTGTTTCGGC TATACGAACA CTACGTCGGT GAACCCGACT CGAGCAAGGA CGTCTACGGC TACTGGGTGT TTATCGTGGG CTACCTGCTC GGGGCGGCCG GCATTTTCAC CTACGTCGTC GGCTACGCCG GCAGCGCGGG ATCCTATACC CTGATCAGGA TCTCCGGCGT CACCGCGGCA ACGGGGCTGG CGCTCTGTCT GTTCGGCATC GTCTTGATGC TCCCGGTGCG CAGGATCGGG ATCTACGCGA GCGCGCTCGG ACTGCTGGTT GCACTCGGCG GCGTCGTCTT TTTCGGCTGG GCCTATCCCA ACAACTGGCG GGATCTGGGA ACCGACTACA GCGTTCAGGT CATCTCTGTT TACACGCTCG GGATCGGACT GATCGCGGGC GTCACCGCTC TCGTTCCCAT TCTGACGGGC CAGAAGGGGA TGTTCGTCGA GGAAGAGGGT GCGACCGACG ATCCGGAGAT CATGACCGGC GACGCGATGG AGGGCGCCCA GTTCGCCGTC TTCCGCGACG AGGACGGCGA CTGGAAGTGG AACGTTCTCC ACCTCGAGGC GCTCGCGACG AGCAACGACA GCGCGGTCAC ACGACCCGAG GCGACCGAGA GCATCGAGCG CGTCCAATCC CAGATCAGTT CCGCGGGGCT GATGGAGTTG ACCACCTCCG CGTTCCGGCT CTACGAGGAT CGCGACGGGA CCTGGCAGTG GACGCTGGCC CGCGACGACG GCAGCGTCGT CGGCGCCAGC ACCGGTGAGT TCGAAGAGCG CGACGGCGCC GAGAACTCGG TCAGCTTCCT CAAGGACCGC GGCCCCGAGG CGGACGTTAT CGAGATCGAG GGCGCGGCGT TCACCTACGA GGAACGGCGC GACCAGTGGT ACTGGCAGTT GGTCGACGAC GAGCGAACGC CGCTGGCCTC GACCGATACC GGCCACCGAA CGCTGGAGGC CGCCGAGGGG GCGGCCCGGA CGTTCGCCGA GCGGTTCGAT CGGGCGCGGC TGCTCGACAT CGAGCACGTC GGCGTCGAAC TCGTCGAGCG CGCCGGCGAC TGGACGTGGC GGTTCGTCGA CGACCGCGAC GAGGTCGTCG CCACCGCCTC CGACGACTAC AACTCCCGCC GCGACGCCGA GGCGGCCGCC GAAGCGCTGC TCCCCGCGCT CGAGTCGGCC GCCGTCACGG TCGCCGGGGA GCCGACCTAC GAATGCTACG AGTCGGGTTC GCAGTGGCGC TGGCGGCTCG TCGACGAGTC CGAACACGTC GTCGCCCGAA GTCCGACCGA CGCGACGGCC CGCGGGGCCG TCGAGGAGAC CGCCGATCGG TTCGGCGATC ACGCTCGCGT CGCCGACGTC GTCGAGATCG ACGACGCGGA GTACGAGGTC TACCCCGCGG AGAACGGGCC CTCGGCCGCC GCGGACGATG GGGACAACCT CCCGGCGGCG GTCGACGAGG CGATGACCGA CGGCGGCACC GAACTCGAGT TCGAGGACGG GGCGGGCCAG ACGCCGACCG GTCCCGACTG GAACTGGCGG CTCGTCACGG AGGACCGCGA CGTCGTCGCC GCGAGCACTG AGCCTCATCC CGACGCCGAG TCGGCGACCG ACGCCATCGA GCGCGTTCGC CAACAGGCCA GCGAGGCCGA TCTCATCGAG TTCGAACACG CCGCGTTCCA GGTCTACGAG GCCGACGACA GCGAGTGGCG CTGGCGGCTC ATCGACGAGG ACGGCAACGT GCTGGCCGAC AGCGGTGAGG AGCACACCTC CCGCGGCGAG 
GCCGCCGAGG CGATGATGAC GCTCAAGGAA CAGGCCCCCG AGGCCGAACT GCTCGAGATC GAGACCGCCG CCTTCGAGCT GTTCGTCAAC GAGGACGACG AGTGGGGCTG GCGGCTCATC GACGAGGCCG GCAAACTCGT CGCCGAGGAC CCCGCGACCC ACCCCACCCG CGGCGCCGCC CGCCAGGCGA TGAACCGCCT GCTCGAGCAT CTCGATTCGG ACGTCCGGAC GATGGACGAC GCGATCTTCC AGCCCTACGC GGACGAGGAC TGGCAGTGGC GCTTCGTCCT GCCCTCGGGC GAGACCGTCG CGGAGGCCGG GGACACCCAC GCCACCCGCG ACGAGCTCGT CGCGGACTTA GACGACGTCC GCGAGTCGGC CGCGTCGGCC CGCACGCACG CGATCGGCGA GGTCGCCGTC CAGCTCTACG ACAGCGGCGG CTGGCACTGG CGGCTGCTCG ACCGCGACCG CGAGGAGGTC GCCGACTCGA CGGTCACCTA CGCCGACTGC GACGGCGCCG TCGGCGGCGT CGAAGCGCTG CAGACCCACG TCGCCGACGC GCCGATCTTC GCCATCGAGG ACGCCGCGAT CCGGCTGAAC CGCGACGACG GCTGGTCGTG GGAGCTCGTC GACCGCGAGC GCGAGGTGAT CGCGAGCGCG GTCGGTGCGG GGGCATCCAA GGCCGCCGTG CTCGACGATA TCGAGGCCGT GCGCCAGCTG GCACCGATGG CCGGCCGCGT CGACTTCGAC GTCGCCTCGT TCGAACTGGT CGCCGACGAC GAGGGCCGCT GGCAGTGGCG GCTCATCGAC GAGGACGGGC GCACCATCGC GAGCGGCACC GAAGCGCACG ACTCGACCGA GGCCGTCCGC GCGGCCCTAG AGGACGTCCG CGAACTGATC GCCGACGCGA GCATCCTCGA GATCGACAGC GTCTCCTTCG AACTCCACAC CGCCGAAGGC GGCGACGGCT GGGTCTGGCA GCTGATCGAC GAGTACGGGA CGACGATGGC CGAGAGCACT CAGACCTACG AGAACCGCAC CGAGGCCCGC GAGGCGATGA ACGACGTGAA GGCCCACGCG CCCAACGGCT GGATCACCTT CACGGAGTAA
|
Protein sequence | MSSSSEIHQN LFRLYEHYVG EPDSSKDVYG YWVFIVGYLL GAAGIFTYVV GYAGSAGSYT LIRISGVTAA TGLALCLFGI VLMLPVRRIG IYASALGLLV ALGGVVFFGW AYPNNWRDLG TDYSVQVISV YTLGIGLIAG VTALVPILTG QKGMFVEEEG ATDDPEIMTG DAMEGAQFAV FRDEDGDWKW NVLHLEALAT SNDSAVTRPE ATESIERVQS QISSAGLMEL TTSAFRLYED RDGTWQWTLA RDDGSVVGAS TGEFEERDGA ENSVSFLKDR GPEADVIEIE GAAFTYEERR DQWYWQLVDD ERTPLASTDT GHRTLEAAEG AARTFAERFD RARLLDIEHV GVELVERAGD WTWRFVDDRD EVVATASDDY NSRRDAEAAA EALLPALESA AVTVAGEPTY ECYESGSQWR WRLVDESEHV VARSPTDATA RGAVEETADR FGDHARVADV VEIDDAEYEV YPAENGPSAA ADDGDNLPAA VDEAMTDGGT ELEFEDGAGQ TPTGPDWNWR LVTEDRDVVA ASTEPHPDAE SATDAIERVR QQASEADLIE FEHAAFQVYE ADDSEWRWRL IDEDGNVLAD SGEEHTSRGE AAEAMMTLKE QAPEAELLEI ETAAFELFVN EDDEWGWRLI DEAGKLVAED PATHPTRGAA RQAMNRLLEH LDSDVRTMDD AIFQPYADED WQWRFVLPSG ETVAEAGDTH ATRDELVADL DDVRESAASA RTHAIGEVAV QLYDSGGWHW RLLDRDREEV ADSTVTYADC DGAVGGVEAL QTHVADAPIF AIEDAAIRLN RDDGWSWELV DREREVIASA VGAGASKAAV LDDIEAVRQL APMAGRVDFD VASFELVADD EGRWQWRLID EDGRTIASGT EAHDSTEAVR AALEDVRELI ADASILEIDS VSFELHTAEG GDGWVWQLID EYGTTMAEST QTYENRTEAR EAMNDVKAHA PNGWITFTE
|
| |
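
The coordinate and length fields in the Gene Information table can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length_bp` and `expected_protein_length_aa` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values copied from the record for Htur_2678.
start_bp, end_bp = 2748463, 2751372

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 2910, matches "Gene Length"
print(expected_protein_length_aa(length))  # 969, matches "Protein Length"
```

Because the gene lies on the minus strand, the listed coordinates refer to the forward strand of the replicon; the length calculation is unaffected by strand.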