Gene Information |
Locus tag | Htur_3647 |
Symbol | |
ID | 8744273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3756337 |
End bp | 3757737 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646514234 |
Product | selenium-binding protein |
Protein accession | YP_003405182 |
Protein GI | 284166903 |
COG category | |
COG ID | |
TIGRFAM ID | |
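The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length), and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3756337
end_bp = 3757737

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# which is not translated into an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1401 466
```

Both results agree with the Gene Length (1401 bp) and Protein Length (466 aa) rows.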
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATG TTAACGAACC CAGTGACGTC GAACCGGACC ACGAGCACGA CCACCACCAC GAGGGCCCCG GCTACGCGAC GCCGCAGGCC GCCATCGAGG AGGGCGAGCG AGAGGAACTG GCCTACGTGA TGAGCCTCTA CGTCGGCACG GACGTCGACG CGCCGGACTT CGTCTCGGTC GTCGACCTCG ATCCCGACTC CGACACCTAC TGCGAGATCG TCGACCGCAT CGAACTGCCC AACCGCGGCG ACGAACTCCA CCACTTCGGG TGGAACGCCT GCTCGTCGTC GTGTCACATG GAGGGCCTCG AGCGCCGCCA CCTGATCGTC CCCGGCCAGC GCTCCTCGCG GATCCACGTG ATCGACGCGA AGGATCGGCG CAACCCCGAA CTCGAGACGG TGATCGAACC CGAGGAGGTC TTCGAATACG ACCTCTCGGC ACCGCACACC GTCCACTGCA TCCCGGACGG CGAGATCCTG ATCAGCATGC TCGGCGACGC CGACGGCGAG TTACCGGGCG GCTTCCTCGA GCTGAACGAC GACTTCGAGA TCGAGGGCCG GTGGGAGCCG CCGGGCGAGA TCGAGATGAA CTACGACTAC TGGTACCAGC CCCGGCAGAA CGTGATGGTC TCGAGCGAGT GGGCTGCCCC CAAAACGTAC TACCCGGGCT TCGACCTTGA GGACGTCGAG GCCGGGAACT ACGGCCAGCG CCTCCATTTC TGGGACTGGG AGGCCGGCAC CGTCGAGCAG ACCATCGACC TCGGCGAGGA GGGGTTGATC CCGCTCGAGG TGCGCTTCCT CCACACCCCC GAGTCGACCC ACGGGTTCGT CGGGGCCGCG CTCTCGTCGA ATATCTTCCA CTTCTGGCGC GACGGTGAGT CCGGCGAGTA CCGCGCCGAG AAGGTCATCG ACTTCGAGAG CCGGGAGCAC GACGACTGGG ACATGCCCGT CCCCGCGCTC CCGACGGATA TCCTGATCTC GATGGACGAC CGCTACCTGT TCGGCTCGAA CTGGCTCCAC GGCGAGGTCT GGATGTACGA TATCTCGGAC CCGTCGAACC CGCGGCGGGC CGACTCGCTG TCGGTCGGGG GAACCTTCGG CGAGGTGCAG GAGGTCCAGA ACCGCGAACT GTCCGCGGGC CCCCAGATGA TTCAGCTCTC GCTGGATGGC GAACGGCTCT ACTGGACCAC CTCGCTGTTC TCCTCGTGGG ACGAGCAGTT CTACCCCGAG GAGGGCGAGC GCGGCTCGGT GATGCTGAAG GCCGACGTCG ATCCTCGGAA AGGAACGATG GAACTCGACG AGGACTTCCT CGTGGACTGG GGCGAGTGTC CTGAGGGTCC AGCCCGCGCT CACGAGATCC GCTGGCCCGA CGGCGACTGC ACGAGCGACG TCTGGCAGTG A
|
Protein sequence | MSDVNEPSDV EPDHEHDHHH EGPGYATPQA AIEEGEREEL AYVMSLYVGT DVDAPDFVSV VDLDPDSDTY CEIVDRIELP NRGDELHHFG WNACSSSCHM EGLERRHLIV PGQRSSRIHV IDAKDRRNPE LETVIEPEEV FEYDLSAPHT VHCIPDGEIL ISMLGDADGE LPGGFLELND DFEIEGRWEP PGEIEMNYDY WYQPRQNVMV SSEWAAPKTY YPGFDLEDVE AGNYGQRLHF WDWEAGTVEQ TIDLGEEGLI PLEVRFLHTP ESTHGFVGAA LSSNIFHFWR DGESGEYRAE KVIDFESREH DDWDMPVPAL PTDILISMDD RYLFGSNWLH GEVWMYDISD PSNPRRADSL SVGGTFGEVQ EVQNRELSAG PQMIQLSLDG ERLYWTTSLF SSWDEQFYPE EGERGSVMLK ADVDPRKGTM ELDEDFLVDW GECPEGPARA HEIRWPDGDC TSDVWQ
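The relationship between the two sequences above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method. It assumes the gene sequence is displayed on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base display blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_blocks = "ATGAGTGATG TTAACGAACC CAGTGACGTC"
print(translate(first_blocks))  # MSDVNEPSDV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would be the way to compare against the complete 466 aa entry.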