Gene Information |
Locus tag | Htur_2025 |
Symbol | |
ID | 8742624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2096452 |
End bp | 2098038 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646512607 |
Product | protein of unknown function DUF790 |
Protein accession | YP_003403582 |
Protein GI | 284165303 |
COG category | [S] Function unknown |
COG ID | [COG3372] Uncharacterized conserved protein |
TIGRFAM ID | |
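The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2096452
end_bp = 2098038
gene_length_bp = 1587
protein_length_aa = 528

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1587 0 528
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.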
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTGACCA AGGACCTGCT CCGCGTCTCG CGGGCCGGAG GCGGCTACCA CCTCCAGTTC GCCGATCGGG AGCACCGTCC GCTCGCCGCC CGCGTCATTG GGACGTATCA GGGCCACGTC GGCGAATCTC GCGCGGAACT CGAGGCGGCC GTGACTGAAC TTGAACGCGG TGCGGACGAT TTCAAGCTCG TCAGAGGGCT GTCGGCGCTG CTCGAGCGCG ACGCGACGTT CGAGACCGAC GCCGAGATCG ATCCCGAACG CGCTCGTCGG GCTGCCTTCG AGGCCGCCGA GGCCGTCGGC GTCGTGACCG AGGACGAGCG TGCGATGGCC CTCGTTCGCG CCGGCGAGTC GCTGGGCGTC TCGGCCGACG ACGTCGCGGG GGCGCTGTAC GCCGACCTCG AGGAGCGGCA GGTCCTCATC GAACTGGCGT CGCGGTGGGA GCCGGACGAG CTGGTGGCCC AGTACAACCT CTCGCTGGCA CAGACCGCCC TTTTCGACGC AACCGAGCTT CGGGTCCGCT CGAGCGATCC GAAGGCGCTC GTTTCGGCGA TCAAGCGACT GCGACTGATG TACGAAATTC GACGGCTGGA GAACGACGAG GTCGGCGAAG CGCCCGATCG AGGCATCGCC GAGCGCGAGG TGATCGTCAC CGGGCCGACC CACCTCTTCC GGGCGACCCG CCGGTACGGC ACTCGGTTTG CCCGCCTCTT GCGGACGGTC GCGAAAGCCG AGGAGTGGCG CCTCGAGGCG ACGATCGACG ACCGCGGGAC CGAACGGACG CTCCGTCTGT CCCACGAGGA TCCCGTCCGC GTCCCCGACG CAGAGCCAGT TGCCGAGGTC TCCTTCGACA GCGGCGTCGA GGCCGATTTC GCCGCGCGCT TCTCGACTCT CGATCTCGAG TGGGATCTCG TGCGCGAACC CGCGCCCCTC GCGACGGGAA CGCGGGTGAT GATCCCCGAT TTCGCGTTCG ACTATCGTCC TGGGGGCAGC GCCCGCAGGG ACTCGTCGGA CGAGTCCGAC GGAGGGCACA GCGAGTTCCG CGTCTACTTC GAAATCATGG GCTTCTGGAC GCCCGAGTAC GTCGAGAAGA AACTCGCACA GCTGTCGGAC CTCGAGGACG TCGAACTGAT CGTCGCCGTC GACGAGTCCC TCGGCGTCGG CGAGGAAATC GCGGCCCGAG ACTTCCGGGC GATCCCCTAC TCCGGAAGCG TCCGGCTGAA GGATGTCGCC GGCGTCCTCC GGGAGTACGA GCGCCAACTC GTCGCCGAGA GCGCCGCCGC GCTGCCGGAC GAACTGTGCC CCGACGAGGA CGTACTCTCG CTCGAAGCGC TGGCCGGCCG GCGGGGCGTC AGCGAGGACG CGCTGGTCGA CGTCGCGTTT CCGGACCACG TGCGGGTCGG CCGGACGCTC GTCCGGCCGG CCGTCCTCGA GTCGCTGGCG GACGAGATCG AGGCCGGAAT GGCGCTGGCC GACGCCGAGA AGATCCTCGA AGCGGCCGGA TTCAGCGATT CGAGTGCGAT CCTCTCGGAA CTCGGTTATC GCGTCGAGTG GGAGGGGCTG GCCGGCGGGA CGCTCGTCGA GCGGTAG
Protein sequence | MLTKDLLRVS RAGGGYHLQF ADREHRPLAA RVIGTYQGHV GESRAELEAA VTELERGADD FKLVRGLSAL LERDATFETD AEIDPERARR AAFEAAEAVG VVTEDERAMA LVRAGESLGV SADDVAGALY ADLEERQVLI ELASRWEPDE LVAQYNLSLA QTALFDATEL RVRSSDPKAL VSAIKRLRLM YEIRRLENDE VGEAPDRGIA EREVIVTGPT HLFRATRRYG TRFARLLRTV AKAEEWRLEA TIDDRGTERT LRLSHEDPVR VPDAEPVAEV SFDSGVEADF AARFSTLDLE WDLVREPAPL ATGTRVMIPD FAFDYRPGGS ARRDSSDESD GGHSEFRVYF EIMGFWTPEY VEKKLAQLSD LEDVELIVAV DESLGVGEEI AARDFRAIPY SGSVRLKDVA GVLREYERQL VAESAAALPD ELCPDEDVLS LEALAGRRGV SEDALVDVAF PDHVRVGRTL VRPAVLESLA DEIEAGMALA DAEKILEAAG FSDSSAILSE LGYRVEWEGL AGGTLVER
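The relationship between the two sequence fields can be illustrated with a short translation sketch. Translation table 11 assigns codons to amino acids in the same way as the standard genetic code (it differs in which start codons are permitted), so a standard codon lookup is used here. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 30 nucleotides and the final stop codon of the record are used as input.

```python
# Codon lookup in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(dna: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten and upper-case it."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the record above.
prefix = "ATGCTGACCA AGGACCTGCT CCGCGTCTCG"
print(translate(prefix))  # prints: MLTKDLLRVS
print(translate("TAG"))   # prints: * (the final codon of the gene sequence)
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence followed by a stop marker, and `gc_fraction` could be compared with the listed GC content; neither result is asserted here.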