Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5023 |
Symbol | |
ID | 8745829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 14734 |
End bp | 15852 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646515637 |
Product | hypothetical protein |
Protein accession | YP_003406584 |
Protein GI | 284176308 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 0.868022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAACT TCACCAAGCG AGTCGCAACA CTCGGCATCA TGCTGGCGGT CATCACCAGC ATGTTCGCGA TGCCGGCTGC CGCAACGTTC TCAACCCAAC CCGCAGTCGA TAGCTCCTCG GACATTTCCG ACGGCGCAAC CATCTCGGTC GTCGACGCCA ACGAGAGCAA CGTCTCTGCG CTGGTCGTCG ACACAGACGC AAACGACACG ATCGAGTCGG CCGACGTCAC GGTCGACCTG AACAACTCCG ATCGGAACTA CACGGTCTAC GACGCGGATA CTGCGAGCAG CGAGTACTCG GTCAATAGTA GCATCGACAC GGACAGCGAC GGGACTGCGG ACACGGACCG TCACACCTGG AACGTCAGCC ACGACGAGTT CGCTGACGTC CCAGTCGCCT ACAACGACAC GACTGACCTC GACTTCACCG TCGAGTTCAC CGACGGCGCT GGCGACACGG CCAACGTCAC CGGCACGATC ACGATCTCCA ACGACGGCGA GCGCGCCGTG ACGGTCGTCG ACAGCGCGTC GATCGATAAT AGCGGCGTCG GCCCGTCGGT CGAAAGCGAG AGCCAGGAGG CTGGCACCCT CGCGAAGTAC AGCCCGTTCC ACGACGAGCC CGAGCACGAC ATCTACACGC TCGATGACTC GTTCGGCTCT GACGCAAACG TCACGCACGA AATCTACCTC GCTGACGGCG ACATGGCCAG CGCCTACGAC GCCAGTGCCG AGGACACCGA GGCAGGCGAC GTCCTAATCG GACAGACGAT GCTCGCCAAC GACGGCCTCG TGCTGACGTT CGACTCCGAG GCCGACTCGG ATCTGGTTGA TACCTCGAGT GATGCCTACG CCGTCTACGA CAGTTCGGCG GACAAGCTGA CCCTCGAGCC CGCCGACGAC AATGCGACGC TCGACGTCGT CTCGATGAAT CAGAACCCGG TCGACGTCGA GAGTGTGAGC AACGACGACA TCGCGAGCAC GTTTGACGAC GCGTTCGGGA CCTACGCGCT GTTCAGCAAC TTCTCGGTTG GCGTCCTGAT GGCCTCGTTC GGCCTGCCGA CGCTGGCGTT CCTGTTCGCC GTCGGAGCGC CGAAGGCCCG CAGTCGCATC GAAGCGTAA
|
Protein sequence | MRNFTKRVAT LGIMLAVITS MFAMPAAATF STQPAVDSSS DISDGATISV VDANESNVSA LVVDTDANDT IESADVTVDL NNSDRNYTVY DADTASSEYS VNSSIDTDSD GTADTDRHTW NVSHDEFADV PVAYNDTTDL DFTVEFTDGA GDTANVTGTI TISNDGERAV TVVDSASIDN SGVGPSVESE SQEAGTLAKY SPFHDEPEHD IYTLDDSFGS DANVTHEIYL ADGDMASAYD ASAEDTEAGD VLIGQTMLAN DGLVLTFDSE ADSDLVDTSS DAYAVYDSSA DKLTLEPADD NATLDVVSMN QNPVDVESVS NDDIASTFDD AFGTYALFSN FSVGVLMASF GLPTLAFLFA VGAPKARSRI EA
|
| |