Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5253 |
Symbol | |
ID | 8745801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 154859 |
End bp | 156166 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646515610 |
Product | hypothetical protein |
Protein accession | YP_003406557 |
Protein GI | 284176280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.927267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCCA TCGTCATCGA TCACATCCAG GCCGACGGTA CGACCCTCGA GGCGGACGTT CGAACGTCCG GCGACCTCGA GCGGTTCTTC ACCACCGAAT CGTTCCGAAC CGAGTACGAC GTCCCGATCG CGGACGTCCC GGAGGGGGTG CTCGCGATCC CCGTCCTCGC GCAGGTCTGT CCCGTGGCGT GGGCCAACGG GGCCGACGTC TACGTCGACG AGGTCGACGC CACCTTCGCC GCAGCGCTGG GGGACGTCGA GGCGTCGCTG CGGGAGATGT ACGACTTCCT CGAGGGCGGG ACCCTCTACG CGAAGGAGAC GATCGACGCC GACCCCGACG TGAGCGGCGA GAGCGGCCTG CTCTTCACCG GCGGCGTCGA CTCGACGTGT TCGTACGTCC GCCACCGCGA GGAGGAGCCG ACGCTGGTCA GCATCCGCGG CTGGACCATC ACGCCCAGTT CCGCCGACGA CGAGAAGTGG GACGCGCTCC GCGAGCGCGT CACCGGTTTC GCCGACGAGC ACGGCCTCGA GACGGCCTTC GTCGAGTCGA ACATGCTCTC CTTCCTCGAC CACCCCATGC TCCTGGCCCA TTACAAGCGC TACGTCGACG GCGCCTGGTA CAGCTCCGTG GGCCACGGGC TGGGCCTGCT CGGGCTCTGT GCCCCGATGG CCTACGCGCG GGGCATGGAG GACCTCTACG TCGCCGCGAC CCACTGGGAG GGAATCGACC TCGAGTGGGG GTCGCGTCCC GACATCGACG ATCACGTCCG GTGGGCGGGG ACGCGGTGTC ACCACGACGG CTACGAACTG ACCCGTCAGG AGCGGATCGA CGCGATCGCC GACTACATCC GCGAGGAAGA GCCGGACCTC CAGTTGCAGA CCTGCAACGA CCGCATGGAC GGCAACTGCG GCGAGTGCGA GAAGTGTTAC CGGACGGCCG TCGGCCTGCG ACTCTCGGGA CTCGAGCCGA CCGACCACGG CTACCCGTTC GGCGACGAGG ACTACCGCGA GATCCGGACC GCCTTAGAAG AGGGGCGGTG GGTGCTCGGT CAGGACGAGA AGTACATGTG GGAAGATATT CGCGAGCGCG CGCGTGAGAC GGACCCGTCG TCGCCGGCCG AGGCGGCCTT CTTCGCGTGG CTCGACGAGG TCGATCTCGA CGAACTCGTC TCCGAGTCCG AACCGCCGCT GTCGCATCGG CTCCTCCGCG CCGGCGCCCG AAACGCTCCG GCCAGCGTCT ACAACGCCGT CTATCCCGCC TGGGCGACGG CGAAGTCTGG TCTGCGCCGC GTTCGGCACG GCCGCTAG
|
Protein sequence | MSSIVIDHIQ ADGTTLEADV RTSGDLERFF TTESFRTEYD VPIADVPEGV LAIPVLAQVC PVAWANGADV YVDEVDATFA AALGDVEASL REMYDFLEGG TLYAKETIDA DPDVSGESGL LFTGGVDSTC SYVRHREEEP TLVSIRGWTI TPSSADDEKW DALRERVTGF ADEHGLETAF VESNMLSFLD HPMLLAHYKR YVDGAWYSSV GHGLGLLGLC APMAYARGME DLYVAATHWE GIDLEWGSRP DIDDHVRWAG TRCHHDGYEL TRQERIDAIA DYIREEEPDL QLQTCNDRMD GNCGECEKCY RTAVGLRLSG LEPTDHGYPF GDEDYREIRT ALEEGRWVLG QDEKYMWEDI RERARETDPS SPAEAAFFAW LDEVDLDELV SESEPPLSHR LLRAGARNAP ASVYNAVYPA WATAKSGLRR VRHGR
|
| |