Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1410 |
Symbol | |
ID | 8742001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1462642 |
End bp | 1464012 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646511988 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003402971 |
Protein GI | 284164692 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGCC TCGAGGCGAC CGCGGTACTG GCCGACGTCG CTCGCCCGGT CGCCGCGTCG GGAGTCGTTG GTTTCGAGCC GTCGACGACC CTCGTGGCCG CCGGCGGCGT CGCGGCGCTG CTGGTGTTGC TGGTCCTCTC GGGGTTTTTC TCCTCGGCCG AGATCGCCAT GTTCTCGCTG GCCCACCACC GCATCGAGGC GCTCGTCGAG GACGGGGCGT CCGGCGCCGA GACCGTCCAG GCGCTGAAGG ACGACCCCCA CCGACTGCTG GTGACGATCC TCGTCGGGAA CAACCTCGTC AACATCGCGA TGTCCTCGAT CGCGACGGGA CTGTTCGCGA TGTACACGAG CCAGGGGCGG GCGATGCTGG CGGCGACGTT CGGCGTGACG GCCGTCGTCC TGCTGTTCGG TGAGAGCGCG CCCAAGTCCT ACGCTATCGA GAACACCGAA TCGTGGGCAC TGTCGGTCGC TCGTCCCCTC AAAGTCTCGG AGTACGCGCT GTTCCCGCTC GTGGTCACGT TCGACGCGCT GACACGGGTG CTTAACCGCC TGACCGGCGG CACGGCCGTC GAGGAGTCGT ACGTCACCCG CGAGGAGATC CGGGAGCTGA TCCGGACCGG CGAGAGCGAA GGGGTCATCG AGGCCGACGA ACGCGAGATG CTCCAGCGCG TGTTCCGGTT CAACGACACC ATCGCCAAAG AGGTGATGAC GCCGCGATTG GACGTCACCG CCGTCGCTCG AGAAGCGACC GTCGACGAAG CCGTCGCGAA GTGCGTCGAG AGCGGCCACA CCCGCCTGCC GGTCTACGAC GGCGATCTCG ATACCGTCGT CGGGATCGTC GCGCTCGGCG ACCTCGTCGG TGATCGCGAG TCGACCGACG ACGGCTTGCT CGAGGCCCAC GTCGAGGAGA CGCTGCACGT CCCCGAGAGC AAACACGTCG ACGAGCTGTT CCGCGAGATG CGCCAGCAGC GGGTCGAACA GGTCGTCGTC ATCGACGAGT TCGGGACGAC GGAGGGGATC GTCACCACCG AGGACATCGT CGAGGCCGTC GTCGGCGAGA TCCTCGAGAC CCAGGAGGAC GACCCGATCG AGACCGTCGA CGACCGAACC GTCCGGGTCG ACGGCGAGGT GAACATCGAG GCCGTCAACG ACGTCACCGG CGTCGAGTTC CCAGAGGGCG AGGAGTTCGA GACGATCGCC GGCTTCGTCT TCAACCGCGC CGGCCGACTG GTCGAACCCG GCGAAACGTT CGCCTACGAC GGCGCCGAAC TGACCGTCGA ACGCGTCGAC GATACGCGCA TCAGGCGGGT GCGCATCAGC GAGTCGGAGC CTTCGGTAAC GGACGGCTCC GGTGTCGCCG CCTCGAGTTA G
|
Protein sequence | MPGLEATAVL ADVARPVAAS GVVGFEPSTT LVAAGGVAAL LVLLVLSGFF SSAEIAMFSL AHHRIEALVE DGASGAETVQ ALKDDPHRLL VTILVGNNLV NIAMSSIATG LFAMYTSQGR AMLAATFGVT AVVLLFGESA PKSYAIENTE SWALSVARPL KVSEYALFPL VVTFDALTRV LNRLTGGTAV EESYVTREEI RELIRTGESE GVIEADEREM LQRVFRFNDT IAKEVMTPRL DVTAVAREAT VDEAVAKCVE SGHTRLPVYD GDLDTVVGIV ALGDLVGDRE STDDGLLEAH VEETLHVPES KHVDELFREM RQQRVEQVVV IDEFGTTEGI VTTEDIVEAV VGEILETQED DPIETVDDRT VRVDGEVNIE AVNDVTGVEF PEGEEFETIA GFVFNRAGRL VEPGETFAYD GAELTVERVD DTRIRRVRIS ESEPSVTDGS GVAASS
|
| |