Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1638 |
Symbol | |
ID | 8742231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1698286 |
End bp | 1699476 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646512216 |
Product | DRTGG domain protein |
Protein accession | YP_003403197 |
Protein GI | 284164918 |
COG category | [R] General function prediction only |
COG ID | [COG0857] BioD-like N-terminal domain of phosphotransacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA CTGACCCCAC TGACACCGAC ACCACTCCCG ACGACACCGA GACGACCGAC GGTCGGCAGC CCGACGAGCA GCAGGGTGAC CGACGGTCGA CTGACGCCGC CACGACTGCG AGTGACACCG ATACGATCCT CGTCAGTTCG CTCGCGGAGA GCACCGGCAA GACGGCGATC ACGCTGGCAC TGGCCCGGCT CGCGGCCGAG GAAGGCGACA GCGTCGGCTA CATGAAACCG AAGGGCACCC GACTCGAGAG CAACGTCGGA AAGACCCTGG ACGAGGATCC GTTGCTCGCA CGCGAACTGC TCGACCTCGA GGCCGAGATG CACGATCTGG AGCCCGTCGT CTACTCGCCG ACGTTCGTCG AGCAGGCGAT CCGCGGCCGC GAGGACCCCG ACGAGATCCG CGAGCGCGTG GTCGAGGCCT TCGAGACGCT CGCCGACGGC CGAGACCGCA TGTTCGTCGA GGGCGGCGGC GAGTACGACG TCGGCGGTAT CGTCGACCTC ACCGACGCCG ACGTGGCGGA ACTTCTGGAC GCCCGCGTCG TCCTCGTGGC TTCCCACGAG GTTCCGGGCG ATATCGACGA CGTGCTCGCC GCGGTCGACG CCTTCGGCGA CCGACTCGCC GGCGTCATCT ACAACGACGT CGCGGACGCC GTCTACGATA CGCTCGAGAC CGACGTCGTC GCCGCGCTCG AGGAGCGCGG GATTCCGGTC TTCGGCGTGC TCCCTAGCGA GCGGACGCTC TCGGGGGTCA CCGTCGGCGA ACTGGCCGAA GAGCTCGGCG CCTCGATGCT CGTCGAAGAC GGACGGGACG CCTACGTCGA GCGGTTCAGC GTCGGCGCGA TGGGCGCCGA CAGCGCCCTG CGCCACTTCC GCCGGACGAA AGACGCGGCC GTTATCACCG GCGGCGACCG CGCCGAGATC CACACCGCCG CGCTCGAGGC GCCGGGCGTG CGCTGTCTCA TCCTGACCGG CGGCCACCGC CCGTCGGGCG CGATCATCGG TCAGGCCGCC GAGAAGGGGA TGCCGATCCT CTCGGTGCAG ACGGATACGC TGACGACCGT CGAACGCGCC GAGGACGTCG TCCGGAGCGG CCGCACGCGC GACGCGGAAA CCGTCGACCG GATGGCGGAA CTGCTGACCG ACCACACGGC GGTCGACTCG ATTCTGGACG GTTCTCGCTA G
|
Protein sequence | MSDTDPTDTD TTPDDTETTD GRQPDEQQGD RRSTDAATTA SDTDTILVSS LAESTGKTAI TLALARLAAE EGDSVGYMKP KGTRLESNVG KTLDEDPLLA RELLDLEAEM HDLEPVVYSP TFVEQAIRGR EDPDEIRERV VEAFETLADG RDRMFVEGGG EYDVGGIVDL TDADVAELLD ARVVLVASHE VPGDIDDVLA AVDAFGDRLA GVIYNDVADA VYDTLETDVV AALEERGIPV FGVLPSERTL SGVTVGELAE ELGASMLVED GRDAYVERFS VGAMGADSAL RHFRRTKDAA VITGGDRAEI HTAALEAPGV RCLILTGGHR PSGAIIGQAA EKGMPILSVQ TDTLTTVERA EDVVRSGRTR DAETVDRMAE LLTDHTAVDS ILDGSR
|
| |