Gene Information |
Locus tag | Htur_4422 |
Symbol | |
ID | 8745050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 697067 |
End bp | 698491 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646514959 |
Product | protein of unknown function DUF35 |
Protein accession | YP_003405906 |
Protein GI | 284167628 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00766931 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCGACT CGCTCGGCAT CGCGGCCGTC GGGACGTACG TCCCGCGGCG GCGGATCACG GCCGAGGCGG TGACCGAGGC GTGGAACCGC TTCGACGGCG CGGGAATCCA GGAGACCGCC GTTCCCGGCC CCGACGAGGA CAGCCTGACG ATGGCCGCGG CGGCCGCGCG CCGCGCGCTC GAGGCGAGCG ACGTCGACGC GGGCGAGATC GCCGCCCTCT CGTTCGCGAC GACGACCCCG CCGCTCGAGG AGGAGGATCT CTCGGTGCGG CTGGGCGAGT TCCTCGGCGT CGGCGACGAC GCGACGCGGA CGCTGTCGAC GGGCAGCACG AACGCGGGCG TTCGCGCGCT CACGTCAACG CTCGCCGCGG GCGACGGCCC CGCGCTCGTC GTCGCGAGCG ACATGCCGCG GGGCGAACCC GATAGCGCCA TCGATCACGC TGCCGGCGCC GGCGCGGCCG CCTTCGTGCT GACCCCGGAC GCTCCCGTGA CGATCCGAGA GCAGGCGGAC CACGCGATCG ATTATCCCGG GACTCGCTTC CGGGAGCGCG GCTCGACGAC GGTCGACTCG CTCGATATCA CGCCGTACGA TCGCTCGGCG TACACCGAGA CGGTCGGCGG GGCCGTCGAC GGCCTCGAGG CGGATCCGTC GGCGGTGGAC GCGGTCGCGC TGCAGGCGCC GGACGGAAAG CTCCCCTACC GCGGCGCGAA GGTCATCGGC ACGCCGTCGG ACGCCGTCGA ACCGCACGCG GTCGTTCACT CGCTCGGCGA CCTCGGCGCC GCCAGCGTGC CGCTGTCGAT GGCGAGCGCG CTCGTCGACG GCGTCGATCG GCTACTGGCC GTCGGCTACG GAAGCGGCTC GACGGCGACC GCGCTCTCCG TCGAGGCGTC CGGCGCCGTT CCGGGCGAGG TCGACCTCGC GGGCGAAGAG ACGGTGAGCT ACGCGGAGTA TCTCCGCCTG CGCGGCGAGA TCACGAGTGG CGAACCGGAC GGCGGCGGCG CGTACGTGAG CGTCCCCTCC TGGAAGCGAT CGGGTCCGCA GCGATATCGA CTCGAGGGCG GGCGCTGTCC GTCCTGTGGC TCCCTGAACT TCCTCCCCGA CGGGGCGTGC CGGCAGTGTT ACGAACTCGT CGAGTACGAG CCGGTACAAC TGGAGCGGCA GGGAACGATC GAGGCGGTGT CGGTCATCTC TCAGGGCGGC GCACCGCCGG AGTTCGCGGA GCTCCAGGCG CGCGCCGGCG ATTACGCGAC GGCGATCGTC GCGTTCGACG GCCCCGACGG CGACACGGCC AGCGCGCCGG TGCTCGTCGT CGACGCCGAC GCGGAGTCGG TCGACGTCGG CGACCGCGTC GAAGCGACGG TGCGGCGCAT CTACACGCAG GAAGGAGTGA CCCGCTACGG GCTGAAGGTC CGCCCGCTCG ACTGA
Protein sequence | MSDSLGIAAV GTYVPRRRIT AEAVTEAWNR FDGAGIQETA VPGPDEDSLT MAAAAARRAL EASDVDAGEI AALSFATTTP PLEEEDLSVR LGEFLGVGDD ATRTLSTGST NAGVRALTST LAAGDGPALV VASDMPRGEP DSAIDHAAGA GAAAFVLTPD APVTIREQAD HAIDYPGTRF RERGSTTVDS LDITPYDRSA YTETVGGAVD GLEADPSAVD AVALQAPDGK LPYRGAKVIG TPSDAVEPHA VVHSLGDLGA ASVPLSMASA LVDGVDRLLA VGYGSGSTAT ALSVEASGAV PGEVDLAGEE TVSYAEYLRL RGEITSGEPD GGGAYVSVPS WKRSGPQRYR LEGGRCPSCG SLNFLPDGAC RQCYELVEYE PVQLERQGTI EAVSVISQGG APPEFAELQA RAGDYATAIV AFDGPDGDTA SAPVLVVDAD AESVDVGDRV EATVRRIYTQ EGVTRYGLKV RPLD
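The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates copied from the Gene Information table.
length = gene_length(697067, 698491)
print(length, protein_length(length))  # prints "1425 474"
```

Passing the Gene sequence field (with its 10-base grouping left intact) to `gc_percent` gives a value to compare with the reported GC content.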