Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4194 |
Symbol | |
ID | 8744822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 461102 |
End bp | 462523 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646514742 |
Product | protein of unknown function DUF35 |
Protein accession | YP_003405689 |
Protein GI | 284167411 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.492043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGGAC TCGTCGCCGC CGGGGTCTAC GTTCCTCGGT TCCGACTCTC GGCCGACGAC CTCGAGGCCG CGTGGGAGAC GAGTCACGCC GCGGGCGTCG AACGAAAGGC CGTTCCGGCC GCGGACGAGG ACTCGCTGAC GATGGCCGTC GTGGCGGCCC AACGGGCGCT CGCTGACGCC GCCGTTGATC GCTCGGCGAT CGAGACCGTC GCAGTCGCGA CGACCACTCC GCCGCTCGAG GAGGGCGATT TCGTTCCGCG ACTGGTTCGA GCGCTCGATC TCCCTGCAGG CGTAGCGACG ATGACTACGA CCCACCACAC TGCGGCCGGC GCCGAAGCGC TCTCGCGTGC GCTCGACGCC GACGGGCCCG CCGTTGTCAT CGCCGCCGAC TGTCCCGAAG GGGAGCCGGC CGACGCGGAC CATCCGTTCG GCGCCGCCGG GGCAGCGTTC GTGATCGACG ACGATCCGAT CGTTCCAATC GACGACGTCG CGTGGCACAG CGACGAGACG CCGGGGATTC GGTTCCGCGA GCGCGGCGAC CGCGACGTCG ACTCCCTGGG AGTCACGACG TACGAGCGGG ACGCGGTTCG CGAGGCGGTA ACGACGGCAG TGTCGTCGCT CGAGATCGAC GCGGCCGAGG CGACCGGTGC GGCGGTGCAC CAGCGTGACG GTGGCTTCCC CTATCGGATC TCGAGCGATC TCTCGGTCTC GTCCGAGGCC GTGGCCGCGG GGACGGTAGC CGACCGGATC GGCGACGCCG GCGCGGCGAC GGTCCCGGTT GGACTGCTCT CGGCGCTGGA CGGAGCCGAC ACCGACGAAC TGACCGTCGC CGCCTTCTTC GGCGGCGGTA GCGCGGCCGC GCTCACTTGC GAGGGATCGC TTCCGGTTCG CGGAATCGAC GACCTCGAGT CGACGGAGAC GGTCGATTAC TCGACGTACC TCCGCGAGCG CGGATACATC GTCGACGGCG AGGTCGCCGG CGGCGGTGCG AACGTGAGTT TGCCGAACTG GCAGCAGTCA CTCGATCACC GATACCGACT CGTCGCCGGC GCGTGTCCGA ACTGCGGTGG CGTTACCTTC CCGCCCGCCG GCGCCTGTCA GGAGTGTCAC GCACGTGTCC AGTTCGAGGA GTTCGAAGCA CCCCGAACCG GGACGGTTCG CGCGGTGACC GTCATCGAAC AGGGCGGTGC CCCGCCCGAA TTCGCGGACC TCCAGCAACG CGACGGCGCG TACGCCGTCG CGATCGTGGC ACTCGAGACA GAACACGGCT CGGTTACGCT CCCCGCCCAG CTCACGGACG TCGATCCGCA ATCGGTGTCG GTCGACGACA CCGTCGAGGC CGCGATCCGT CGGATATACA CGCAGGAAGG CGTCCCGCGG TACGGCGTCA AGTTTAGGCC GACCGACGAG GGTAGCGACT GA
|
Protein sequence | MRGLVAAGVY VPRFRLSADD LEAAWETSHA AGVERKAVPA ADEDSLTMAV VAAQRALADA AVDRSAIETV AVATTTPPLE EGDFVPRLVR ALDLPAGVAT MTTTHHTAAG AEALSRALDA DGPAVVIAAD CPEGEPADAD HPFGAAGAAF VIDDDPIVPI DDVAWHSDET PGIRFRERGD RDVDSLGVTT YERDAVREAV TTAVSSLEID AAEATGAAVH QRDGGFPYRI SSDLSVSSEA VAAGTVADRI GDAGAATVPV GLLSALDGAD TDELTVAAFF GGGSAAALTC EGSLPVRGID DLESTETVDY STYLRERGYI VDGEVAGGGA NVSLPNWQQS LDHRYRLVAG ACPNCGGVTF PPAGACQECH ARVQFEEFEA PRTGTVRAVT VIEQGGAPPE FADLQQRDGA YAVAIVALET EHGSVTLPAQ LTDVDPQSVS VDDTVEAAIR RIYTQEGVPR YGVKFRPTDE GSD
|
| |