Gene Information |
Locus tag | Htur_2001 |
Symbol | |
ID | 8742600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2070876 |
End bp | 2072225 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646512583 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_003403558 |
Protein GI | 284165279 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | |
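
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2070876
end_bp = 2072225

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1350 bp) and Protein Length (449 aa) fields.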
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC TCGAACTCGC GGAGTTCGTC CAGGCAACCG ACTACGAGGA CCTCTCGCCC GACGTTCGCG ACGCGCTCAA ACGCCGCGTG CTCGACTCGG TCGGCATCGC CGTCGCCGCG GAGGTCGCCG ATCCGACGCA GGTGGTGTTC GAGACCGTCC GGGACCTCGA GACGGACGGA GCGTGCACGC TCTGGGGACG GGACGGCGAC GGCGCCTCGC CGGTGCAGGC GGCGATGCAC AACACGGCGC TGACCCGCTA CCTGGACTAC ATGGACTCGT TTCTCGCGCC CAACGAGACG CCCCATCCGA GCGACAACGT CGGCGCCGTC GTCGCCGCGG GGGAGTACGC CGACCGGTCG GGCGAGGACC TGCTCGCGGG GCTGGCCGTC GCCTACGAGA TCCAGGGGGA ACTCGCGTGG AACGCACCCG TTCGCGACCG GGGGTTCGAC CACGTCACCC ACACCGTCGT CTCGGCGGCC GCCGGCGCGT CGAAGCTCCT CGGCCTCGAT CTCGAGGAGA CCCGGAACGC CATCGGCATC GCGGGGACGG CCCACAACGC CCTGCGGGTG ACCCGGACGG GCGGGATCAA CGAGTGGAAG GGGATCGCGT CGGCGAACGC CGCGCGGAAC GCCGTCTATT CCGCGATGCT CGCGAAAAAC GGGATGGAAG GACCGCGGGA CCTCTTCGAA GGCCAGAAGG GGTGGCAGGA CGTGATCTCG GGGGCGTTCG ACGTCGATCT GACGCCCGGC GAGCGCGTTC ACGACGCAAT GACCAAACGC TACGTCGCGG AGACGTACGC CCAGTCGGCC GTCGAGGGTG TGATCGAACT CGCCGAGCGG GAGGACCTCG ACCCGGACGA CATCGCGGGG GTCAAACTCG AGACGTTCGC CGGCGCGAAG CTCATCATCG GCGGCGGCGA GGGGAACCGG TACGAGATCG ATAACCGGGC GCAGGCCGAC CACTCGCTGC CGTACATGCT CGCGGCGGCG CTGATCGACC GTGACCTCTC GCTCGAGCAG TACGAACCCG ATCGCATTCG GCGCGAGGAC GTCCAGGAAC TGCTTCGAAT CGTCGACGTG AGCGAGGACT CCGAACTCAC CGAGCGCTTC GAAAACGGCG AGATGCCGGC CGTCATCGAC GTCACGACGG ACGACGGCAC CACCTACCGG ATCGAGAAGG AGGCGTTTCA CGGCCACCCG CTCGACCCGA TCGGCTGGGA GGGGCTCGAG GCGAAGTTCG ACGCTATCGC GGGCGAGCAC CTCGAGGACG ACCGCCGCGA CGAACTCGTC GAGACGATCA GGACCCTCGA GGACCAGGAC GTGGCCGATC TGACGGCGCT GTTGGAGTAG
|
Protein sequence | MTTLELAEFV QATDYEDLSP DVRDALKRRV LDSVGIAVAA EVADPTQVVF ETVRDLETDG ACTLWGRDGD GASPVQAAMH NTALTRYLDY MDSFLAPNET PHPSDNVGAV VAAGEYADRS GEDLLAGLAV AYEIQGELAW NAPVRDRGFD HVTHTVVSAA AGASKLLGLD LEETRNAIGI AGTAHNALRV TRTGGINEWK GIASANAARN AVYSAMLAKN GMEGPRDLFE GQKGWQDVIS GAFDVDLTPG ERVHDAMTKR YVAETYAQSA VEGVIELAER EDLDPDDIAG VKLETFAGAK LIIGGGEGNR YEIDNRAQAD HSLPYMLAAA LIDRDLSLEQ YEPDRIRRED VQELLRIVDV SEDSELTERF ENGEMPAVID VTTDDGTTYR IEKEAFHGHP LDPIGWEGLE AKFDAIAGEH LEDDRRDELV ETIRTLEDQD VADLTALLE
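
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch in Python of how such a block-formatted gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and chosen for illustration. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). Only the first three blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
gene_prefix = "ATGACGACAC TCGAACTCGC GGAGTTCGTC"

gc_count = sum(clean(gene_prefix).count(base) for base in "GC")
peptide = translate(gene_prefix)
print(peptide)
print(f"{gc_count} of {len(clean(gene_prefix))} bases are G or C")
```

The translated prefix matches the first block of the protein sequence (MTTLELAEFV). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.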