Gene Htur_2001 details

Gene Information

Locus tag: Htur_2001
Symbol:
ID: 8742600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2070876
End bp: 2072225
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 68%
IMG OID: 646512583
Product: 2-methylcitrate dehydratase
Protein accession: YP_003403558
Protein GI: 284165279
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
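
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 2070876
END_BP = 2072225
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```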


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACAC TCGAACTCGC GGAGTTCGTC CAGGCAACCG ACTACGAGGA CCTCTCGCCC 
GACGTTCGCG ACGCGCTCAA ACGCCGCGTG CTCGACTCGG TCGGCATCGC CGTCGCCGCG
GAGGTCGCCG ATCCGACGCA GGTGGTGTTC GAGACCGTCC GGGACCTCGA GACGGACGGA
GCGTGCACGC TCTGGGGACG GGACGGCGAC GGCGCCTCGC CGGTGCAGGC GGCGATGCAC
AACACGGCGC TGACCCGCTA CCTGGACTAC ATGGACTCGT TTCTCGCGCC CAACGAGACG
CCCCATCCGA GCGACAACGT CGGCGCCGTC GTCGCCGCGG GGGAGTACGC CGACCGGTCG
GGCGAGGACC TGCTCGCGGG GCTGGCCGTC GCCTACGAGA TCCAGGGGGA ACTCGCGTGG
AACGCACCCG TTCGCGACCG GGGGTTCGAC CACGTCACCC ACACCGTCGT CTCGGCGGCC
GCCGGCGCGT CGAAGCTCCT CGGCCTCGAT CTCGAGGAGA CCCGGAACGC CATCGGCATC
GCGGGGACGG CCCACAACGC CCTGCGGGTG ACCCGGACGG GCGGGATCAA CGAGTGGAAG
GGGATCGCGT CGGCGAACGC CGCGCGGAAC GCCGTCTATT CCGCGATGCT CGCGAAAAAC
GGGATGGAAG GACCGCGGGA CCTCTTCGAA GGCCAGAAGG GGTGGCAGGA CGTGATCTCG
GGGGCGTTCG ACGTCGATCT GACGCCCGGC GAGCGCGTTC ACGACGCAAT GACCAAACGC
TACGTCGCGG AGACGTACGC CCAGTCGGCC GTCGAGGGTG TGATCGAACT CGCCGAGCGG
GAGGACCTCG ACCCGGACGA CATCGCGGGG GTCAAACTCG AGACGTTCGC CGGCGCGAAG
CTCATCATCG GCGGCGGCGA GGGGAACCGG TACGAGATCG ATAACCGGGC GCAGGCCGAC
CACTCGCTGC CGTACATGCT CGCGGCGGCG CTGATCGACC GTGACCTCTC GCTCGAGCAG
TACGAACCCG ATCGCATTCG GCGCGAGGAC GTCCAGGAAC TGCTTCGAAT CGTCGACGTG
AGCGAGGACT CCGAACTCAC CGAGCGCTTC GAAAACGGCG AGATGCCGGC CGTCATCGAC
GTCACGACGG ACGACGGCAC CACCTACCGG ATCGAGAAGG AGGCGTTTCA CGGCCACCCG
CTCGACCCGA TCGGCTGGGA GGGGCTCGAG GCGAAGTTCG ACGCTATCGC GGGCGAGCAC
CTCGAGGACG ACCGCCGCGA CGAACTCGTC GAGACGATCA GGACCCTCGA GGACCAGGAC
GTGGCCGATC TGACGGCGCT GTTGGAGTAG
 
Protein sequence
MTTLELAEFV QATDYEDLSP DVRDALKRRV LDSVGIAVAA EVADPTQVVF ETVRDLETDG 
ACTLWGRDGD GASPVQAAMH NTALTRYLDY MDSFLAPNET PHPSDNVGAV VAAGEYADRS
GEDLLAGLAV AYEIQGELAW NAPVRDRGFD HVTHTVVSAA AGASKLLGLD LEETRNAIGI
AGTAHNALRV TRTGGINEWK GIASANAARN AVYSAMLAKN GMEGPRDLFE GQKGWQDVIS
GAFDVDLTPG ERVHDAMTKR YVAETYAQSA VEGVIELAER EDLDPDDIAG VKLETFAGAK
LIIGGGEGNR YEIDNRAQAD HSLPYMLAAA LIDRDLSLEQ YEPDRIRRED VQELLRIVDV
SEDSELTERF ENGEMPAVID VTTDDGTTYR IEKEAFHGHP LDPIGWEGLE AKFDAIAGEH
LEDDRRDELV ETIRTLEDQD VADLTALLE
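
Both sequences are printed in space-separated blocks of ten residues, so the whitespace has to be removed before any processing. The following is a minimal sketch, not the database's own tooling: it cleans a sequence, computes its GC fraction, and translates it codon by codon. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which codons are allowed as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Compact standard codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGACGACAC TCGAACTCGC GGAGTTCGTC"
print(translate(first_blocks))  # MTTLELAEFV
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` allows the reported GC content and protein sequence to be checked in the same way.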