Gene Htur_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1507 
Symbol 
ID8742098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1565563 
End bp1566900 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content68% 
IMG OID646512083 
ProductPBS lyase HEAT domain protein repeat-containing protein 
Protein accessionYP_003403066 
Protein GI284164787 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG ACGAGGCAGC CGAGGACGGA GCGGACGACG ATCAGCGAGA GCCGGTCGAC 
CTCGAGGCCA TCCGCGAGGC CCTCGCGGCC TTCGAGGACG ACGTCGAAGC CCTCGAGAGC
GACCTCGAGG CCGCCGAAAC CGAGGATGAC CTCGACGTCG TCGAGGCCGA CATCGAGTCG
TTCCGCGAGG AGTTCGAGGA GATCGAGATC CCCGACCCGC CGGAGACCGA GGACGAAGAC
GACGAGGACG AGGAAGACGA AATAACGCCC GAGGAGGAAC TCCAGGAGCG CTACGACGAG
ATCGAAAGCG ACGTCTCGGA CCTCGAGTCC GATCTGGAAG ACCAGCGCGG TCCCTACGGC
GAGGACGTCG TCAGCGAGAT CAACAGCGCC AGCGGGACGA TCACGGGCAC CCGCTGGACC
GAAGAGGGTA ACGCCGAACT GATCGAAGCC GTCGACGACT TCCTCGACGA CCTGAACGAA
CTGCTCGGCA CCTCGGTCAC GCTGAGCAAC GAGGGCGAGG CGGTCCCCGA CCAGCTTGAT
GCGACTCTCG ACCGCGCGGC CGAGGCCGTC GAGGACGCCG AACTCGACGC CGACGACGAC
GCCGAGACGA TCGCCGGCCT GCTCGAGGCC ACGGACGACC TCGAGTCCGA TATCGACGAT
GCGACCGAGT GGACCGACCT CGAGATCCGC GAGCAACTGC GCCGCGAGGG GTTCTACGAC
GTGCTCGACC ACGTCAAGGA CTTCCCGCCG GAGTGGCACG CGCTGAAGGT CCACGAGAAA
CGCGGTAACG TCGATCAGAT CCTGCTGGCC TACGAGACCT TCGACTCCGA CTACATGGAG
GAGCACTGCC TCGAGGCATT GGAACGCATG GGCCCCGAGG AGGCCATGGA ACCCATGATC
CAGAAGGCGG GTCGCCGCGA CCAGGCTGCG ATGCGCATCC TGGGCAAGAT CGGCATCGCC
GACGACGAGG TCGTCGAGGC GCTGATCGAT TACGTCGACT CGAACCCCAA CCTCCAGCGG
CCCGCGTTCC GCGCGCTCGG CGAGGTCGGC GCCGAGGACG CCGTCGAGCC GCTCGCCCAG
CAGCTGGTCG CCGACGAACC GGACGTCCGC AGCTGGGCCG CCCGCGCGCT CGGCCTGATC
GGCGACACCC GCGCCATCGA GCCGCTCGCG GATGTGCTGG CCGACGACGA GGAGGACCGC
GTCCGCGCCA GTGCCGCCTG GGCACTCAAC CAGATCGGCA CCGCCGAGGC CCTCGAGATC
GTCGCCGACT ACGGCGACGA CCGCGCGTAT CTCGTCCAGG CCGAGGCCGA GAAGGCCGCA
ACCGAGCCCG CGGCCTGA
 
Protein sequence
MSDDEAAEDG ADDDQREPVD LEAIREALAA FEDDVEALES DLEAAETEDD LDVVEADIES 
FREEFEEIEI PDPPETEDED DEDEEDEITP EEELQERYDE IESDVSDLES DLEDQRGPYG
EDVVSEINSA SGTITGTRWT EEGNAELIEA VDDFLDDLNE LLGTSVTLSN EGEAVPDQLD
ATLDRAAEAV EDAELDADDD AETIAGLLEA TDDLESDIDD ATEWTDLEIR EQLRREGFYD
VLDHVKDFPP EWHALKVHEK RGNVDQILLA YETFDSDYME EHCLEALERM GPEEAMEPMI
QKAGRRDQAA MRILGKIGIA DDEVVEALID YVDSNPNLQR PAFRALGEVG AEDAVEPLAQ
QLVADEPDVR SWAARALGLI GDTRAIEPLA DVLADDEEDR VRASAAWALN QIGTAEALEI
VADYGDDRAY LVQAEAEKAA TEPAA