Gene Htur_4194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4194 
Symbol 
ID8744822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp461102 
End bp462523 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content69% 
IMG OID646514742 
Productprotein of unknown function DUF35 
Protein accessionYP_003405689 
Protein GI284167411 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.492043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGGAC TCGTCGCCGC CGGGGTCTAC GTTCCTCGGT TCCGACTCTC GGCCGACGAC 
CTCGAGGCCG CGTGGGAGAC GAGTCACGCC GCGGGCGTCG AACGAAAGGC CGTTCCGGCC
GCGGACGAGG ACTCGCTGAC GATGGCCGTC GTGGCGGCCC AACGGGCGCT CGCTGACGCC
GCCGTTGATC GCTCGGCGAT CGAGACCGTC GCAGTCGCGA CGACCACTCC GCCGCTCGAG
GAGGGCGATT TCGTTCCGCG ACTGGTTCGA GCGCTCGATC TCCCTGCAGG CGTAGCGACG
ATGACTACGA CCCACCACAC TGCGGCCGGC GCCGAAGCGC TCTCGCGTGC GCTCGACGCC
GACGGGCCCG CCGTTGTCAT CGCCGCCGAC TGTCCCGAAG GGGAGCCGGC CGACGCGGAC
CATCCGTTCG GCGCCGCCGG GGCAGCGTTC GTGATCGACG ACGATCCGAT CGTTCCAATC
GACGACGTCG CGTGGCACAG CGACGAGACG CCGGGGATTC GGTTCCGCGA GCGCGGCGAC
CGCGACGTCG ACTCCCTGGG AGTCACGACG TACGAGCGGG ACGCGGTTCG CGAGGCGGTA
ACGACGGCAG TGTCGTCGCT CGAGATCGAC GCGGCCGAGG CGACCGGTGC GGCGGTGCAC
CAGCGTGACG GTGGCTTCCC CTATCGGATC TCGAGCGATC TCTCGGTCTC GTCCGAGGCC
GTGGCCGCGG GGACGGTAGC CGACCGGATC GGCGACGCCG GCGCGGCGAC GGTCCCGGTT
GGACTGCTCT CGGCGCTGGA CGGAGCCGAC ACCGACGAAC TGACCGTCGC CGCCTTCTTC
GGCGGCGGTA GCGCGGCCGC GCTCACTTGC GAGGGATCGC TTCCGGTTCG CGGAATCGAC
GACCTCGAGT CGACGGAGAC GGTCGATTAC TCGACGTACC TCCGCGAGCG CGGATACATC
GTCGACGGCG AGGTCGCCGG CGGCGGTGCG AACGTGAGTT TGCCGAACTG GCAGCAGTCA
CTCGATCACC GATACCGACT CGTCGCCGGC GCGTGTCCGA ACTGCGGTGG CGTTACCTTC
CCGCCCGCCG GCGCCTGTCA GGAGTGTCAC GCACGTGTCC AGTTCGAGGA GTTCGAAGCA
CCCCGAACCG GGACGGTTCG CGCGGTGACC GTCATCGAAC AGGGCGGTGC CCCGCCCGAA
TTCGCGGACC TCCAGCAACG CGACGGCGCG TACGCCGTCG CGATCGTGGC ACTCGAGACA
GAACACGGCT CGGTTACGCT CCCCGCCCAG CTCACGGACG TCGATCCGCA ATCGGTGTCG
GTCGACGACA CCGTCGAGGC CGCGATCCGT CGGATATACA CGCAGGAAGG CGTCCCGCGG
TACGGCGTCA AGTTTAGGCC GACCGACGAG GGTAGCGACT GA
 
Protein sequence
MRGLVAAGVY VPRFRLSADD LEAAWETSHA AGVERKAVPA ADEDSLTMAV VAAQRALADA 
AVDRSAIETV AVATTTPPLE EGDFVPRLVR ALDLPAGVAT MTTTHHTAAG AEALSRALDA
DGPAVVIAAD CPEGEPADAD HPFGAAGAAF VIDDDPIVPI DDVAWHSDET PGIRFRERGD
RDVDSLGVTT YERDAVREAV TTAVSSLEID AAEATGAAVH QRDGGFPYRI SSDLSVSSEA
VAAGTVADRI GDAGAATVPV GLLSALDGAD TDELTVAAFF GGGSAAALTC EGSLPVRGID
DLESTETVDY STYLRERGYI VDGEVAGGGA NVSLPNWQQS LDHRYRLVAG ACPNCGGVTF
PPAGACQECH ARVQFEEFEA PRTGTVRAVT VIEQGGAPPE FADLQQRDGA YAVAIVALET
EHGSVTLPAQ LTDVDPQSVS VDDTVEAAIR RIYTQEGVPR YGVKFRPTDE GSD