Gene Hlac_0289 details

Gene Information

Locus tag: Hlac_0289
Symbol:
ID: 7401215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 311947
End bp: 313275
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 70%
IMG OID: 643707352
Product: threonine dehydratase
Protein accession: YP_002564964
Protein GI: 222478727
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
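
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(311947, 313275)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Under these assumptions, 1329 bp correspond to 443 codons, of which the last is the stop codon, leaving 442 amino acids.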


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.122969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACCGA TCCGACTCCG TTCCGCAACC GAGTCCTTTT CTGGGATCCC CGAGAACACC 
GCGCTCATGT CCTCCGTCAC GATCGCGGAC GTCGAGGCCG CCGCGGCCCG GCTCGAACCG
GCGGCGATCG TCCAGCGCAC GCCCGTCGAG CGCAGCCGGT CGCTGAGCGA GCGGTGCGGC
GCCGACGTGC GCCTGAAGAT GGAACACCTC CAGCGCACCG GCTCGTTTAA AACGCGCGGT
GCGTATAACG CGATCTCGCG GGCAGTTGAG CAGGCGGCGG AACGAGCCGA TGAGTCCGAG
CTCGACCGGG TCGTGGCCGC GAGCGCGGGC AACCACGCGC AGGGGGTCGC CCTGGCGGCG
TCGGGCACCG GGATCGACGC GACGATCGTG ATGCCGGAGT CGGCGCCGGC AGCGAAGATC
GAGGCGACCC GCGGGTACGG CGCCGAGGTC GTGCTCCGCG GGAGCGCGTT CCCGGAGGCG
ATGGCGCACG CGCAGACGCT GATCGACGAC CCCGGAACGC GGTTCGTCCA CGCGTTCGAC
GATCCGGACG TGGTCGCCGG GCAGGGAACG CTCGGGCTGG AGGTGCTCGA CCAAGTGCCA
GACGTGGACA CCGTGCTCGT CCCGGTCGGG GGCGGCGGGC TGGCAGGCGG GGTCGCGACC
GCGATCAAAG CGCGCTCGCC CGAGACGCGG GTGATCGGGG TCCAGACCGA GGGCGCCTCG
ACGCTCTCGG AGAGCCTCGC GGCCGGCGAA CTCGTGACGC GCGAGGAGCC GGACACCATC
GCGGACGGGA TCGCGACCGG CGGGCTGAGC GAGCTCACCT TCGGCCTGTT GAAAGAGCAC
CTCGACGACG TGGTCGTCGT GAGCGACGAC GACGTGGCCG CCGCGATTCT GCTCCTCTTG
GAGCGCGCGA AACAGATGAT CGAGGGCGCG GGCGCGACCG CGGCGGCCAC CCTTTTAAAT
GACGACGCTC TCGACGAGCT CGATCTGGCC GGCGAGACGG TGGTGCCGCT GCTCTGTGGC
GGCAACATCG ACGTCACGAC GCTGAAGGAG GTGGTGACGC ACGCCCTCGT GGAACGTGAC
CAACTGATCG AACTTGCCGT CCGGATCGAC GACACGCCCG GGACGATGGG CGAGATATCC
ACCCTGATCG GCGCGGAGCG CGCGAACATC CGGACGGTGC GCCACGAGCG CAGCCGGCCG
GACCTGCCGG TCGGCGACGC CGACCTCGTG TTCGAGGTGG AGACCAACGG GCCGGCCCAC
GTCGATCGGG TCCTGAAGGC GGTGCGCGAG GCGGGCTACG AGGTGGAGTG GACGACGCAG
GAAGGGTGA
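
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as GC content. The following Python sketch shows one way to do that; the helper names are hypothetical, and the example runs on the first two display blocks only, not on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence, as a small illustration.
fragment = clean_sequence("GTGAAACCGA TCCGACTCCG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.6
```

Passing the full gene sequence to the same two functions would give the value to compare against the GC content reported in the gene information.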
 
Protein sequence
MKPIRLRSAT ESFSGIPENT ALMSSVTIAD VEAAAARLEP AAIVQRTPVE RSRSLSERCG 
ADVRLKMEHL QRTGSFKTRG AYNAISRAVE QAAERADESE LDRVVAASAG NHAQGVALAA
SGTGIDATIV MPESAPAAKI EATRGYGAEV VLRGSAFPEA MAHAQTLIDD PGTRFVHAFD
DPDVVAGQGT LGLEVLDQVP DVDTVLVPVG GGGLAGGVAT AIKARSPETR VIGVQTEGAS
TLSESLAAGE LVTREEPDTI ADGIATGGLS ELTFGLLKEH LDDVVVVSDD DVAAAILLLL
ERAKQMIEGA GATAAATLLN DDALDELDLA GETVVPLLCG GNIDVTTLKE VVTHALVERD
QLIELAVRID DTPGTMGEIS TLIGAERANI RTVRHERSRP DLPVGDADLV FEVETNGPAH
VDRVLKAVRE AGYEVEWTTQ EG
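
The gene begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as an initiation codon and is then read as methionine. The Python sketch below illustrates this on the first and last codons of the gene. It is a minimal illustration, not the database's own pipeline: the start-codon set contains only the three most common initiators (the NCBI table lists further rare ones), and the names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# these for elongation and differs only in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Assumption: only the most common table 11 initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("GTGAAACCGA TCCGACTCCG TTCCGCAACC"))  # MKPIRLRSAT
print(translate("GAAGGGTGA"))                         # EG
```

The two outputs match the first ten and last two residues of the protein sequence shown above.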