Gene Hlac_1536 details


Gene Information

Locus tag: Hlac_1536
Symbol:
ID: 7401466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1557347
End bp: 1559095
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 72%
IMG OID: 643708602
Product: thiamine pyrophosphate protein domain protein TPP-binding
Protein accession: YP_002566194
Protein GI: 222479957
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
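
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded, assuming an unspliced CDS.

    If the gene length includes the stop codon (an assumption here),
    one codon is subtracted because it encodes no residue.
    """
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length(1557347, 1559095)
print(length)                  # 1749
print(protein_length(length))  # 582
```

Both results agree with the Gene Length and Protein Length fields of this record.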


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACGG ACGAGTCCCA GAGCGGTACG GATCCGAGCG CGGTCGACGC TCCGACCGTC 
GCCGAGGCGG TCGTCGACTG CATGCTCGAC CGCGGGATCG ACGTCGCCTT CGGGATCCCC
GGCAAGCAGA CGCTCCCGCT GAATCGGGCG CTCGGCGAGC GCGACGCCCG GTTTGTCGTC
GCGCGCCACG AGACGGCCGT GACTCATCAG GCGTGGGGAT ACGCTGAGAC GAGCGATCCC
GGAGCGATGG CGGCGTCGAT CGTCGTCCCC GGCCCCGGCG ATATGAACGC GATGAACGGG
CTGAAAAACG CCCTGAACGA CTGCGTCCCT CTCCTCCACC TCGCGGTCGA GACCGAACGA
GAGGTCCGCG GCGGCGACGG GATCCACGAG ACGCCGCCCG AGACGTACGA CACGGTCGTC
AAGGAGAACG TCCTCGTCGA CTCGCCGGCC GGTGCGGTCC CCGCCGTCGC CGAGGCGATC
CGGGTCGCCC GCGAGCACCC GCAGGGACCC GTCCGCGTCG GGATCCCAAA AGATGTACTC
GCGAGCCGGA CGCCCCAGCC GGCGATCGGG GACCGAGAGC CGGCCGCGCC GCCGGACCCG
CCCGCGGACG CGGTCGATCG CGCGGCCGAC CTCCTCGCTG GAGCCGGTTC GCCGGTGATT
CTCGCGGGCG GCGGCGTCCG GCGCGCGGGC GCGAGCGACT CGCTCCGCGC GATCGCCGAG
CGCCTCGACG CCCCGGTCGT CACGACTTAT AAAGGAAAGG GGACGCTTCC CGAGACGCAC
CCCCTCTCCG CGGGCGTGCT CTGTGGCGGA TCGAGCACGG AGCTCCGCGA TCTCCTCGCC
GACGCGGACC GCGGGCTCGT CGTCGGCTCC GACCTCGACG CGGTCGCGAC CGCCTCCTGG
TCGGTGTCGC TGCCTGATTC CCTGATTCAC GTCACGCTCG ACGGCGACGA TATCGGCTTC
GGCTACGAGG CAGATCTCGG GATCGTCGCG GACGCTGACC GATTCCTGCA GGCGCTCGGG
GACCGGCTGG GAGACGAGGA GGAGGAGATG TTGGCCAGTG AGCCGGCGGG CTCGGCTTCG
CATCCTGAGT CGCCCGGCGC CGCCCGTGCC GACGCGGTCC GGTCCGCAGA CCGAGAGCGA
TTCGCCGCGC TCGCTGACGA GCGCAGCGCC AACGATCCCC TCCGTTCCGT CGAGGTCCTC
CGCGAGGTCC GCGAAGCGCT CCCAGCAGAG GCGGTCGTCA CCGCGGACGC CGGCGGGTTC
CGGCTGTGGA CGCTCGTCTC GTTCCCCGCG GCTGGCCCCT CGCGGTACGT GAATCCGGGA
TCGTGGGCGA CGATGGGGAC CGGGCTCCCG TCGGCGATCG GCGCCAAACT CGCGAACCCC
GACCGCGACG TGGTCGCCCT CACCGGTGAC GGCGGCCTCA TGATGTGCGT TCACGAGTTG
CACACGCTGG CCGCGGAGGG GATCGACGTG ACCGTCGTCG CGTTCACCAA CGACGACTAC
GCGATTATCA GCGAGGAGGC GTCGCGGTCG TACGACCTCC CGGCGGGCGC GTACGGCTGG
GCGGAGACCG CGATCGACCT CGTCGCCGTC GCATCCGGGA TGGGCGTCCG CGCCGAGCGG
GTGACCGATC GAGACGCGGT CGGCGAGGCC CTCACATCGG CGCTGGCCCA CGACGGACCG
GCTCTGATCG AGGTCGCCAC CGATCCGGAC GAGCCACAGG CGAGCGAGTG GATGACGCGG
GAACGCTGA
 
Protein sequence
MDTDESQSGT DPSAVDAPTV AEAVVDCMLD RGIDVAFGIP GKQTLPLNRA LGERDARFVV 
ARHETAVTHQ AWGYAETSDP GAMAASIVVP GPGDMNAMNG LKNALNDCVP LLHLAVETER
EVRGGDGIHE TPPETYDTVV KENVLVDSPA GAVPAVAEAI RVAREHPQGP VRVGIPKDVL
ASRTPQPAIG DREPAAPPDP PADAVDRAAD LLAGAGSPVI LAGGGVRRAG ASDSLRAIAE
RLDAPVVTTY KGKGTLPETH PLSAGVLCGG SSTELRDLLA DADRGLVVGS DLDAVATASW
SVSLPDSLIH VTLDGDDIGF GYEADLGIVA DADRFLQALG DRLGDEEEEM LASEPAGSAS
HPESPGAARA DAVRSADRER FAALADERSA NDPLRSVEVL REVREALPAE AVVTADAGGF
RLWTLVSFPA AGPSRYVNPG SWATMGTGLP SAIGAKLANP DRDVVALTGD GGLMMCVHEL
HTLAAEGIDV TVVAFTNDDY AIISEEASRS YDLPAGAYGW AETAIDLVAV ASGMGVRAER
VTDRDAVGEA LTSALAHDGP ALIEVATDPD EPQASEWMTR ER
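
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It uses the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGACACGG ACGAGTCCCA GAGCGGTACG"))  # MDTDESQSGT
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings are consistent; `gc_fraction` can be compared with the GC content field in the same manner.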