Gene Hlac_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2371 
Symbol 
ID7401989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2362795 
End bp2363847 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content66% 
IMG OID643709444 
Productglyceraldehyde-3-phosphate dehydrogenase, type I 
Protein accessionYP_002567016 
Protein GI222480779 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.257564 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAT CATATCTCAC CGCGGATGAC GACGTGAGCG ACGACGAGGT CGTCCGGGTG 
GGCCTCAACG GCTTCGGTCG GATCGGGCGA AACGTGTTCC GTGCGGTGCT GGAGTCGCCG
CGCATCGATC TCGTCGGCAT CAACGACGTG ATGGACTTCG ACGACATGGC GTACCTCGCG
AAGTACGACA CCGTCATGGG GCGGCTCGAC GGCGTCGAGC GCGACGGCGA CGCGCTGACG
ATCGGCGACA CCTCAGTTCC GCTGTACAAC GTGCAGGACC CCGCGGACCT CCCGTGGGAC
GAGCTCGACG TGGACGTGGC CTTAGAGTGT ACGGGCGTCT TCCGTACCCG CGAGGACGCG
AGCGCGCACC TCGACGGCGG CGCAGACACC GTGATCATCT CGGCGCCCCC GAAAGGCGAG
GAGCCGGTCA AACAGCTCGT CTACGGCGTC AACCACGACG AATACGAGGG CGACGACGTG
ATCTCGAACG CCTCCTGTAC GACTAACTCC ATCACGCCGG TCGCGAAGGT GCTCGACGCG
GAGTTCGGCA TCGACGCCGG CACCCTCACC ACGGTTCACG CGTACACCGG CTCCCAGAGC
CTGATCGACG GGCCGAAGGC GAAGACCCGC CGCGGTCGCG CGGCCGCCGA GAACATCGTA
CCGACCTCGA CGGGCGCCGC GGGCGCCGCA CAGGAGGTTC TCCCGCAGCT TGAGGGGAAG
ATCGACGGGA TGGCGATGCG TGTTCCCGTC CCGTCCGGCT CGATCACCGA GTTCGTCGTC
AGCCTCGATG AGAACGTCAC CGCGGACGAG GTCAACGCCG CCTTCCGCGA CGCCGCCGAC
TCCGGGCCGC TCGCGGGCGT GCTCGGCTAC ACCGACGACG AGGTCGTCTC CAGCGACATC
GTCGGCCTCC CGTTCTCCAG CTACGTCGAC CTGCAGTCGA CGAACGTCAT CGCCGGTGGG
AAGCTCCTGA AGATCCTCAC CTGGTACGAC AACGAGTACG GCTTCTCGAA CCGGATGCTC
GACATGGCCG CGTACGTTCA GGACGAAGCG TAA
 
Protein sequence
MSKSYLTADD DVSDDEVVRV GLNGFGRIGR NVFRAVLESP RIDLVGINDV MDFDDMAYLA 
KYDTVMGRLD GVERDGDALT IGDTSVPLYN VQDPADLPWD ELDVDVALEC TGVFRTREDA
SAHLDGGADT VIISAPPKGE EPVKQLVYGV NHDEYEGDDV ISNASCTTNS ITPVAKVLDA
EFGIDAGTLT TVHAYTGSQS LIDGPKAKTR RGRAAAENIV PTSTGAAGAA QEVLPQLEGK
IDGMAMRVPV PSGSITEFVV SLDENVTADE VNAAFRDAAD SGPLAGVLGY TDDEVVSSDI
VGLPFSSYVD LQSTNVIAGG KLLKILTWYD NEYGFSNRML DMAAYVQDEA