Gene Hlac_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1610 
Symbol 
ID7399559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1629306 
End bp1630736 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content70% 
IMG OID643708676 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_002566265 
Protein GI222480028 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.731117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.416595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCACG ATCAGGCTGA CTACGCGCGC TCGGACGAGG ACGTTCACGA GCCGCTGCTC 
TCGGAGAAGG ATGAGACGAC CGTCGACGTA CGTGAGTCGT CGCCGCTTCC CGACGAGCTG
ACGCGCGAGG AACTCACCCT GCCGGCCCCC TCCGAGCCGG AGATCGCCCG TCACTACACC
CGGCTCTCGC AGATGACCTG GGCGATCGAC TCGGGACCGT ACCCGCTCGG GAGCTGTACG
ATGAAGTACA ACCCGAAGTT CACCGAGGAC GTGGCTGTCG ACCCGAACGC GGCGATCCAC
CCGGACCGCT CGGCGCGGGC CGCGCAGGGG AACCTCGAAC TGCAGTACCG GCTCCAGGAG
TACCTCGGCG AGATCGGCGG GATGGACGCC GTCACGCTCC AGCCGCCCGC GGGCGCCGCC
GGCGAGTTCA CCGGTATCGC GGTCGCGAAG GCGTACCACG AGCACCACGG CAACGACCGG
TCCGAGGTCG TGATCCCCGC CTCCGCGCAC GGGACGAACT TCGCGACCGC CGCGTTGGCG
GGCTACGACG TGGTCGAACT CCCATCCGGC GACGACGGGC GAGTCGACGT CGAGGCGCTT
GAGGCGGCCG TCGGCGACGA CACCGCCGCG CTCATGCTGA CCAACCCGAA CACGGTCGGG
CTCTTCGAGC GCGACATCGC GGAGATCGCC GACATCGTGC ACGACGCGGG CGGACTCCTC
TACTACGACG GCGCGAACCT CAACGCCCTG CTCGGCCGCG GTCGCCCCGG CGACATGGGG
TTCGACGTGA TGCACTACAA CGTCCACAAG ACGTTCGCGA CCCCGCACGG CGGCGGCGGT
CCGGGCGCTG GCCCGGTCGG CGTCGTCGAC GAGCTGGCCG AGTTCCTCCC CTCCCCGCGA
GTCCGGGAGG CGGACGAAGG GAGCGGCTAC GAGCCGTTCG AGCCGGACAA CACGATCGGG
AAGGTTCACG GATTCACGGG CAACTGGCTC GTCCTGATAA AGACGTACGC GTACATCGCT
CGCCTCGGCG ACGAGGGGCT CGCGGACGCG GCCGCGAAGG CGGTACTCAA CGCGAACTAC
CTCGCCGAGC GGATCGACCT CGACGTGCCG TACGGCCCCT TCCACCACGA GTTCGCGGCG
ACCGCCGGCG ACCGCGACGC CGCCGACGTC GCCAAGCGCA TGCTCGACTT CGGCGTCCAT
CCGCCGACGA CGAAGTGGCC GGAGATGGTG CCGGAAGCGA TGTTGACCGA GCCGACCGAG
ATCGAGGGGC AGTCGTCGCT CGACGACCTC GCGGAGGCGT TTAACCTCGC GTACGCTGAC
TCGGACGAGG CGCTCGGTGC GGCCCCGAAC CGCACGGCTG CGAGCCGGAT CGATCAGGTG
AGCGCCGCGC GAGACCCGCG GCTCTCGTGG CAGGCGCTGG ACGGGGAGTA G
 
Protein sequence
MIHDQADYAR SDEDVHEPLL SEKDETTVDV RESSPLPDEL TREELTLPAP SEPEIARHYT 
RLSQMTWAID SGPYPLGSCT MKYNPKFTED VAVDPNAAIH PDRSARAAQG NLELQYRLQE
YLGEIGGMDA VTLQPPAGAA GEFTGIAVAK AYHEHHGNDR SEVVIPASAH GTNFATAALA
GYDVVELPSG DDGRVDVEAL EAAVGDDTAA LMLTNPNTVG LFERDIAEIA DIVHDAGGLL
YYDGANLNAL LGRGRPGDMG FDVMHYNVHK TFATPHGGGG PGAGPVGVVD ELAEFLPSPR
VREADEGSGY EPFEPDNTIG KVHGFTGNWL VLIKTYAYIA RLGDEGLADA AAKAVLNANY
LAERIDLDVP YGPFHHEFAA TAGDRDAADV AKRMLDFGVH PPTTKWPEMV PEAMLTEPTE
IEGQSSLDDL AEAFNLAYAD SDEALGAAPN RTAASRIDQV SAARDPRLSW QALDGE