Gene Hlac_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2034 
SymbolglyA 
ID7402053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2026029 
End bp2027276 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID643709105 
Productserine hydroxymethyltransferase 
Protein accessionYP_002566682 
Protein GI222480445 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0609471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.358704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCACG AGCACGTCCG GGAGGTCGAT CCCGAGGTCG CCGACGCGCT GGCCGGGGAG 
CGAGACAGGC AGGAGCAGAC GCTCGCGATG ATCGCCAGCG AGAACCACGT CAGTGAGGCG
GTACTGGAGG CGCAGGGGAG CGTCCTCACG AACAAGTACG CGGAGGGGTA TCCGGGCGAG
CGCTACTACG CCGGCTGCGA GTACGCCGAC GAGGTCGAGA CGCTCGCGAT CGATCGCGCG
AAAGAGCTGT GGGGCGCCGA CCACGTCAAC GTCCAGCCGC ACTCCGGAAC GCAGGCGAAC
CAGGCGGTCT ACTACGCCGT ACTCGATCCC GGCGACAAGA TCCTCTCGCT CGATCTGAAC
CACGGCGGCC ACCTCTCGCA CGGCCACCCC GCGAACTTCA CCGGGCAGAT CTACGAGGTC
GAGCAGTACG AGGTCGACGC CGACACCGGC TACATCGACT ACGAGGGGCT CCGCGAGGCC
GCCGAGGAGT TCGAGCCGGA CATCGTCGTC TCCGGCTACT CCGCGTATCC GCGCACGGTC
GACTGGGAGG AGATTCAGGC GGCCGCCGAC GCCGTCGACG CCTACCACCT CGCCGACATC
GCCCACATCA CCGGGCTCGT CGCCGCGGGC GTCCACCCCT CGCCGGTCGG CGTCGCCGAC
TTCGTCACCG GCTCGACGCA CAAGACGATC CGGGCCGGTC GCGGCGGGAT CGTGATGTGC
GACGAGGAGT TCGCGGACGA CATCGACAAG GCGGTGTTCC CCGGCGGGCA GGGCGGGCCG
CTCATGCACA ACATCGCCGG CAAGGCGGTG GGGTTCAAGG AGGCGCTCGA TCCCTCGTTC
GACGAGTACG CGCAGAACGT GGTCGACAAC GCCGAGGTAC TCGCCGAGAC GCTGCAGGAC
CACGGGTTCT CCCTCGTCTC CGGCGGCACC GACAACCACC TCGTGTTAGT CGACCTCCGG
GACTCGCACC CGGACCTGCC CGGCGGCGAC GCCGCGGATG CGCTCGCGGC CGCGAACATC
GTTCTTAACG GGAACACGGT CCCCGGCGAG ACCCGCTCGC CGTTCAATCC CTCGGGGATT
CGCGTCGGGA CCGCCGGAGT CACCACTCGC GGCTTCGACG CGGACGTAAT GGAGGAGGTC
GGCGACCTCA TCCACCGCGT CGTCGACAAC GTAGACAGCG ACGACGTGAT CTACGAGGTC
GGCGAGCGCG TCGTCGAGCT GTGCGACGAG CACCCGCTGT ACGAGTGA
 
Protein sequence
MDHEHVREVD PEVADALAGE RDRQEQTLAM IASENHVSEA VLEAQGSVLT NKYAEGYPGE 
RYYAGCEYAD EVETLAIDRA KELWGADHVN VQPHSGTQAN QAVYYAVLDP GDKILSLDLN
HGGHLSHGHP ANFTGQIYEV EQYEVDADTG YIDYEGLREA AEEFEPDIVV SGYSAYPRTV
DWEEIQAAAD AVDAYHLADI AHITGLVAAG VHPSPVGVAD FVTGSTHKTI RAGRGGIVMC
DEEFADDIDK AVFPGGQGGP LMHNIAGKAV GFKEALDPSF DEYAQNVVDN AEVLAETLQD
HGFSLVSGGT DNHLVLVDLR DSHPDLPGGD AADALAAANI VLNGNTVPGE TRSPFNPSGI
RVGTAGVTTR GFDADVMEEV GDLIHRVVDN VDSDDVIYEV GERVVELCDE HPLYE