Gene Hlac_0235 details

Gene Information

Locus tag: Hlac_0235
Symbol:
ID: 7401161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 253845
End bp: 254906
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 74%
IMG OID: 643707298
Product: aminotransferase class I and II
Protein accession: YP_002564910
Protein GI: 222478673
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01140] L-threonine-O-3-phosphate decarboxylase
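
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(253845, 254906)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Under those assumptions the values agree with the Gene Length and Protein Length fields listed in the record.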


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.490059
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACCTCG ACACCGCGCT CGACCTCGAA CGCGAACCGC ACGGCAGCAG CGACGACCCC 
GACCCGCTGG ATTTCTCGGC GAACATCAAC CCCGAGGTTC CGCCGGGCGT TGAGGAGGCG
TACCGCGAGG CGTTCGCGGC CGCGCGGTCG TACCCGGTCG AGCCGCCCGA GTCGTTCCGC
GAGGCCGCCG CCGAGTACGT CGACTGCGAC CCGGACGCGG TCGTGCCGAC GCCGGGTGGA
CTCGCCGCGA TCCGCGCGGC GATCGCGCTC GCGGTCGATC CGGGGGACAC CGCCCTGATT
CCGGCACCGA GCTTCGGCGA GTACGCCCGC GAGGTGCGAC TGCAGGGCGG CGAGCCCGCC
TTCGTCGCTG CCGACGCGGT CCTCGACGCC GACCCGGCGG ACCACGCGCT CGCGGTCGTC
TGCGCCCCGA ACAACCCCAC GGGGACCGAC TACGAGCGCG CGGAGCTGGA GGGGTTCGCG
GCGCGGTGCC GCGCGGCCGA CACGCTCCTG CTCGTCGACG AGGCGTTCCG CGGGTTCACC
GATCGCCCCT CGCTCGCGGG GGAGGAGGGC GTCGTCGTCG CCCGGTCGCT GACGAAGCTG
TTCGGGCTCC CCGGGATCCG GGCGGGGTTC GCGGTCGCGA CGGGCAAGTT CGGCGCGGCG
CTGGAGCGCG CTCGACGGCC GTGGAACGTG AGCGTTCCGG CGCTGGCGAC CGGTGCGCAC
TGCATGCGGC AGGGGGGATT TATAAGGAGA ACCCGCGAGC GCATTCGCTC GGAGCGGTCG
CGGATGGCCG CGACGCTTGC GGAGCGGTAC GACGTGGCCC CCTCCGAGGC GCCGTTTCTG
CTGCTCGACG TGGGAGAGGG GGAGAGGGGG CGGTCCGTTG AGCAGGCCGT GGCCGACGCC
CGCGACCGCG GCGTCGCGAT TCGGGACGCA ACCACCTTCC GCGGGCTCGA CTCGCACGTC
CGGGTCGCGG TGCGCCGGCC CGCCGAGAAC GACCGCCTGC TGGCGGCGCT GGGCGTTGGC
GACGGGACGG CGACCGACCC CTCGGAGGCC GACGATGTTT GA
 
Protein sequence
MNLDTALDLE REPHGSSDDP DPLDFSANIN PEVPPGVEEA YREAFAAARS YPVEPPESFR 
EAAAEYVDCD PDAVVPTPGG LAAIRAAIAL AVDPGDTALI PAPSFGEYAR EVRLQGGEPA
FVAADAVLDA DPADHALAVV CAPNNPTGTD YERAELEGFA ARCRAADTLL LVDEAFRGFT
DRPSLAGEEG VVVARSLTKL FGLPGIRAGF AVATGKFGAA LERARRPWNV SVPALATGAH
CMRQGGFIRR TRERIRSERS RMAATLAERY DVAPSEAPFL LLDVGEGERG RSVEQAVADA
RDRGVAIRDA TTFRGLDSHV RVAVRRPAEN DRLLAALGVG DGTATDPSEA DDV
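
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, the sequence is assumed to be pasted in with its spaces and line breaks, and the codon table is the standard amino-acid assignment, which translation table 11 shares with the standard code (the two differ in which codons may serve as starts).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACCTCG ACACCGCGCT CGACCTCGAA"
print(translate(first_blocks))  # MNLDTALDLE
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field in this record.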