Gene Hlac_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1051 
Symbol 
ID7400123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1045437 
End bp1046618 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content66% 
IMG OID643708119 
Productcreatinase 
Protein accessionYP_002565718 
Protein GI222479481 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGAG AACACATCTT CGACGAGGCC GAGTACGAGC GGCGGGTGGC TCGGACGAAA 
GAGCGGTTGC GCGAGCAGAA CCTCGACGCG ATCGTGGTCG CCGATCCGGC GAACATGAAC
TACCTGACCG GCTACGACGG CTGGTCTTTC TACGTCCATC AGGCGGTCGT GGTCACGCCC
GATCGCGACG AGCCGATATG GATCGGTCGC GACATGGACG GCGACGGCGC GCGGGCGACG
ACGCACCTCT CCGACGACAG CATCCGCGCG TACAGCGACG ACCACGTTCA CTCACCGCAC
GACCTCCACC CGATGGACTA CGTCGCCGGC GTTCTCGAAG AGTTAGATGT CGCGGACGGC
CGGATCGGAT TGGAGATGGA CGCCGCCTAC TTCACCGCGA AGTCGTACAT GCGACTCCAG
CAGAACCTCC CGGACGCCGA GTTCGAGGAC GCGACGCTGC TCGTCGGCTG GATCCGTGTC
AAGAAGTCGG ACCAAGAGCT GGAGTACATG GAGCAGGCCG CGCGGATCTC CGAGAACGCG
ATGCGTGCCG GCCTCGACGC CATTGAGGAA GGAGTCCCGG AGTACGAGGT CGCCGCTGCG
ATCTACGAGC AGTTGATCGA GGGGACAGAG GAGTACGGCG GCGACTACCC CGCGATCGTC
CCGCTAATGC CGTCGGGCGA TCACACCGGG ACGCCACACC TCACGTGGAC GGATCGACCG
TTCGAGGAGG GCGACCCGGT CATCATCGAA CTCTCCGGCT GTCGGCACCG CTACCACTCG
CCGCTGGCCC GAACGACCTT CGTCGGCGAC CCGCCGGCCG AGCTGCAGGA GACCGCGGAC
ATCGTCGTCG AGGGGTTGGA GGCGGCGCTC GACGCCGCGG AGCCCGGCGT CAAATGCGAG
AGCGTCGAGA AGGCGTGGCG GACCACCATC GAGCAGTACG GGCTCGAAAA GGAGGATCGC
ATCGGGTACT CGATGGGGCT CGGCTACCCG CCGGACTGGG GCGAGCACAC CGCGAGCATC
CGGCCGGGCG ACGAGACCGT CCTCGAAGAG GACATGACGT TCCACATGAT CCCGGGCATC
TGGACCGACG AAATCGGCAT GGAGATCAGC GAGACGTTCC ACGTCACGTC TACCGGGGCG
GAGACGCTGG CCGAGTTCCC TCGCGAGCTG TTCACGGCCT GA
 
Protein sequence
MPREHIFDEA EYERRVARTK ERLREQNLDA IVVADPANMN YLTGYDGWSF YVHQAVVVTP 
DRDEPIWIGR DMDGDGARAT THLSDDSIRA YSDDHVHSPH DLHPMDYVAG VLEELDVADG
RIGLEMDAAY FTAKSYMRLQ QNLPDAEFED ATLLVGWIRV KKSDQELEYM EQAARISENA
MRAGLDAIEE GVPEYEVAAA IYEQLIEGTE EYGGDYPAIV PLMPSGDHTG TPHLTWTDRP
FEEGDPVIIE LSGCRHRYHS PLARTTFVGD PPAELQETAD IVVEGLEAAL DAAEPGVKCE
SVEKAWRTTI EQYGLEKEDR IGYSMGLGYP PDWGEHTASI RPGDETVLEE DMTFHMIPGI
WTDEIGMEIS ETFHVTSTGA ETLAEFPREL FTA