Gene Hlac_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2112 
Symbol 
ID7400632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2103026 
End bp2104273 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content70% 
IMG OID643709182 
Producthypothetical protein 
Protein accessionYP_002566759 
Protein GI222480522 
COG category[R] General function prediction only 
COG ID[COG4552] Predicted acetyltransferase involved in intracellular survival and related acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.607011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.41072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTACC GACCCTTCCC CGACGAACGC AGCGATGAGT TCGACGCGTT CATGCGCTAC 
GCGTTCTCTC CTGCCGAGGG CCCGTACGAC CCCGAGGAGG CCGACGACCA CGACACCATC
GCCGACACGC GGGGCCTGTT CGACACTGAC GACGACCCGG TCGCGGTCTG CGCGCACCAC
TCTTTCTCGC TGCGGATCCG CGGCGCCGAC CGCGAGGTCG CCGGGCTCTC CGCGGTCGCG
TCCCCGCCGG AACATCGCCG GCAGGGGAAC GTCGGCCGCA TGCTCCGGGA GTCGCTGACG
GAGTACCGCG ACCGGGGCGT CTTCGTCTCG ACGCTGTGGC CCTTCGAATA CCCCTTCTAC
GCCAGCTACG GCTGGGCGAC CGCGAGCCGC TATCGCTACC TCACCGCGCC CCCCGATCAG
CTCGGGTTCG TCGACGACCT GATCGCGACC GCGGGCGACG ACGCCGGGAG CTTCCGGCCG
CTCGACGAGG ACGACTACGC GGCCGTGAAG CCGGTGATAG CGGCGATGGC CGACCGCTAC
GACCTGACGA TGGACTGGAC CGAGGAGTGG TGGCGCGAGC GCGCTCTCCA AGGATGGAAG
ACCGACCCGT TCGTCTACGG CTGGGAGCGC GACGGGGACC TCCGCGGGAT CTGCGCGTAC
AGCTTCGACG ACGACGCGGA CGACGCGGAC GAAACGGTGA TGCGCGTCAC CGACGTCGCC
GCCGCTGACG ACGAGGCGTG GTTCCAGCTG CTGCGCTTTG TCCGCAACCA CGACTCGCAG
GTCGCCGAGG TCCGGATCCA AGCGCCGCCG GACGCACCCT TACTCGACCT CGTTGAGGAC
CCCCGCGCCG TCGACTGCGA GATCCGAACC GGGCCGATGG TCCGGCTCGT CGATGCCGCC
ACCGCGCTCG AAGCGCTCGA CCCCGATCCG GAGATCGAGA CCGCGTTCTC GCTTTCGGTT
TCCGACCCGC TCGTCGACTG GAACGACGAG ACGTTCCGGG TCGCCGTCGC TGACGGGACG
GTGGCGGTCG AGCCGACGGT GGACGGCGAG GTCGACAAGT CAGAAGTCGA TGGGGCCGAG
GCCGCGGACG CCGCGATCGA CATCGGTACT CTCTCGCAGC TGTACGTCGG CTACACGTCG
GTCGACGAGG CGGTCCGGAG CGACGGGCTC GCAGTCGGTT CCGCGCTCGC CGACGATCTT
CGCGCGGTGT TCCCGCCGCG GACGACGCAC CTTCGCGAGG GGTTCTGA
 
Protein sequence
MEYRPFPDER SDEFDAFMRY AFSPAEGPYD PEEADDHDTI ADTRGLFDTD DDPVAVCAHH 
SFSLRIRGAD REVAGLSAVA SPPEHRRQGN VGRMLRESLT EYRDRGVFVS TLWPFEYPFY
ASYGWATASR YRYLTAPPDQ LGFVDDLIAT AGDDAGSFRP LDEDDYAAVK PVIAAMADRY
DLTMDWTEEW WRERALQGWK TDPFVYGWER DGDLRGICAY SFDDDADDAD ETVMRVTDVA
AADDEAWFQL LRFVRNHDSQ VAEVRIQAPP DAPLLDLVED PRAVDCEIRT GPMVRLVDAA
TALEALDPDP EIETAFSLSV SDPLVDWNDE TFRVAVADGT VAVEPTVDGE VDKSEVDGAE
AADAAIDIGT LSQLYVGYTS VDEAVRSDGL AVGSALADDL RAVFPPRTTH LREGF