Gene Hlac_1784 details


Gene Information

Locus tag: Hlac_1784
Symbol:
ID: 7399657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1802154
End bp: 1803242
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 69%
IMG OID: 643708850
Product: hypothetical protein
Protein accession: YP_002566433
Protein GI: 222480196
COG category: [R] General function prediction only
COG ID: [COG4801] Predicted acyltransferase
TIGRFAM ID:
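
The length fields in this record are consistent with the coordinates, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(1802154, 1803242)
print(gene_len, "bp,", protein_length_aa(gene_len), "aa")  # 1089 bp, 362 aa
```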


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.19562
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCTGA GGGGAGGCGG TCCGATAGAG GAGCTCGCCG TCCCATCCGG GACGACCGTC 
GAGGAGCACG ACCTCGTCAC CGACGGCGAC GTGCTCGTCG GCAGCCAGTC GACCGTGGAG
TTCGGGCTCC GCGGGCGCAA CGTCGCGCTC GGCGAGCGTG TGAGCGTCGA AAACGACATC
GAGGCCGAGG GCGACTGCCG GCTCGACACG TGGTGTTCCG TCGACGGCAA CGTCCTCGTC
GGCGAGGACG CGTACCTCGG CGAGCGGGTG ACCGTCACCG GTCGACTGAT GGTCTCCGGC
GACCTCGACA TCGGCGACGA CGTGACGATC GAGGAGGGGT TCGAGGCGAA CGGGTGGATC
GTCATCCGCA ACCCCGTCCC CACCCTCGTC TTCTACTTCA TCGTCCTCTC TCAGCTCCTG
CGGCTCGGCG AGACCGACGC GGCCGACAAC CTGGCGGAGG CGCTCGCCGA CGGCGAAGAC
GTGCGTGACC CCCTGCTGGT CCCGCGTAGC GCCGAGATTT CCGACGACGC GTGGCGCGTT
TCGACGCCCG CGAGCGTCGG CGACGACTGT CGGCTCCACG GCAACCTCCG CGCGGAGTCG
ATCCGCGTCG GCGAGCGCAA CGAGGTGTTC GGCTCCCTGC GCGCCCGAGA GGGGATCACA
GTCGGCGCGG ACACGACGAT CCACGGCGAC GTCACTACTC GCGGCGGAAC CGTCACGGTC
GAAGCCGGCG CCCGCGTGCT CGGCGACGTC TCCGCCGGCG ATCTCGTCGT TCACGACGGC
GCCGAGATCG ACGGCACCCT CCGCGCTCGC GGCGAGATGA AACTCGTTCA AGAAACCGGC
GATGGCGACG AAGGTGAGGG CGAGACTGAG AGCGATGATG CCGGCGAAGA CGAGGGCGAT
GCCGACGAGA TCGGCGAGAC AGACGCCGAC GAACTATCCG ACGAGGACGG GACGTCCGAC
GGCGACGACT CCGCGGACGG CGAGGAGTCG GACTCCGACG AGTCCGGCGT CGAAGAATCA
GACTCCGGAG GGTCAGATGT CGAGAGCCCC GACACCGAGG AACCAGACGT GGACGCGGAA
GCGACGTAG
 
Protein sequence
MSLRGGGPIE ELAVPSGTTV EEHDLVTDGD VLVGSQSTVE FGLRGRNVAL GERVSVENDI 
EAEGDCRLDT WCSVDGNVLV GEDAYLGERV TVTGRLMVSG DLDIGDDVTI EEGFEANGWI
VIRNPVPTLV FYFIVLSQLL RLGETDAADN LAEALADGED VRDPLLVPRS AEISDDAWRV
STPASVGDDC RLHGNLRAES IRVGERNEVF GSLRAREGIT VGADTTIHGD VTTRGGTVTV
EAGARVLGDV SAGDLVVHDG AEIDGTLRAR GEMKLVQETG DGDEGEGETE SDDAGEDEGD
ADEIGETDAD ELSDEDGTSD GDDSADGEES DSDESGVEES DSGGSDVESP DTEEPDVDAE
AT
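
The listings above are printed in space-separated blocks of ten, so they need light cleaning before use. The following is a minimal sketch, not the database's own pipeline, of how the gene sequence could be cleaned, its GC fraction computed, and the first codons translated. It assumes NCBI translation table 11, in which the amino-acid assignments match the standard code and an initiating GTG (as in this gene) is read as methionine. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11 (assumption stated in the lead-in).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = clean_sequence(
    "GTGTCGCTGA GGGGAGGCGG TCCGATAGAG GAGCTCGCCG TCCCATCCGG GACGACCGTC"
)
print(translate(first_line))  # MSLRGGGPIEELAVPSGTTV
```

Passing the full gene sequence through `clean_sequence` and `translate` is expected to reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field in the record.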