Gene Hlac_1456 details

Gene Information

Locus tag: Hlac_1456
Symbol:
ID: 7400283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1463552
End bp: 1464745
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 74%
IMG OID: 643708517
Product: glycosyltransferase-like protein
Protein accession: YP_002566114
Protein GI: 222479877
COG category:
COG ID:
TIGRFAM ID:
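
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1463552
end_bp = 1464745

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.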


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.443232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTACG TACAAGAGCG CGTGACCACG CTGCACGCGT TGACCGACCA CCGGCCGGAC 
GCCCCGACCG GCCGGGCGGC GGTCGTCGTG CCGATGACGG AACGCGAGTA CGGGACGCTC
GCGGCCGATC GGGTCCTGAC GGCGCTGGAG TCGGTCGATC CCGCCCGCGT CGTCGTCCCC
CTCCGTGCCC CCGCCGAGCG CGTCGGCCCC TTCGCCGACT GGCTCGACGG CTTCGACGTG
GACGTGGAGC CCCTCTGGTG TGGCGGCCCG CGGCTCACGG AGCTGCTCGC GACTCGCGGA
CTCGACGGGG ACCGCGGAAA GGGTCGAGAC GTGTGGCTCG GGCTCGGCCG CGCCTTAGAG
GAGGAGTTCG TCGTCGTCCA CGACGCCGAC ACGAAGACGT ACTCGCCCGC CTTTGTCACC
CGGCTGCTGT TCCCGCTCGC GCGCGGCCAC GACTTCTCGA AGGGGTACTA CGCCCGCGTC
GAAGACGGAT CGCTGTACGG GCGGCTGTTC CGGCTGTTCT TCCGGCCGCT GGTCCGCGCG
CTCGCCGACG CGACCGAGCG CCGCGAGCCC GGCATCTTGG AGTACCTCGA CGCGTTCCGC
TACGCGCTTG CCGGGGAGTT CGCGGCGACG ACCGACCTCG TCTCCAGACT TCGCATCCAG
CGCGGCTGGG GGCTGGAGGT CGGGACGCTC GGCGAGGCGT TCGCGCACGC CGGCTTCGCG
GGGAGCGCGC AGGTCGATCT GGGGCGGTAC GAGCACGACC ACCGCTCCGT CGACGGGCCG
ACCGGGCTCG CCGACATGAG CCGGGCGGTC GGCGAGGCGA CCCTGCGCGC GGTCGAGGGC
GCCGGCGTCG AGATCGAGTA CGACACGCTC GCCGACCGCT ACCGCGAGGC GGCCGACGGG
CTGATCCGCG GCTACGAGAC GGACGCCGCG TTCAACGGCC TCGACTACGA CCGCGGGGCC
GAACGCGAGC AGGTGGCGAC GTACGCCGAC GCCCTCGGCA AGCCAGAGCC GGACACCCGC
TTGCCGGCGT GGCGAGACGC GCCCGTCACG CCCGCCGAAG TCGGCGACGC GGCGCGAGCC
GACCTCGCGG TGGCGCGGGA TAAGGGGTCG AGGCGAACAG ACCGGAACCC GGGGAAGGCC
AACCGCCAGC GGCCCGCGGA CCCGAGCGCC GACGCCGCGC CGGGGGAGGA TTGA
 
Protein sequence
MEYVQERVTT LHALTDHRPD APTGRAAVVV PMTEREYGTL AADRVLTALE SVDPARVVVP 
LRAPAERVGP FADWLDGFDV DVEPLWCGGP RLTELLATRG LDGDRGKGRD VWLGLGRALE
EEFVVVHDAD TKTYSPAFVT RLLFPLARGH DFSKGYYARV EDGSLYGRLF RLFFRPLVRA
LADATERREP GILEYLDAFR YALAGEFAAT TDLVSRLRIQ RGWGLEVGTL GEAFAHAGFA
GSAQVDLGRY EHDHRSVDGP TGLADMSRAV GEATLRAVEG AGVEIEYDTL ADRYREAADG
LIRGYETDAA FNGLDYDRGA EREQVATYAD ALGKPEPDTR LPAWRDAPVT PAEVGDAARA
DLAVARDKGS RRTDRNPGKA NRQRPADPSA DAAPGED