Gene Hlac_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1234 
Symbol 
ID7399502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1244583 
End bp1245614 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID643708298 
Productglycosyl transferase group 1 
Protein accessionYP_002565896 
Protein GI222479659 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000428229 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACATCG GATTCTTCAC CGACAGTTAC TTCCCCGGTA TCGACGGGGT AACGTACACG 
ATCCGCGCGT GGCGGGATCG GCTCGAAGAC CGCGGCCACG AGGTGTACGT CGTCTACCCG
GCGAGCAGCC ACGAGCCCGA CGATCGAGAG ATTCCCGTGC CGTCGCTGCC GAATCTCTTC
TACAGTCAGT ACCGCGTTCC GCTGTACCGC CGGATCTCGA CGCTTCCCGA TCTGGACGTG
GTCCACTGCC ACGGGCCGGC GTCGACCGGT CTGATGGGCC GCCGATACGC GAAGAAGCGC
GACGTGAAGT CGGTGTACAC CCACCACACG CCCGTGGAAG ACTACTTCGT CCAGGGGTTG
AAACTGGAGT TGCTGGCCGG GATCGCTGGC CGGGCGTACG TGGCCTACGA AAACCGGTTC
CTCCAGTCGT TCGACTGCGT CACCGCGTCC ACCTCGCGAA TCCGGCGGGA CGTGACACCG
CGGAAGCTCC CGGTCGGCAT CGAGATGGAC ACGTTCCGCC CGGTGACGGA CTCGCAGTTC
GCAAGCGACG AGCCGACGGT GGGATACAGC GGTCGGATGA CTCGAAAAAA ACACGTCGAC
GAGATCCTCC GGCTGGCCGA CCGGCTGCCC GACGTGCGGT TCGAACTGGT GGGCGAGGGA
CCGGTCCGGG ACGACCTCGA ACGGGGCGCC CCGGGGAACG TCCGGTTCCG CGACTTCCTC
CCGCGCGAGA ATCTTCCGGC GTTCTACTCC GCGCTCGACG TCTTCGTCAC CGCCTCGACC
TGCGACACGC TCGGGCTCTC GACGCTGGAG GCGAACGCCT GCGGGACCCC GGTCGCCGCC
GCCGACGTGC CCCCATTTGA CCGGACCATT GGGCCGGACA ACGGCACCCG GTTCGACCAC
GGCGACCTCG ACGACATGGA GCGCGCCGTC GTCGACTGTC TCGACGGCGA CAGGCCGACC
CGTGCGGCGG TCGAGGGGTT CTCCGTCGAG CGGACGATAG ACGACTTAGA GGAGATATAC
GGGGTGTCGT AG
 
Protein sequence
MNIGFFTDSY FPGIDGVTYT IRAWRDRLED RGHEVYVVYP ASSHEPDDRE IPVPSLPNLF 
YSQYRVPLYR RISTLPDLDV VHCHGPASTG LMGRRYAKKR DVKSVYTHHT PVEDYFVQGL
KLELLAGIAG RAYVAYENRF LQSFDCVTAS TSRIRRDVTP RKLPVGIEMD TFRPVTDSQF
ASDEPTVGYS GRMTRKKHVD EILRLADRLP DVRFELVGEG PVRDDLERGA PGNVRFRDFL
PRENLPAFYS ALDVFVTAST CDTLGLSTLE ANACGTPVAA ADVPPFDRTI GPDNGTRFDH
GDLDDMERAV VDCLDGDRPT RAAVEGFSVE RTIDDLEEIY GVS