Gene Hlac_3174 details

Gene Information

Locus tag: Hlac_3174
Symbol:
ID: 7399303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 404865
End bp: 407894
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 60%
IMG OID: 643706974
Product: hypothetical protein
Protein accession: YP_002564596
Protein GI: 222476075
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
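
The length fields above follow from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 404865
END_BP = 407894


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 3030 1009
```

Under these assumptions the computed values agree with the listed Gene Length (3030 bp) and Protein Length (1009 aa).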


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTGATC GCTTCTTCGG TCGCGACGAC GACCAAGACG CTGCGTCGAA TGCAGACGCT 
GCCTCGGACG AGTCAGCAGA GAGTGACGCA GGCCCGATGG CTGAGACCAG TCCGGGCCCG
CAGTCACTGC TTGGCGAAAC GTACGACATC ACCGAACAGA ACACGGACTA CGTCGGGAGC
AAGCCCGTCA TCACCGAGAC GCAAAATGAG GGCACTGTCG CCGGCTCATA CATCCGCGAG
ATGCTCGAAT CGGGCGTCGA TACAGCACCA TCGCCGCTGT GGATCGGCTA CGACGAGGAC
GCCCAACGAG GGTTCCGTGA AGCACCGCTC CAGTTTGAGT CGCTCTTCCG GCACATCTGG
ATCACCGGCA CCACCGGCTA CGGGAAAACG ACGGAACTAT TGAACATGAT GGTCCAGTGG
GCGTATTCAG GGTACGGATT CGTCTACTTC GACCCGAAGG GCCGTGACTC TCGGGAACTC
CTGCGGATGC TCCCTGAAGA CCGGCTCGAC GATGTTGTCT GGATCGAACC CGGCTCATCC
GAACACGACA AGACGGTCGG GCTGAACTTC CTCGAAGTTC CCGACTGCGA AACCGAAGAA
GAGCGTGAAA ACGAGATCGA GAATCGCATC GAGAACCTCA AAGCGATCTT CGATACGGAC
GAGTACTGGG GGATCAACAT GGAGTCGATC ACCGAGTCGA TGGGCCGGGC AATGATGCAG
TCCGACCAGC CGTTCTCGGT GATCGATATG TACTTCACGC TGTTGCACGC TGAGCGCCGC
GAAGAGTTCG CCCTCGACGT CGACGATCCC TACGTTAGGG AGTTCTGTCT CGAAATCGCG
CGGATGGACG ACGAAACCGT CCGGCCGCTG CTCAAACGCA TCAAGTCGTG GGTCGAAAAC
TCGGTCATCC GGCGGATCAT TGCCCACCGC GAGAGTACCA TCGACTTCCG TGATATCATC
GACAACGACC GGATCGTCAT CGTCCGTACG CCCGTCGAGA ACACGGACAT CAAGAAGATG
GTCACACTCG GCGTGATGCG AAACCTCTGG AGCGCCATCC AACAACGGTC GTACGAACGC
GATACCGATC CAGATCCGTA CTTCGTGCTC TGTGACGAGT TCGACGACAT CGCGAGCGAC
AACCTCGACA TCGAGTCGAT GCTCGCTCGT GCCCGGTCGA TGCGCCTCTC CGTGACGTTG
TCGTCGCAGT ATCCCTCACA GTTCGGTGAG GACACGCGCA AGGCGATGCA GAACAACTGC
GACAACCTCA TCGCCTTCTC CGTGAACGAC GTCGACGACG CCCAGCTCTT GATGAAGCGC
TTTCGCGACT ACACGGCCGA AGACCTCATC TCGACCGACC AGTACCAGGC CTGGACGAAA
CTCCCACTGG AGGGTGGTCG GTACTCCGAG CCAGTGCTCC TCCGAACGTT CCCGCCGTAT
CCACCACTGC GGTCGGCCGA TTCGGTCGAC GACATCATCG AGGCGAGTTT AGACCGGTAT
GGAACCGACC CGCTTACCGA CGCGGAGATC ATGCAGAACC TCATCTACAG CGACGCCAAC
GAGGCAGCGA ATCCGACACA GATTCTGGAC AAGACGATGG CCGAGGCGAT TCGTGCCGTT
CAAATCCGAG AGAACGCCCG CGAGGCAAAC GGCTGGGTGG ACATCGTCGC CATCGACGAG
GAACTCTCGA CACGCCTCGA GAATAGTCAG GCTGAGATCG ACTACGAGTA CGAGGACTTA
CCGGACGTGC GTGACGCGTC GCGGTTAATC GATGTCGAGT TACAGGACGG CGAGATTGTG
GCGCGGCTCT CCGACGACGG GGAAACGGAG GTGCAGCCCG AAACGGGGAC TGCTAGATCG
GCAGGAGGAA TGAGCCACGA TGCTGTACTC ATGGATACTG AGACGGCGCT CACCGAGCGT
GGGTTCAGTG TTGATATTCT CGAACAGGAC GGGAGCGAGC AACCCGACGC CACCGCGACG
CACCCAGACC ACGACGTGGT GTTCAACATC GAGGCCGAGA CGACGACGCC TGACCGACCC
GCGAAAGTCT TGCAGAACCT CAAACGAGCG CACGACGCAG ATCGCGTCCC GCTCTTCGTC
GTCCGCCCTG GCGATCCTGA GACCGAGTGG GCGACGCGAC TCGAGAATAT TCTCTCGCCG
CCGCTGCGAG AGCGGGCGGA CGGCACCGAG CAGTTCTACA ACTGCGATGA GATCGTGACC
TTCGGCGGTG GGGCGACCGC CCACGGTGGG GTAACCGCCG TTCGGCCCCG GACGGCCGAC
ACAAATCGGA CAGTCTGGAC GCGCGACGAT GGTGAGCGTG TGCTGTCAGA CGGCGAGACG
GAATTCGCCC GCGTTCCAGA TGAGGGGGCG CTCTCGAAAG ACGGCGTCCC GGCCTACTAC
AGTTACGACC ACGAAACCGG GCAGTACACC GTCCATAAAC CCGGAGAGAC CCGCGTCTTT
GACTCGAAAG ACGAGTTCAA GGCCGAGTGG ACGCCGATCA AGCGACCGTT CATTCCCGAT
GTCGAACTCC CAGATGCGGA CTATTCGCGT GACAGCTACG TCGTGTTGCT GCTTCCGGAG
GACGGTGCTC CTACGGTATT CCAACAGGGA GAATCGTATA CACTCTCCGA GAGCCCAGAC
GAGGAGGACT TGTGGCCGGA CGCCCCCACC AGTGAGCACG CACAGGCCAC AATATCGATG
GATTCCGGTG AGGAGACTCC ATCGACGGAG TCTACCGCAC CTTCAGATGG GTCGGTGGAG
ATCGATCCGA GTGGTGACGG CATCGAGGCG TTCGCAGCGA TGTACATCAG AGAAGCTGAG
GGAGCCCAGG TGCCAAAGGA AACGCTGTTT CAGGCGTACT CGGCCTGGAC TGACCAGCAC
GATATTGAGG GGACAAATGC GAGTTGGTTC GGTCGAAAGC TCGCGAACGT CGTCGAGTAT
GAGAATGATC GGGTCAGAGA CGGCGACGAT CTGGTGACTG TCTACACCGG TGTCGACCTG
ACGCCAGCGG GATCACAGTT CCTTGAATAA
 
Protein sequence
MFDRFFGRDD DQDAASNADA ASDESAESDA GPMAETSPGP QSLLGETYDI TEQNTDYVGS 
KPVITETQNE GTVAGSYIRE MLESGVDTAP SPLWIGYDED AQRGFREAPL QFESLFRHIW
ITGTTGYGKT TELLNMMVQW AYSGYGFVYF DPKGRDSREL LRMLPEDRLD DVVWIEPGSS
EHDKTVGLNF LEVPDCETEE ERENEIENRI ENLKAIFDTD EYWGINMESI TESMGRAMMQ
SDQPFSVIDM YFTLLHAERR EEFALDVDDP YVREFCLEIA RMDDETVRPL LKRIKSWVEN
SVIRRIIAHR ESTIDFRDII DNDRIVIVRT PVENTDIKKM VTLGVMRNLW SAIQQRSYER
DTDPDPYFVL CDEFDDIASD NLDIESMLAR ARSMRLSVTL SSQYPSQFGE DTRKAMQNNC
DNLIAFSVND VDDAQLLMKR FRDYTAEDLI STDQYQAWTK LPLEGGRYSE PVLLRTFPPY
PPLRSADSVD DIIEASLDRY GTDPLTDAEI MQNLIYSDAN EAANPTQILD KTMAEAIRAV
QIRENAREAN GWVDIVAIDE ELSTRLENSQ AEIDYEYEDL PDVRDASRLI DVELQDGEIV
ARLSDDGETE VQPETGTARS AGGMSHDAVL MDTETALTER GFSVDILEQD GSEQPDATAT
HPDHDVVFNI EAETTTPDRP AKVLQNLKRA HDADRVPLFV VRPGDPETEW ATRLENILSP
PLRERADGTE QFYNCDEIVT FGGGATAHGG VTAVRPRTAD TNRTVWTRDD GERVLSDGET
EFARVPDEGA LSKDGVPAYY SYDHETGQYT VHKPGETRVF DSKDEFKAEW TPIKRPFIPD
VELPDADYSR DSYVVLLLPE DGAPTVFQQG ESYTLSESPD EEDLWPDAPT SEHAQATISM
DSGEETPSTE STAPSDGSVE IDPSGDGIEA FAAMYIREAE GAQVPKETLF QAYSAWTDQH
DIEGTNASWF GRKLANVVEY ENDRVRDGDD LVTVYTGVDL TPAGSQFLE
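
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the tool used to produce this record: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon assignments as the standard code and differs in its permitted start codons), and it treats an initiator codon such as GTG as methionine. The helper names `translate_cds` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def _clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    An initiator codon in the first position is translated as M.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = _clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = _clean(seq)
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGTTTGATC GCTTCTTCGG TCGCGACGAC"))  # MFDRFFGRDD
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared with the protein sequence above once the spacing is removed, and `gc_percent` gives a value to compare with the listed GC content.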