Gene Hlac_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0190 
Symbol 
ID7402119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp204282 
End bp207578 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content70% 
IMG OID643707253 
Producthypothetical protein 
Protein accessionYP_002564865 
Protein GI222478628 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.363536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAAAGC TATCGCCGGG AACGAACGCG TCGTGTTCCG GAACGCGGTG TCGGTCCTTC 
ACCGTCGACG AGCGTGCACG AGTCCCGTTC GCGCTGATCG GTGTTCTCCT CCTCGTGACG
AGTTCGACGT ACGCCGCAGG GATCGCAGAT CAGGGACTCA TCGGCGAGGA CCGGAGTGTG
GAGCGCGCCG TCGAGCGCGT CGACGCCGAT TCTACCGCCG CGCTCAGGGC CGCAGCGCGT
CAGGCCGCGC ACGACGCGGC CGCAGAGCCG GTGACGCGAG CGCCGGACGG ATCAGCCGCC
GGCTCGGAAG ACGGACACGG ATCAACCGCA GTCAGGAACG GATCGGCCTT CGAAGACGCG
TTCCGGATCC GGCTCGCGAT CGCCGGTTCC GAGGCGCTGG CAGCCGTGGA CGCCGAAGTC
GGTGACGTGA CGGCGACGGC GTCGCTACCG GCGGTAGACG GTCCGAGTGA GCTGGCGGCG
GCGCGCGATC GAATCCGCGT CGAACCCGCC GCGAACGGAA CAGCGACACG GGTGACGTTC
GAAGGGGTAG AAACAACCGC GACGCGGAGC GGACGCACCG TCGTGGAGCG AACGGAAGAC
CGGACAGTCG TGGTGGCCGT GCCGACGCTC GCGGCACATG AGCGGACGGA GCGGTTCGAG
GAGCGGCTTA ACCGGGGTCC GGTCGAGGGG CCGGGGTTGG GGCGCCAGAT CACCGCAAGT
CTCTACGCGA TGACATGGGC GCGCGGGTAC GGGCAGTACG CGGGCGCGCC GGTCGAGAAC
GTGCTCGCGA ACCGCCACGT GGAGCTGTCG ACGAACGCGG GGATCGTCCG GACGCAGCGT
GACGTGTTCG GCACGAGCGA CCCGAACGCA CGCGGGGGAG TGGCGCGCGC GACGGCGCGG
ACCGGAGTCA CCGACCTGCT CAAGCCGACC GGCGTCGACG AGGGGTCGTG GACCGAGGCC
GTGCTCAGCG CGCCGACACC GGGGGTACGC GATGGTCCGG GCGAGAGAGA CGGTTCGGGC
GAGAACGACA CCCCGGGCGA TCAGCCGAGC GAGGACGCTG CGTTCGAGTT CGGAGAAGAG
CGGACGGACG AACGGACTTC GGTTCCGGTC GGCCGTGCAG CCGACGAGGG GGCGACGCGC
GTCTACGACG ATCTTGACGC GATCATCGAG AGCGCGTATC GGGTCGAAGC ACGCGTCGAG
ACGTCGACGA CGCAGGTTGT CGACGGCGGG CGTCCGACGC CGCCGACACC CTCTCCCTCG
GAATTCACGA CGACCGGCGA CTGGGAGCGC GTCGACCTGA CACGGACAGA ACGGTCGCCG
ATCATCTCCG GATCCGGCGT ACCGGAGGGT GTCCCGTCGG GAATCGTCGA CCCCGGCGAG
CGTGTCTCGT TCGGCCTCGA AACCCGCGAG GCGACCGTCA AGCGCGTTGC GGTCGCGGAG
TGGGAGCGAG TGACGGTCGA GCGCGGCCCG AACGGGAGCG TCGTCGACGA GCGGGTCCAT
CGGGCGACCA CACGCGACGC GGTGACCGAC CACTATCGGG TAAGGGTGTC GGTATCGGGG
GAACACGCCC CGACCGACGG AGCGCCCGAC CGGGCCACCG CGACGTTCGG CGCGGGCGAC
GCGAGCGACG GTCCGGACCT GCGCGACACG CCCCCGATCG CACGGGTCGA CCTCGATGTC
GACACCGAGA ACGGCGTCGA TCGGATCACC GAAGACGCGA TCAGCGGTGG GGAGGTGACG
CGATCAACGA CCGTTGTCGG CTCCCGATCG GAGGCCGATC GGAACGCCGT TGCGGCCGAC
GTGGCCGCAC TCGCGAGCGA CGTGCGCGAC ATCGAGACCG AGGCGTCGAT GGAGGATGCA
GCGGTCGGCG AGGCGGAACC GTACGCCGAT CTCGCCGACG CAATCCGCGA CCGACGCGCG
GAACTTGTCG ACGCGCCGGT GACGTACGAC GGAGCCGCTG ATCGGGCCGG AACCGCTGCC
CGCACCGCGT ATCTCGACGC CGTAATCGAC GAGCTCGAGT CCGCCGCAAG CGACCGCGAG
CGCGCGACTG ACGGGTTCCT CGACCGCGTG AACAGCGCCT TCGACGGCCC TGACGTTGGC
GACGCTCTCG CCAGCCGAGA GGCCGCCCGC GATTCCGGGA CGTACGCGAT CGGCGAGGAC
GGCCCCGGCG GAGCGGTGAC GTTCGAACCG AACGGCTCTC CGGGATACCT CCCGCGGACG
ACCGTCGACG GGGAAGCCGT CGACGGCGTG GATGGGACGA CGACGCGACC GCTCGCGGTC
CGCAACGTGA ACTACGTCAC CGTGCCGTAC GGGGAGGTGT CGAGCGGCGT CGTCGATCGG
ATCCTCGGTA CCGAGGACAC GGTACGGGTC GGAACTGCGG GCCGGTCGCT CCTCCTAGCG
AACGATGCGC TGGCAGCGGA CGACGATCCC GACCTGCGAG CAGATCGGGA TGCACTTGCC
CGGCAGATAG ACGGGTCGCT GGACGAGGTT GACGGGGCGC TCGGATCGAC ACTCCGCGTC
CGGACCTCGC TGTCGCGAGA CGAGCGACGC AGAGCGCTCG ACGAGGCCGC CGCGAGCTAC
GGTTCCCCCG GCGAGCGTGC GGTTGCCGTC GGAGACGGAT CCTATCCGGA TCGCGTCGCC
GCCGAGGCCG CAAGCGTCGG GTCGCTGTCG CGGGCGGCCG AAGAGGCGCT CGCCGCGAAC
TTACGCGTCG CGACGCGGAC CGCGGCCGGG AGGGACGCGG TTCGGGTCCC GACCCGGTTC
GTCGACGAGA CGACGAGCGG AGCGCGGGTG TTGCTCAGAG ACCGGATGGA AAAGGCGGTT
GAGAACCAAG CGGCGCGCGC GGGCAAAGCA GCTACCGGAA AGGTGAGCGA GAAGGCTGTA
AAAGAACTGG GCGAAAAGTG GTCGATGAAG CCCGCTCGAA CCGTCGGCGC AGGGCTCCCA
GTCGCGCCCG TTCCGGGGTA CTGGGTGACG ACGGTGAACG CCTGGCGCGT GCAGATCCGC
GGGGAGTACC CGCGATTCGC GCTGCGCGCC AATGTGGGAA CACCCGATAA ACGCTTTACG
TACGTCCGGA GCGAGGGCGA TGTGACCGTC GACGTGGGCG GAGAGACGGT GCAACTGGGC
AAGACGGAAC CGATCGCGTT CGAGGCCGAG ACGGTAGTCG CCGTCGCGGT CCCAGCCGGC
CCGCCCGGTG TCGGCGATGT CGACGGAACT CGCGATGAGC GGTCAGGGGG ATGGCCATGC
CCCGGCGCGA TGGGGGAATC GCCGTCGGGC GGCGAGGGGA AGTGTTCGAA GCCGTGA
 
Protein sequence
MGKLSPGTNA SCSGTRCRSF TVDERARVPF ALIGVLLLVT SSTYAAGIAD QGLIGEDRSV 
ERAVERVDAD STAALRAAAR QAAHDAAAEP VTRAPDGSAA GSEDGHGSTA VRNGSAFEDA
FRIRLAIAGS EALAAVDAEV GDVTATASLP AVDGPSELAA ARDRIRVEPA ANGTATRVTF
EGVETTATRS GRTVVERTED RTVVVAVPTL AAHERTERFE ERLNRGPVEG PGLGRQITAS
LYAMTWARGY GQYAGAPVEN VLANRHVELS TNAGIVRTQR DVFGTSDPNA RGGVARATAR
TGVTDLLKPT GVDEGSWTEA VLSAPTPGVR DGPGERDGSG ENDTPGDQPS EDAAFEFGEE
RTDERTSVPV GRAADEGATR VYDDLDAIIE SAYRVEARVE TSTTQVVDGG RPTPPTPSPS
EFTTTGDWER VDLTRTERSP IISGSGVPEG VPSGIVDPGE RVSFGLETRE ATVKRVAVAE
WERVTVERGP NGSVVDERVH RATTRDAVTD HYRVRVSVSG EHAPTDGAPD RATATFGAGD
ASDGPDLRDT PPIARVDLDV DTENGVDRIT EDAISGGEVT RSTTVVGSRS EADRNAVAAD
VAALASDVRD IETEASMEDA AVGEAEPYAD LADAIRDRRA ELVDAPVTYD GAADRAGTAA
RTAYLDAVID ELESAASDRE RATDGFLDRV NSAFDGPDVG DALASREAAR DSGTYAIGED
GPGGAVTFEP NGSPGYLPRT TVDGEAVDGV DGTTTRPLAV RNVNYVTVPY GEVSSGVVDR
ILGTEDTVRV GTAGRSLLLA NDALAADDDP DLRADRDALA RQIDGSLDEV DGALGSTLRV
RTSLSRDERR RALDEAAASY GSPGERAVAV GDGSYPDRVA AEAASVGSLS RAAEEALAAN
LRVATRTAAG RDAVRVPTRF VDETTSGARV LLRDRMEKAV ENQAARAGKA ATGKVSEKAV
KELGEKWSMK PARTVGAGLP VAPVPGYWVT TVNAWRVQIR GEYPRFALRA NVGTPDKRFT
YVRSEGDVTV DVGGETVQLG KTEPIAFEAE TVVAVAVPAG PPGVGDVDGT RDERSGGWPC
PGAMGESPSG GEGKCSKP