Gene Hlac_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0194 
Symbol 
ID7402123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp209219 
End bp211045 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content72% 
IMG OID643707257 
Producthypothetical protein 
Protein accessionYP_002564869 
Protein GI222478632 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.6206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTC CGGATAGCGA TCTCGACGGC GCTGCGCGAC GCAGTGCGAC GGACGACGGT 
GAGGGGTCGG CCGAGCTCCG TCGAGCGATC GCGTTCCTCG GATGGGAGGC GACGAGCGCG
CGTGTGATCG AAACGGGATA TTGGGCCGGC GTGGGAGTCG GTCTCACCTG CGCCGTCGTC
GTCGCGTTCG TCTCTGGACC CGTTCCCGGC TTGTTGTCCG GACTGGCGGG CGGTCTCGGA
GCCACGCACA TCGCCCATCG GGGCCCAGTG TGGCTTGCCT CGCTCCGGCG GACCAGAGCG
CTCGGGGCCG CGCCCGGACT CGTCGGACGG CTCGTGTTGC GAATGCGGTT GGACCCGTCG
ACGGAGCGTG CCGTTCGATT CGCCGCACGG ACTGGCGACG GACCGCTGGC CGACGGCCTC
GCGCGCCACG AGCGAGCGAA CACCGATGGA CCGACCAGCG GGCTCCGGGC GTTCGCCCGC
GAGTGGCAGC CGTGGTTTCC GGCCATCGAT CGGGCAGCGG CGCTCGTCCG AACGGCGGCA
TCAGCGCCGC CCGAGCGTCG AGGGCGGTGT CTGGACCGCG CGCTCGATGC AACAATTTCC
GGAACGACGG ACCGACTTGC CGCCTTCGTC GGGGAAGTCC GCAGCCCCGT CTCGGCGCTG
TACGCGTTCG GGGTGTTGCT TCCCCTTGCG CTCATTGCTC TCCTTCCGGC GGCTGCCGCG
ACGGGCGTCC CAATCGGTCC CGGCGTCGTC GCGGCGCTGT ACCTCGGCGT GCTGCCGGCG
GGACTGCTCG CGGCGTCGGC ATGGCTGCTC TCCCGTCGCC CGGTCGCGTT CGCGCCCCCG
AGTATCGACG AAGACCACCC GGATGTCCCG GAGAGAGGAA CTCACGCCGC GGTCGCCGGA
TTGGGAACCG GAGCCATCGC CGCCGTCGTT ACAGCGCGGT TCGTCTCCGG ATGGGCCGCT
CCGGTCGCAG GGGTCGGCGT CGGAGCCGGC GTCGCGCTGT TCGTTGCCGT TCACCGCCGC
AGAGCGGTGC TGTCGAACGT CCGTGCCGTC GAGCGCTCCC TCCCAGACGC GATGACGGTG
ATCGGCGGCG ACGTGGCCGA GGGCGTCGCC GTCGAGACCG CAATTGCGAA CGCCGGCGAA
CGACTCGACG GCGCCACTGG GGAGCTGTTC GAGCGCGCTG GACGGCGGAG CGACACGCTC
CGCGTCGACG TTCGGGAGGC GTTCCTCGGC CGGGGCGGTC CGGCGGTTCC GGTTCCGTCG
CCGCGGGTGC AAGGCGCGGT CGCGCTGCTC GCCGTCGCCG GGCGTGAGGG GCGTCCCGCG
GGCGACGTGC TGCTCGAACT CGCCGACCAG TTGGAGGAGC TGCGCGAGCT CGAAAACGAT
GCGCGACGAC AGCTCGCGAC CGTGACGGGA ACGCTGACGA ACACCGCGGC GGTGTTCGCG
CCGCTCGTCG GCGGGGCGAC AGTCGCGTTG GCCACCGGAA TCGACGCGGT CGACGCTGGG
GGACTCGGCG CCGGAGCGAC GGCCGGCGCA GACGCGCTCG GCGGTGCCGG TGGTGTCGGT
GACCCGGGCA CAGTCGGTGC CGAAGCGAGC GGAACGACGA GCGGGGCGGG TGCAAAATCG
GACAGCGCCA GCGCACTCTC GGTGCCCGTG CTCGGACAGA TCGTCGGCGC GTACGTGTTG
ATCCTCGCGG CGCTGCTCAC TGCGTTAGCG ACAGGACTCG AACGCGGGTT CGACCGGACC
CTCGTCGCCT ATCGGGTTGG AATCGCGCTC CCGACCGCGG CCGCGACGTA CCTCGTCGCT
TTCCTCGGTG CCGGACTGCT GCTGTAG
 
Protein sequence
MTGPDSDLDG AARRSATDDG EGSAELRRAI AFLGWEATSA RVIETGYWAG VGVGLTCAVV 
VAFVSGPVPG LLSGLAGGLG ATHIAHRGPV WLASLRRTRA LGAAPGLVGR LVLRMRLDPS
TERAVRFAAR TGDGPLADGL ARHERANTDG PTSGLRAFAR EWQPWFPAID RAAALVRTAA
SAPPERRGRC LDRALDATIS GTTDRLAAFV GEVRSPVSAL YAFGVLLPLA LIALLPAAAA
TGVPIGPGVV AALYLGVLPA GLLAASAWLL SRRPVAFAPP SIDEDHPDVP ERGTHAAVAG
LGTGAIAAVV TARFVSGWAA PVAGVGVGAG VALFVAVHRR RAVLSNVRAV ERSLPDAMTV
IGGDVAEGVA VETAIANAGE RLDGATGELF ERAGRRSDTL RVDVREAFLG RGGPAVPVPS
PRVQGAVALL AVAGREGRPA GDVLLELADQ LEELRELEND ARRQLATVTG TLTNTAAVFA
PLVGGATVAL ATGIDAVDAG GLGAGATAGA DALGGAGGVG DPGTVGAEAS GTTSGAGAKS
DSASALSVPV LGQIVGAYVL ILAALLTALA TGLERGFDRT LVAYRVGIAL PTAAATYLVA
FLGAGLLL