Gene Hlac_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1233 
Symbol 
ID7399501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1242927 
End bp1244582 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content64% 
IMG OID643708297 
Productmembrane protein-like protein 
Protein accessionYP_002565895 
Protein GI222479658 
COG category[S] Function unknown 
COG ID[COG3356] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000686662 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGCGG ACAACGTCGA CGTCTTCCAG CGGCTGGTGT TCAGCGTGCC GCCGCTCTCG 
CGACAGCTCC CCGCGATGGT GATTCTCGCT GTCGCCTACA GCGTCCTCGC GTTCGCCGCC
TTCTCGGCGT TCACGCCGCT GTCCCCGGAG CTGTCGACCC TCCTTCCGAT CGCGGTGCTG
CTCTTCTTGC TCCCTTTCCT GTTCGCCGGG GAGCTGTTCC ACCGCGTTCT CCCGGAGTAT
CCGCGGACGT GGAGCTTCTT TCTGGCGTTG CTCAACCAGT TCGTCCTGTT CGTCCACGCC
TTGGTTCTCT CCGGAGCCAA CGACGTGGGA AACGCGTGGA GCATTGTCTG GCTCCTGTTT
ATCACGATCT ACCTGATCAA CATCCTCGTG CTCGTCGTCT CGACGGGGAT CGACCGCTAC
AAGCGCATCC TCCTCGTCGC GCTCGCGGAG CCGGCGGCGC TGATCGTCGC GTTCTACGCG
TTCGCCGGCG GTGACCTCGG GTTCACGACG TACCGACACG TGTTCGCGTT CGCCTCGCTG
CTCATCGCGG CGGCGTTCCT CGTGTTGGTG CTCGCCATCG TCGACTACCT CATCAGGAGC
AACACCGACG TCTCCGCGTT CGAACTCACC TCCGGAATCC TGCAGAACGA CCGCGCCTCG
CTCGATCTCG GCATCGAGGC TGAACCCGCC GTCGAGACGC TCGCGATCGA CAACGGCGAC
CAGCTGACGC TCGCCGCGCC GTGGGTCCAC CCCGGCCCGC TCGGCGGGTT CGGCGGCGGC
CAGCTGAGCG GGAACGTGAT CGACGCCTTG AACGACGGCG ATGAAGGAGA TAGCGGGTTC
TTCCTCCACG TCCCGTGCAC GCACAAGGAG GACCTCTCGA ACCCGGGCGA CGCCGAGACG
ATCCTCGATG CCGTCGCCGA TCCCGGGCGC GTCGACCGCG CGTCGCGGCT GGTGAGCCAC
GACTACGGAG AGATCGAGTT CCACGGCCGA CGAATCGGCG ACAAGCAGGT GATCTTCCTT
CACGGCGAGG GGATCGACGA CTACGACACG GGCGTGTTCA TGAGCGATGT CGACGAGTCC
GAGGTGCTGC TCGTGGACCT CCACAAACAC GACCTCCAGA ACGGGCCCGA AAAAGAGGTG
CTGTACGGCT CGTCCGAGGC GGATCTGCTG AAACGGCACT TCGACGACTT CCGTGATCTC
CTCGACGACG CGCCGCTTTA CGACTACGCG GCCGGCTTCG CGGTTCGCCG CACTGATCAG
GATGTCGCGG CGATCGTCGA GTCGGTCGAC GGACAGGAGG TCCTGTTAAT GGGGATCGAC
ACCAACGGAA TCACGCCAGA CGTGCGGGAG CTAGCGGCCG ACTACCGCGA GTCGTTCGAT
GAGGTGCTCG TCTTCTCGAC GGACACCCAT GCGTCCATCC ACGAGCTCGC GAACACGACG
CGGTCGGACA CCGAGTCGCT GACCGAGGCG ATCGAACACG CCACCGACGC GGTGGCTCCC
GCGACGATCG GCCTGACCAG CCGGACAACC CGCCCGCTCA AGCTACTGAA AAACGACTAC
AACGGGCTCG TGTTCAGCGT CAACATCCTG ATCCGGTTGA CGGTCATCTC GCTCGCGATG
CTGTACGCGC TGCTCGTCAT CTGGCTGTTC TTCTGA
 
Protein sequence
MGADNVDVFQ RLVFSVPPLS RQLPAMVILA VAYSVLAFAA FSAFTPLSPE LSTLLPIAVL 
LFLLPFLFAG ELFHRVLPEY PRTWSFFLAL LNQFVLFVHA LVLSGANDVG NAWSIVWLLF
ITIYLINILV LVVSTGIDRY KRILLVALAE PAALIVAFYA FAGGDLGFTT YRHVFAFASL
LIAAAFLVLV LAIVDYLIRS NTDVSAFELT SGILQNDRAS LDLGIEAEPA VETLAIDNGD
QLTLAAPWVH PGPLGGFGGG QLSGNVIDAL NDGDEGDSGF FLHVPCTHKE DLSNPGDAET
ILDAVADPGR VDRASRLVSH DYGEIEFHGR RIGDKQVIFL HGEGIDDYDT GVFMSDVDES
EVLLVDLHKH DLQNGPEKEV LYGSSEADLL KRHFDDFRDL LDDAPLYDYA AGFAVRRTDQ
DVAAIVESVD GQEVLLMGID TNGITPDVRE LAADYRESFD EVLVFSTDTH ASIHELANTT
RSDTESLTEA IEHATDAVAP ATIGLTSRTT RPLKLLKNDY NGLVFSVNIL IRLTVISLAM
LYALLVIWLF F