Gene Hlac_0455 details


Gene Information

Locus tag: Hlac_0455
Symbol:
ID: 7401073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 473038
End bp: 474165
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 69%
IMG OID: 643707519
Product: Protein of unknown function DUF373
Protein accession: YP_002565127
Protein GI: 222478890
COG category: [S] Function unknown
COG ID: [COG2237] Predicted membrane protein
TIGRFAM ID:
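
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(473038, 474165)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both values agree with the Gene Length (1128 bp) and Protein Length (375 aa) listed in the record.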


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.669189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACGC TGGTCATCTG TATCGACCGC TCCGGGGCGA TCGGGCGGGC CACCAACGTC 
CCGATGCCGG TCGCCGGCTG GGAGGCGGTC CGCTCGCTCG TCACCGACGC CGGGCTCAGC
GACCCGGAGG ACGCCAGCGT CAACTGTCTG CTCGAATCGC TGCGGGTCGC TCGCGACCTC
CGCGACGAGC GCGAGGAGGC GGTCGTCGCG GTCGTGTCGG CCGAAAGCGA CACCGCGGTC
GGCGCGGATC GCTCGATCGC CGCCCAACTC GACGACCTCG TGGACCGATA CGACCCCCGG
GCCGCGATCG TCGTCGTCGA CTCCGCCGAG GACGAGCAGG TGCTCCCGAT CGTGGAGTCG
CGGATCCCGG TCGATTCGGT CGACCGCGTG GTGGTCCGGC AGGCCCGCGA TATCGAGTCG
ACCTACTACT TGCTCAAGCA GTTCCTCGCC GACGAGCAGC TCCGGTCGAC GATTCTCGTT
CCGTTCGGGG TCGCGCTCCT GTTGGTGCCC GCGCTGTTCT ACTGGTTCTC CGCCGGCGAG
GCGATCGCGG GCGTCGCCGG GCTGCTCGGG GCCGCGCTCC TGTATAAAGG CCTCGCGATC
GACCGGTTCG TCGCGGGGAT GCCCGAACGG ATCCGCGAGG CGCTGTACGC CGGTCAGGTG
TCGGTGGTGA CGTACGTCGT CGCCGGCGGG CTCGCGCTGG TCGGCGGCTT CTTCGGCGTG
CTCTCCGCGT CGGCCCTGAG CGACGCCCCG GCGCTGGTCG AGGCGGTCGA GTTCACATAC
GCCGCGGTCC CGTGGTTCGC GGTCGCAGGT GTGACCGCAG CGGTCGGACG ACTGCTCGAC
GAGCTGATCC GCGGCGAGGG AATCCGAACG CCGTACCTCA ACCTCCCGTT TGTCATCGCC
GCCGTTGCCT TGGTTGTCCG CGGATTCGCC GGCTACTTCC TCGCGCAGGA GGCGATCCAC
GAGCCGTTCG AGGTGGGTGG CCTCGCCGTG AGTCCGGTCC AGCAGCTCGC GGCGTTCATC
GTCGTCGGTA TCGTGGTCTC GCTTATGGGA GTCAAGCTCG CGAGCGACGT GGGTACGGAA
ACGCTGGAGG ACGTGATCGA CGCGGATCGG GAGACGGACG GGAAGTGA
 
Protein sequence
MTTLVICIDR SGAIGRATNV PMPVAGWEAV RSLVTDAGLS DPEDASVNCL LESLRVARDL 
RDEREEAVVA VVSAESDTAV GADRSIAAQL DDLVDRYDPR AAIVVVDSAE DEQVLPIVES
RIPVDSVDRV VVRQARDIES TYYLLKQFLA DEQLRSTILV PFGVALLLVP ALFYWFSAGE
AIAGVAGLLG AALLYKGLAI DRFVAGMPER IREALYAGQV SVVTYVVAGG LALVGGFFGV
LSASALSDAP ALVEAVEFTY AAVPWFAVAG VTAAVGRLLD ELIRGEGIRT PYLNLPFVIA
AVALVVRGFA GYFLAQEAIH EPFEVGGLAV SPVQQLAAFI VVGIVVSLMG VKLASDVGTE
TLEDVIDADR ETDGK