Gene Hlac_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0940 
Symbol 
ID7401312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp938748 
End bp939785 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content63% 
IMG OID643708006 
Productprotein of unknown function UPF0118 
Protein accessionYP_002565608 
Protein GI222479371 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.726387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.96935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACA GATCCGCCCC CCCGACGTGG CTCGTCGGGA AACCAGGGCT GACCGCACTC 
GTCCTGCTGA GCAGTCTCCT CGCGTTGTTC GTCCTCCTGC CGTATCTCCA GTTCATTCTG
TTCGGCGTGG TTCTCGCATA CATCCTGTTT CCCGTCCAAC AACGAGCCGA GCAGCACGTC
AGGCCCACAA TCGCGGCGAT TGTCATTGTC TTGGGCGCGT TACTGTTCGT ACTGATCCCG
ATCATCTATC TCCTCACGAT CGCCGTCCAA CAGTCGCTCA GGGTCGTGAG TGCCGTCAGA
AACGGACAAA TCGACGTTGC GTCGATCGAA GAACTCCTCG AGAGTACCGG ATACCGCATC
GACCTCGTCG CGCTGTACGA ATCGAATCAG GAACGGATCG CAACAAGTCT CCAAGAGGTC
ACGTCAGGGG CGATCGACCT CGCCGGGAGT TTGCCAGGGC TGTTTATCGG ACTGACCATC
ACGCTGTTCG TCCTCTTCGC CCTGTTGCGC GACGGGGAAC AGCTCGTGGC GTGGGTCCAG
TGGGTGCTGC CGGTCGACGA GGACATCCTG GACGAACTCC GCGAGGGACT GGATCAGCTC
ATGTGGGCCT CTGTCGTCGG GAACGTCGCC GTCGCGGCCA TTCAGGCGGC GCTCCTCGGC
GTCGGGCTCG CGATCGCCGG CCTCCCCGCC GTGATCTTTC TCACGGTCGT TACGTTCGTG
CTGACGCTGC TCCCGCTCGT CGGCGCGTTC GGCGTCTGGG TCCCGGCTGC AATGTATCTC
CTCGCAGTCG GACGACCGAT TGCCAGCGCG GCGATAGCCG TGTACGGCCT GCTCGTTACC
TTCTCCGATA CGTACCTCCG ACCCGCGCTA ATCGGTCGGA CCGGCGCATT CAACTCCGCT
ATCATCGTCA TCGGCATCTT CGGCGGGCTC GTCGTATTCG GCGCCGTCGG CCTGTTCATC
GGCCCCGTCG TCCTCGGCGG CGCGAAACTC GTCCTCGATT GCTTCGCTCG GGAACACACC
GGAGAGCCGA CTGCTTGA
 
Protein sequence
MPDRSAPPTW LVGKPGLTAL VLLSSLLALF VLLPYLQFIL FGVVLAYILF PVQQRAEQHV 
RPTIAAIVIV LGALLFVLIP IIYLLTIAVQ QSLRVVSAVR NGQIDVASIE ELLESTGYRI
DLVALYESNQ ERIATSLQEV TSGAIDLAGS LPGLFIGLTI TLFVLFALLR DGEQLVAWVQ
WVLPVDEDIL DELREGLDQL MWASVVGNVA VAAIQAALLG VGLAIAGLPA VIFLTVVTFV
LTLLPLVGAF GVWVPAAMYL LAVGRPIASA AIAVYGLLVT FSDTYLRPAL IGRTGAFNSA
IIVIGIFGGL VVFGAVGLFI GPVVLGGAKL VLDCFAREHT GEPTA