Gene Hlac_2094 details

Gene Information

Locus tag: Hlac_2094
Symbol:
ID: 7400614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2084172
End bp: 2085218
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 58%
IMG OID: 643709164
Product: hypothetical protein
Protein accession: YP_002566741
Protein GI: 222480504
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
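
The coordinate and length fields in this record agree with one another if the start and end positions are 1-based and inclusive and the gene length counts the stop codon. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2084172, 2085218)
print(length_bp, protein_length(length_bp))  # prints "1047 348"
```

Under these assumptions the result matches the listed Gene Length (1047 bp) and Protein Length (348 aa).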


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0787184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.014964
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCCT CCCTGAGGGA CCGTCACGGT TCGGTCGCAT GGATCCTCGT CGGGGTCCTG 
ATCACAGCGG TGGTGGCGTT CGTTCTCTAC TCGTTCGTCG GTGCCATCGT GGTCGGCATC
TTTCTCTACT ATGCGACACG GCCGATCTAC CGATGGATAG ACCAATGGAC CGAACATCCG
GACCTCAGTG CGACGGTTAC GCTCCTGACC GTCGGCCTGC CGATTCTCCT CATCCTCGCG
TATGCCACAT TCGTTGGGAT CCGCGAGATT GACCAGTTTT TTGCAATCGC AAATCTCGAA
CAGCTCAGGA CGGTGTTAGA ACCCTATGTT GACCTCGTGT CGGGGTCTGA AGAACAAGGG
CTATTCGGCA TCCTTCGTGA CAACGTCTCT AGGGCGCGGG GGTTCGCCAG TTCAGCCACG
GTCTGGTTGT TACGTCTCTT CGTCAGCTTC ACGCTTGCAT TCTACCTCTT GCGGGACGAT
TACAAAATCG CACAGTGGTT CCGACGGAGC TTCGAGCATC AGCCCGCCGC CGTGACCTTC
GTCGAACAGG TAGACGCCGA CCTCACGACG ATCTACACGG GGAACCTCAT TACGATCGGA
GCGAGTGGCC TCCTCGCCGT CAGTACCTAC TACGTCTTGG ACATCATCGC TCCGGCCGGG
ACGGGCGTCC AATTCCCGTT CCTCCTGGGA CTCCTGACTG GAGTGGCCAC ACTCATTCCG
GCCATCGGTA TGAAACTCAT CTACTACCCC TACACCGGCT ACCTCGTCTG GCAAGTCGTG
TCTAAAGGCG AGGGTTCACT CTGGTTCCCG GTCGTCTTTT TCCTCGTAAC GGTTGTTGTC
GTCGACGTCA TTCCCGACTT CTTCATCCGG TCGTACGTCT CGAAGGGCGA ACTCAACATG
GGCCTGCTAC TCCTGACGTA CGTTCTCGGT GTGGTGGCAT TCGGCTGGTA CGGGGTATTC
TTCGCACCGA TTGTACTGGT CGTCTTCATC CACTTCGTGC GGGACATCCT CCCAGTGCTT
CTCGGATCGG ACGCGACGAC ACGGTGA
 
Protein sequence
MVASLRDRHG SVAWILVGVL ITAVVAFVLY SFVGAIVVGI FLYYATRPIY RWIDQWTEHP 
DLSATVTLLT VGLPILLILA YATFVGIREI DQFFAIANLE QLRTVLEPYV DLVSGSEEQG
LFGILRDNVS RARGFASSAT VWLLRLFVSF TLAFYLLRDD YKIAQWFRRS FEHQPAAVTF
VEQVDADLTT IYTGNLITIG ASGLLAVSTY YVLDIIAPAG TGVQFPFLLG LLTGVATLIP
AIGMKLIYYP YTGYLVWQVV SKGEGSLWFP VVFFLVTVVV VDVIPDFFIR SYVSKGELNM
GLLLLTYVLG VVAFGWYGVF FAPIVLVVFI HFVRDILPVL LGSDATTR
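
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is a plain-Python illustration, not the pipeline that produced this record: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Table 11 differs only in the start codons it permits, and this sketch does not handle alternative starts, which is adequate here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, in the record's 10-base grouping.
print(translate("ATGGTGGCCT CCCTGAGGGA CCGTCACGGT"))  # prints "MVASLRDRHG"
```

Because whitespace is stripped before translation, the sequence can be pasted in the 10-base blocks shown in this record.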