Gene Hlac_1581 details


Gene Information

Locus tag: Hlac_1581
Symbol:
ID: 7401514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1601454
End bp: 1602479
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 66%
IMG OID: 643708647
Product: protein of unknown function UPF0118
Protein accession: YP_002566237
Protein GI: 222480000
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
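
The coordinate, length, and GC fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(1601454, 1602479)
    print(length, protein_length_aa(length))
```

With the coordinates listed for Hlac_1581, the first two functions reproduce the record's 1026 bp gene length and 341 aa protein length. `gc_percent` can be applied to the gene sequence given under Sequence to compare against the listed GC content.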


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0769558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTCA ACCGGCGGCG ACTGCTGGCT ACACTGCTCG TCGCGGTGGC CGCGCTCGCG 
GCGGTCGTGT TGGCGGAAGT GCTGCGAACC GTCGTCTTCG CGGTCACGGT CGCGTACGTC
CTGTACCCGA TCTGCCAGTG GCTCGTCGGA CGAGGATTGT CCCGCCGGAT CGCGTGCGTC
GGCACCACCG TCATCGCCTT CGTTGCCGCC GCGATTCTCG TGGTTCCGTT GCTCTACGTG
CTGTATCGGC GGCGGGCGGA GCTGATCGAG ATCCTCGAGC AGATCCCCGA TGTCGTCCCG
ATCAGCGTGG GTGGGTTCGA GCTGGTCATC GAGATGGTGC CGTACGTGGC GGCCGCCGAG
GTGTGGGTTC GACAGGTCGC GCTCGCGCTC GCGGCGGCGG CGCCGCGGCT CGTACTCGAA
CTCGTCGTGT TCACGTTCCT CGTGTACGGG CTCCTCTATC GACCGGGTTC GGTCGAAGCC
GCCGTCTTCG GTGTGGTCCC GGCGGAGTAC CACGACATCC CGACGCGGCT CCACGAGCGG
ACGCGAGAGA CGCTGTACTC GATCTACGTC CTGCAGGCGG CGACGGCCGC GGGGACGTCC
GTGCTCGCAT TCGTGGTGTT TTGGGCCTTA GGGTACGGGT CACCGGTCTT GCTTGCCGTC
ATCGCCGGCG TCCTCCAGTT CATCCCCATT ATCGGCCCGA GCGTGCTCGT CGTCGCACTG
GCCGTGGGAG ACCTCCTCGT CGAGGAGACC GGGCGGGCGA TTGCCGTGCT AGTACTCGGT
CTCGTCCTCG TCTCCTTCGT CCCCGACGCG GTGATCCGGA CGCAGCTCGC CGACTGGACC
GGAAGGATCT CTCCGGGACT GTACTTCGTC GGATTCGTCG GCGGTATCCT CACTCTCGGC
GCGGTCGGAC TCATCGTCGG TCCCCTCGTC GTGTCGCTGT TGCTCGAAGT GATCGACATG
CTCTCCGAGC GCGACGTGCC GCCCGACCGG ATCGGGAAAG AAGAGAGGGC GGAGTCGACG
GACTGA
 
Protein sequence
MILNRRRLLA TLLVAVAALA AVVLAEVLRT VVFAVTVAYV LYPICQWLVG RGLSRRIACV 
GTTVIAFVAA AILVVPLLYV LYRRRAELIE ILEQIPDVVP ISVGGFELVI EMVPYVAAAE
VWVRQVALAL AAAAPRLVLE LVVFTFLVYG LLYRPGSVEA AVFGVVPAEY HDIPTRLHER
TRETLYSIYV LQAATAAGTS VLAFVVFWAL GYGSPVLLAV IAGVLQFIPI IGPSVLVVAL
AVGDLLVEET GRAIAVLVLG LVLVSFVPDA VIRTQLADWT GRISPGLYFV GFVGGILTLG
AVGLIVGPLV VSLLLEVIDM LSERDVPPDR IGKEERAEST D