Gene Hlac_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2036 
Symbol 
ID7402055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2028381 
End bp2030072 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content74% 
IMG OID643709107 
Productprotein of unknown function DUF181 
Protein accessionYP_002566684 
Protein GI222480447 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0289934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.387863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCG GACTTGTCGG TGACGGCCCC GCGGTCGAGG CCGTCTCGGC CGCGCTCGGC 
GACGTCGACG TGAACGTGAT GCCCGTCGAA GTGGGGCTGC TCGACGGGTT CGACCTCGCG
GTCGTGGTCG ACACCGCCGG ATCGGAGACG TTCGCCGCCG CCAACGAGCA CCTGGACCGC
TGGGTCGCAA TCGAGGTCGG CGGACTCGGC GGCGTCCCCC TGGAGCCTGT CGACGCCGCG
GTGACCGCCT TCGACGAGAC CTGCTACGAC TGCCTCCGGC GACGCGTCGC CAGCGGCGGA
GCCGAGCCCG CCGACGCGCC GGCCGGCCGC CGGTCGGCCG TGCGCTACGC CGGCGCGCTG
GCCGGGCGCC GCGTCATCCA ACTGCTCGCC GGCGACCCGG TCGCCGACAC GGTCGTCGAG
GTACCGGGTG CCGAGCGCAC CCTCCTCCCG GTTCCCGGCT GTGACTGCGG CGCCGACCCG
GGCGACGCGC TGCCCCGCGA GCATCAGGAG CGATCGCTAT CGGACGCGAT CGACCGCGCG
GAGCGGGCGG TCGATCCCCG AATTGGTCTG CTCTCGGAGG TGGGCGAACA GGAGTCGTTC
CCCGTGCCGT ACTACGTCGC GCGGGTCGCC GACACGACGC CCTTCTCCGA CGCGCGGGCG
GCCGAGTTCG GCGGCGGCGT CGACGCCGGG TGGGACGCCG CGTTCATGAA GGCGCTCGGC
GAGGGGTTAG AGCGGTACGC CGCCGGAGTC TACCGCGGGG CGTCGTTCAC GCGCGCGCCG
GCCGCGAACG TCCCGAATCC CGTGGCGCCC GACGCGTTCG TCCGTCCCGA GAGCGCGGAG
CCGTACGCCC GCGACGACCG ACTTCCGTGG GTGACCGGCG AGCATCTCGG AACCGGGGAG
GTCGCGAGTC TGCCGGCGGA GTTCGTCCAC TTCCCGCCGC CGGAGAACCG GTATCGGCCC
GCGATCACGA CCGGGCTCGG CCTCGGAAAC TCCGGACCGC ACGCGGCGCT GTCGGGACTC
TACGAGGCGA TCGAGCGCGA CGCGACGATG ACAAGTTGGT ACTCGACGGC CGACCCGCTC
GGAATCGAGG TCGACGACGA GGGGTTCGCG GAACTGGAGA AACGCGCCCG CACCGAGTCG
CTGTCGGTGA CCCCGCTCCT CGTCACGACC GACATCGACG TGCCCGTCGT CGCGGTCGGC
GTGCATCGCG AGGGCGACTG GCCCCGCTTC GCGGCGGGGT CGGGCGCCGA CCTCGATCCC
GCCGCCGCGG CCCGGAGCGC GCTCGCGGAG GCGCTCCAGA ACTGGACCGA ACTCCGGTCG
ATGGGTCCCG ACGCGGCCGC AAAGCAGGGT GCGGCGATCG GTCGCCACGC CGACTTCCCG
GCGGAGACGC GGGCGTTCTT CGACGCGGAC GCGACGGTCC CCGCCGAATC GCTCGGCGAG
CCGGAGCTGT CGGGAGGCGA CGAGCTAGCC GCCGTCGTCG ACCGCGCCGA GGCGGTCGGA
CTGGAGCCGT ACGTCGCGCG GACGACGACC CGAGACCTCG CCGCGCTCGG CTTCGAGGCG
GTCCGCGTGC TCGTTCCCGG CGCGCAGCCG CTGTTCACCG GCGACCCGTT CTTCGGCGAC
CGCGCCCGCG ACGTGCCCCG ATCGATGGGG TTCGAGCCCG ACTTGGAGAA GGCGTACCAC
CCGTTCCCCT GA
 
Protein sequence
MDIGLVGDGP AVEAVSAALG DVDVNVMPVE VGLLDGFDLA VVVDTAGSET FAAANEHLDR 
WVAIEVGGLG GVPLEPVDAA VTAFDETCYD CLRRRVASGG AEPADAPAGR RSAVRYAGAL
AGRRVIQLLA GDPVADTVVE VPGAERTLLP VPGCDCGADP GDALPREHQE RSLSDAIDRA
ERAVDPRIGL LSEVGEQESF PVPYYVARVA DTTPFSDARA AEFGGGVDAG WDAAFMKALG
EGLERYAAGV YRGASFTRAP AANVPNPVAP DAFVRPESAE PYARDDRLPW VTGEHLGTGE
VASLPAEFVH FPPPENRYRP AITTGLGLGN SGPHAALSGL YEAIERDATM TSWYSTADPL
GIEVDDEGFA ELEKRARTES LSVTPLLVTT DIDVPVVAVG VHREGDWPRF AAGSGADLDP
AAAARSALAE ALQNWTELRS MGPDAAAKQG AAIGRHADFP AETRAFFDAD ATVPAESLGE
PELSGGDELA AVVDRAEAVG LEPYVARTTT RDLAALGFEA VRVLVPGAQP LFTGDPFFGD
RARDVPRSMG FEPDLEKAYH PFP