Gene Hlac_1107 details


Gene Information

Locus tag: Hlac_1107
Symbol:
ID: 7400916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1113411
End bp: 1114757
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 64%
IMG OID: 643708173
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_002565772
Protein GI: 222479535
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
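
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1113411
END_BP = 1114757
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1347
    print(protein_length(GENE_LENGTH_BP))  # 448
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.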


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTGT TCGATCGACT GCGCGGCGAC GACGACGAGC GCGTGGTCTT CCTCGGTATC 
GACGGTGTAC CGTACGACCT GGTTCAGAAT CACCCCGATG TCTTCGAGAA CCTGACCGAC
ATCGCCGAGA CGGGGTCGGC GGGTCGACTA GAGAGCATCG TACCCCCCGA GTCGAGCGCG
TGCTGGCCGA GTCTCACGAC GGGGAAAAAT CCCGGCTCGA CCGGCGTGTA CGGCTTCCAG
GACCGCGAGA TCGACTCCTA CGAGACGTAC GTCCCGATGG GGAAACACGT GTCGGCGACG
CGGCTGTGGG ACCGCGTCAC CGACGACGGC CGCGACGCGA CCGTACTCAA CGTTCCCGTC
ACGTTCCCGC CGTCGAGCCG GATCCAGCGG CAGGTCTCCG GGTTCCTCTC ACCCTCAATC
GATGCGGCCT CGAGCGACGA CTCGGTCCGA CAGGTCCTCG AGGACCACGA CTACCGGATC
GATGTGAACG CGAAGCTCGG ACACGACGAC GACAAGACCG AGTTCATCGA GAACGCGCAC
GCGACGCTCG ACGCCCGCCA CGACGTGTTC ACTCACTACC TCGATCAGGA CGACTGGGAC
CTCTTCTTCG GCGTCTTCAT GAGCACCGAC CGGGTCAACC ACTTCCTGTT CGGCGACTAC
GCGAACGACG GCGAGTACAA GGACGACTTC CTCGACTTCT ACCGGACCCT CGACGGGTAT
ATCGGTGAGA TCCGCGACGC GCTCGACGAC GACACGACCC TGATCGTCGC CTCCGACCAC
GGCTTCACCG AGTTGGTGTG GGAGGTGAAC TGCAACCAGT TCCTCGCCGA CGAGGGGTGG
CTCTCGTACG ACGGCGACGA CCACGACTCG CTTGCCGACA TCGACGACGA GGCCCGGGCG
TACTCGCTCA TCCCCGGCCG CTTTTACCTC AATCTGGAGG GTCGCGAGCC GGAGGGTGTC
GTCCCCGAAT CGGAGTACGA GGCGGTCCGC GAGGAGCTCC GCACCGACCT CGAATCGCTC
ACCGGCCCCG ACGGCCGGCA GGTGTGCAAG CGGATCGTGG ACGGCGAGAC CGTCTTCGAC
GGCGACCACG ACGAGATCGC GCCCGACCTC GTGGTCATCC CCGCGGACGG CTTCGACCTG
AAGTCCGGCT TCGGCGGGAA GAAGTCCGTC TTCACCGAAG GACCACGTAA CGGGATGCAC
AAGTTCGAGA ATTCGCTGCT GTACTCGACC GACTCCGACC TCGACATCGA GGGCTCGAAT
CTCTTCGACG TGACCCCGAC GATCCTCGAT CTGATGGATG TCGAACACGA CGGCGACTTC
GACGGCGATA GTCTCCTAGG CGCGTAA
 
Protein sequence
MGLFDRLRGD DDERVVFLGI DGVPYDLVQN HPDVFENLTD IAETGSAGRL ESIVPPESSA 
CWPSLTTGKN PGSTGVYGFQ DREIDSYETY VPMGKHVSAT RLWDRVTDDG RDATVLNVPV
TFPPSSRIQR QVSGFLSPSI DAASSDDSVR QVLEDHDYRI DVNAKLGHDD DKTEFIENAH
ATLDARHDVF THYLDQDDWD LFFGVFMSTD RVNHFLFGDY ANDGEYKDDF LDFYRTLDGY
IGEIRDALDD DTTLIVASDH GFTELVWEVN CNQFLADEGW LSYDGDDHDS LADIDDEARA
YSLIPGRFYL NLEGREPEGV VPESEYEAVR EELRTDLESL TGPDGRQVCK RIVDGETVFD
GDHDEIAPDL VVIPADGFDL KSGFGGKKSV FTEGPRNGMH KFENSLLYST DSDLDIEGSN
LFDVTPTILD LMDVEHDGDF DGDSLLGA
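
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced the record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as examples.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases and final 27 bases of the gene sequence above.
    print(translate("ATGGGACTGT TCGATCGACT GCGCGGCGAC"))  # MGLFDRLRGD
    print(translate("GACGGCGATA GTCTCCTAGG CGCGTAA"))     # DGDSLLGA
```

Both excerpts reproduce the corresponding ends of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.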