Gene Hlac_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3035 
Symbol 
ID7399010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp293065 
End bp294849 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content65% 
IMG OID643706843 
Product5'-Nucleotidase domain protein 
Protein accessionYP_002564465 
Protein GI222475944 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGC ATGAAACGGA TATCGGCGAG TGGCGGTCCC TCGGCGGCGC CCCTGTCGGA 
GACGGTAACG ATGACGCCGA CGCCGTCTTC GCTCACGTCA GCGACCTCCA CGGGCAGCTG
ACGCCGCGCT ACCAGGTCTA CTACGACAAT CCGACGTCGA CGCCGGACTT TAATTTCGGA
GACGACGATC GCGTCGTCGA GCGCGGCGGC GGGATCCCCC TGCTCGCAGC GAAACTCGAC
GAACTCCGCG AGGACTACGA CGTGTGTACG CTCATGAGCG GCGACACGTT CCACGGCTCC
GCCGTGACCA CCTACACCGA TGGGCGAGCG ATGCTCGATC CCGTCAACGA CCACGTCGCG
CCCGACATCT ATGTCCCGGG GAACTGGGAC TACTCGAACG AGGCCGCCGA GGACGGCAAC
TTCGTGGAGT TGATGGACGA CCTCGACGCC CCGATTCTCG CGAACAACCT CTACGACTGG
GAGACCGACG AGCGACTGTA CGACGCGTAC CGGATCCTCG ACATCGGCGG ACTCTCCGTG
GGGGTCGTCG GGATGACGAA CGTCTACGTC GATCGGATGG CACCCGCGTT CTCCGAGGGG
AAGTACCGCT TCGGTAAACA CCCCACACTC CTCGAGGAGT CCGCACAGGC CGCCCGCGAG
GACGGCGCGG ACGTCGTGGT CGCGGTCACC GAGATCGGCC TCCCGTGGAT GGTCCAAGCC
GCCAAGGACT GTGCGAGCGT GGACGTGATG TTCAGCGCGC ACACCCACGA GTACACCTAC
GATCCGATCG TCGTCGAGGA GACCGAAACC GTGGTCGTCG AGTCCGGGAT GGGTGAGGCG
ATCGGCCGTG TGGACCTCCG CGTTCGGGAC GGGGAGATAC AGTTCCGTCA CCACCTCTAC
TGTCTGACCG AGGACGGCGA GCACACGCCG GAACCGGACG CCGATGCGGC GGAGACAGTC
GAAGCCGTGC GTGCGCCCTT CTTCGAGGCC GATCCGGGAT TCGAGCGAGG GGCTGGCACG
CTCGACCGTC CGCTGGATGC GGTCGTCGGT CGGACGGAAG AACCGCTCTA CCGGCAGTCC
TTCCTTGAGA GCGCGTGGAA CGCGCTGTTC AACGACGCAC TCCGTGCACA CTTCGGCACC
GACCTCGCCG TCTCGCACGG GTTCCGGTAC GGGACTGCCA TCCCACCCGG CGACATCACG
CTCGGCGAAC TCTACACGTT CTTCCCGATG ACGACGCCCG TCGCTCGTGG CGTCGCCTAC
GGCCAGCAAC TCACGAACCA CATGGAGGAG TTCCTCGGGG ACAACTTCAC ACCGTACCCC
TACGACCAGG AGGACGGCCG CGTCCGCAAC TTCTCCTCGA ACGTCGAGGT GACCCTCGAT
CCGACCGCGA AGCGTGGCCG CCGCCTCGTC GAACTGCGAA TCGACGGCGA GCCGGTCGAC
CCGGAGGAGA CGTACTCGGT GGCGACGTTC CGCCGACCCG GTGATCCCGA ACGCGACCTC
GGAAACTGCG GGTTCCCGTT CCGGGACGTC GAGGTCGACG ACGATACGAT ACCTGTCGAC
GTCATCGTCG AGTATCTCGA GGAACACTCG CCCGTCGACT ACGAGGTGAT GGGGCTAGTC
GAGACCGCCG AGGATGGCGG CCGAGTCCAG AACACGCCCG CAGACGGGCC GTATCCGTTT
ATCCAGCCCG GCGTCGACTA CGCGGCCGGC GAGGCGTACT GCGAGACGTC CATGATACCG
CGCAGGAACA CGTTTCCCGA TGCAGGGCGT AATCGAACGC GCTAG
 
Protein sequence
MNEHETDIGE WRSLGGAPVG DGNDDADAVF AHVSDLHGQL TPRYQVYYDN PTSTPDFNFG 
DDDRVVERGG GIPLLAAKLD ELREDYDVCT LMSGDTFHGS AVTTYTDGRA MLDPVNDHVA
PDIYVPGNWD YSNEAAEDGN FVELMDDLDA PILANNLYDW ETDERLYDAY RILDIGGLSV
GVVGMTNVYV DRMAPAFSEG KYRFGKHPTL LEESAQAARE DGADVVVAVT EIGLPWMVQA
AKDCASVDVM FSAHTHEYTY DPIVVEETET VVVESGMGEA IGRVDLRVRD GEIQFRHHLY
CLTEDGEHTP EPDADAAETV EAVRAPFFEA DPGFERGAGT LDRPLDAVVG RTEEPLYRQS
FLESAWNALF NDALRAHFGT DLAVSHGFRY GTAIPPGDIT LGELYTFFPM TTPVARGVAY
GQQLTNHMEE FLGDNFTPYP YDQEDGRVRN FSSNVEVTLD PTAKRGRRLV ELRIDGEPVD
PEETYSVATF RRPGDPERDL GNCGFPFRDV EVDDDTIPVD VIVEYLEEHS PVDYEVMGLV
ETAEDGGRVQ NTPADGPYPF IQPGVDYAAG EAYCETSMIP RRNTFPDAGR NRTR