Gene Hlac_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0906 
Symbol 
ID7401277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp897291 
End bp898640 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content68% 
IMG OID643707971 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_002565574 
Protein GI222479337 
COG category[S] Function unknown 
COG ID[COG3379] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0766913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTCT TCGACCGGCT GCGAGGGAAC GACACGCCTC GCGTGGCGTT CATCGGGATC 
GACGGGCTCC CCCACCGCCT CGTCGCCGAC AACCCGGACA CGTTTCCGAC GCTGTCGACG
ATCGCCGACA AGGGCGACGG GGGACCGATC GACAGCGTTG TGCCGCCCGA GTCGAGCGCG
TGCTGGCCGG CGCTCACAAG TGGTGTGAAC CCCGGCGAGA CCGGTGTGTA CGGCTTTCAG
GACCGAGAGG TCGGCTCGTA CGACACCTAC GTCCCGATGG GCCGCGACGT GCAGGCGACG
CGAGTGTGGG ACCGAGTGAC CGAGGCGGGC CTCAACGCGA CGGTGATGAA CGTCCCCGTC
ACGTTCCCGC CGCAGCGAAC GGTCCAGCGC ATGGTCTCCG GCTACCTCTC GCCCGACGTG
GACAAGGCCG CTCACCCGGA GGAGCTCCGG AAGTACCTCA CCGAGAGCGA CTACAGGCTG
TCGGTCAACG CGAAGCTCGG CCACCGGAAG GACAAAGCAG AGTTCATCGA GCAGGCCCGC
CAGACGCTCG ACGCCCGCGC GGAGGCGTTC TCCCGCTACG TCGAGATGGA CGACTGGGAC
CTGTTCGTCG GCGTGTTCTC CACCCCGGAC CGGATCAACC ACTTCCTGTG GGGCGACTAC
GAGGACGGCG GCCCGTATCG CGAGGACATG CTCGCGTTCT ACGCCGCCCT CGACGAACAC
ATCGGGAACA TCCGGAAAGC GCTCCCGAAC GACGTTCGGC TCGTCGTCGG CTCCACGCAC
GGGTTTGCGC GACTCCGATA CGACGTGTAC TGCAACGAGT GGCTCGAACG CGAGGGGTGG
CTCTCGTACG AAGGCGGCGA CGACCACGGA TCGCTGTCCG ACATCGCCGG CGACGCCCGC
GCGTACTCGC TGGTCCCCGG ACGCTTCTAC CTCAACGTGG AGGGCCGCGA GCCCGACGGC
GTCGTCCCCG AGTCGGAGTA CGAGGCGGTC CGCGAGGAGC TCCGGGCCGA GCTCGAAGCG
TGGGAAGGGC CGAACGGGAA CCCCGTCGCG AAACGGGTCG TCGAACGCGA GACGGTGTTC
CGCGGCGACC ACGACGCTAT CGCCCCCGAC CTCGTGGTGA TCCCGCACGA AGGGTTCGAC
CTCAAGTCGG GATTCCGCCC CCACGATGCG GTGTTCGATC CCGACGGCCC CCGAACCGGC
ATGCACACGT TCGAGGACGC CGCCCTGTTC ATCGATCACC CCGATGCAAA GGTCGAAGAC
GCGGACTTAC TCGACGTTGC TCCGACCCTC CTGCGCCTGC TCGACGTCGA CTACGGCCGG
ACCGACCTCG ACGGCGCGAG TCTCATCTGA
 
Protein sequence
MGLFDRLRGN DTPRVAFIGI DGLPHRLVAD NPDTFPTLST IADKGDGGPI DSVVPPESSA 
CWPALTSGVN PGETGVYGFQ DREVGSYDTY VPMGRDVQAT RVWDRVTEAG LNATVMNVPV
TFPPQRTVQR MVSGYLSPDV DKAAHPEELR KYLTESDYRL SVNAKLGHRK DKAEFIEQAR
QTLDARAEAF SRYVEMDDWD LFVGVFSTPD RINHFLWGDY EDGGPYREDM LAFYAALDEH
IGNIRKALPN DVRLVVGSTH GFARLRYDVY CNEWLEREGW LSYEGGDDHG SLSDIAGDAR
AYSLVPGRFY LNVEGREPDG VVPESEYEAV REELRAELEA WEGPNGNPVA KRVVERETVF
RGDHDAIAPD LVVIPHEGFD LKSGFRPHDA VFDPDGPRTG MHTFEDAALF IDHPDAKVED
ADLLDVAPTL LRLLDVDYGR TDLDGASLI