Gene Huta_2241 details

Gene Information

Locus tag: Huta_2241
Symbol:
ID: 8384535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2288076
End bp: 2289629
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 68%
IMG OID: 644973310
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_003131141
Protein GI: 257053308
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
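
The coordinates and lengths in the record above are mutually consistent, which can be checked with a few lines of arithmetic. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2288076
end_bp = 2289629

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1554 517
```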


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGCA CCCGGACCGT CGTCGTGGGG CTCGACGGCG CGAGCTGGCG GCTGCTCGAC 
CCCTGGATCG ATGCCGGCGA CCTCCCGAAC CTCGCCGCAC TCCGGGATTC GGGCTCGTGG
GCTGAGACCG AGAGTTGCCT CCCACCGGTA ACCTTCCCGA ACTGGAAGTG CTACTCGTCG
GGGAAAGACC CCGGTGGGTT CGGCGTCTTC TGGTTCGAGC ACGTCGACCT CGACGAAGGC
ACGATCACCG TCGCCGACGG CAGCGACTAC CACACCGCTG AACTCTGGGA CTACCTCGCG
GCCGACGGCC GATCGACGGG CGTCGTCAAC ATGCCGACGA TGTACCCGCC ACGAGGAATC
GACGGCGCGT CGATCGTCGC CGGCGGTCCC GACGCTGTCG AGGGCGAGTA CCGCTCGATC
TCCGGCGGCT ACACCCACCC GCCGGAACTC GAATCCGAGA TCGAATCGCG CTTCGACTAC
CAGGTCCATC CGGACCCCCT GCTCTCCAGC AACGACGAAC GTGGGGCCGA GGTCGAGGCG
ATCCTCGATG TCCTCGAGAT GCGCTTCGAG GTCGCGCTGT GGCTGCTCGA GGAGCGCGAC
CGTGATTTCG TCCACGTCAC GCTGTTCTAT CTCAACGTCC TCCATCACTT CTTCTGGGAC
GCGGAGCCGA CCCACCGCGC GTGGCAACTC GTCGACGAAT ATCTCGGTCG CCTGGCGGAC
ATCGACGACC TCAACGTCGT CCTCATGTCC GACCACGGCA GCGCCCCCAC CACGACGGAG
TTCTACGTCA ACGAGTGGCT CGCCGAGCAC GGCTATCAGG CCCGGACGGC GACGGTCGAC
GATACCCTGC GGCGGATCGG CCTCGATCGG GAGACCGCGC TCGGCGTGGC CAAGCGCCTC
GGGATCGTCG ACGTGCTCGC CACGGTCGTC CCCGAACGCC TCCAGGCACT CGTGCCCCAA
CAGGCGGGGC TCAAGCGCGA TCGCAAGCTC GAAGCGATCG AACTCGACCG GACGAAGGCC
GTCGCGAGCG GGCAGGGACC GATCTATCTC AACCCCGCCT TCGACGACGC ATCGGTCCGC
GAGTCGCTGA TGGCCGATCT CCGGGCAGTC GAGGACAGCG AGGGGCCGCT GTTCGACGGC
GTCTATCGTG GCGAAGACGT CTATTCCGGC CCCTACGTCG AGGATGCCCC GGAGATCGTC
CTCGACATGC GCCCCGGCGT CCACGTCAAC GACGGCGTCG GCGGCGGCGA GATCACGGCC
GGGCCGGACC GCTGGGCGGC CGAGAACACC CGCCACGGGA TCTTCCTCGC GAACGGCCCG
GACTTCACCG CCAGCGGGCA ACTCGACCGG ATCAGCATTC TCGACATGGC CCCGACGCTG
CTGGTCGCGG CCGGGTGTGA CGTCCCCCGC GACATGACCG GTGACGTCCT CCCGATCGTC
GCCGGCGATC CCGACTGGGG CCGGCGTGAT CCGATCCGGA TCGACGAGGG GGGACGCGGT
GAGGCCGGCG AGGAAGTCGC CGACCGCCTC CAGCAACTCG GGTACATGGA GTGA
 
Protein sequence
MTGTRTVVVG LDGASWRLLD PWIDAGDLPN LAALRDSGSW AETESCLPPV TFPNWKCYSS 
GKDPGGFGVF WFEHVDLDEG TITVADGSDY HTAELWDYLA ADGRSTGVVN MPTMYPPRGI
DGASIVAGGP DAVEGEYRSI SGGYTHPPEL ESEIESRFDY QVHPDPLLSS NDERGAEVEA
ILDVLEMRFE VALWLLEERD RDFVHVTLFY LNVLHHFFWD AEPTHRAWQL VDEYLGRLAD
IDDLNVVLMS DHGSAPTTTE FYVNEWLAEH GYQARTATVD DTLRRIGLDR ETALGVAKRL
GIVDVLATVV PERLQALVPQ QAGLKRDRKL EAIELDRTKA VASGQGPIYL NPAFDDASVR
ESLMADLRAV EDSEGPLFDG VYRGEDVYSG PYVEDAPEIV LDMRPGVHVN DGVGGGEITA
GPDRWAAENT RHGIFLANGP DFTASGQLDR ISILDMAPTL LVAAGCDVPR DMTGDVLPIV
AGDPDWGRRD PIRIDEGGRG EAGEEVADRL QQLGYME
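
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. The sketch below is not the database's own tooling. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares for elongation (table 11 differs only in which codons may act as start codons). Only the first row and the last 24 bases of the gene sequence are used, so the GC value printed here describes that first row alone, not the whole gene.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First row and final 24 bases of the gene sequence, copied from the record.
first_row = "ATGACCGGCA CCCGGACCGT CGTCGTGGGG CTCGACGGCG CGAGCTGGCG GCTGCTCGAC"
last_codons = "CAGCAACTCG GGTACATGGA GTGA"

print(translate(first_row))    # MTGTRTVVVGLDGASWRLLD
print(translate(last_codons))  # QQLGYME
print(gc_fraction(first_row))  # 0.75
```

The two translated fragments match the first 20 and last 7 residues of the protein sequence listed above, and the final codon TGA is read as a stop.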