Gene Huta_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1723 
Symbol 
ID8384009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1732649 
End bp1733872 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content49% 
IMG OID644972790 
ProductHNH endonuclease 
Protein accessionYP_003130629 
Protein GI257052796 
COG category[V] Defense mechanisms 
COG ID[COG3440] Predicted restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGAAG AATCTGTGGT CTCGCCGGAG CGTTGTGCGA CAATCCGACG TGAATACTCT 
CAGCAAACCG GGTTTGATCC ATTGGTGGAG GAGCTAGATA TCGAGAAACA GGAACTCCGC
CACCACCTAT ACGGAGACTG TAGTCACGAT ATCGCAATTG AACCAATCGA CCCGCCGGTC
AGTCATCAAT TAGATGCTGA TCAGTGCCAG GAGATCCGAA ACCTATTCGC AGATGGTTTC
GATACGGAGA CCTTAGAGCA ACGGTTCGAG ACACGATGGC GGCCAATTGC TCGTCATCTC
ACCGGTGAGT GCTCACATAA TAACGATGCG CCCACGGTGG CTCGGAGCGA AATCAGTGAT
CGGGAGCCGA TTTCGGAAAC TGACTGTGCC GTGCTCCGCG AGCGGTTCTT CGACGACGAA
GAACGAAGCA TTATGGATGT TGCACGGGAT GTTCGGTGGA GCTATGAGGC CGTTGTTCAA
CACGTCAACG GAAACTGTTC TCACGATATT ACTACGAGCT CTCGATCAAC TGAGGAACGG
GGAGGTAATC TAACCAAAGA GGATTGCCAG AACGTTCGAG AACTATGGGC TCAGGATCCC
GAAATGACAC TCGAAAAAGT TGCATCGGAG ATCGAAAGAT CAGAAGCGAC CGTTGAAAAG
CATATCAAAC GGGCTTGTTC TCATTCTTCG GATGAATTGT TGATCGACGA AATGCAAATA
TTTGACTCAA TATTGACAGA CGAGGATGAG CAGGTTAGCG ATTCGCAGGC TATACTAGAT
GCCGCTAATT CTTCGAATAT TGACTCTGAA GAGTTCGTAG ACGACGTGAT TACCCCGGAT
TCAGTCGAAA CGACTATTAG TCGGACAGTC CGCAACACGA CACTCGTCAA AGAATTAAAA
GGAGCATACG ATTACGAGTG TCAGGTCTGC GATAGTCCCC GGTATCAGGG TCCAGATAAA
CGCTACGCAG AGGGACATCA TATCAAGCCG CTGGGTGAGC CGCATAACGG ACCAGACACG
CCAAGTAATA TCTTGGTTCT ATGTCCGAAT CATCATGCAG ACTTTGATTA CGGTTTGATA
GAGATTGATC CCGGGACTTA TGAGATACAT CATGAATATG ATGATACTGT TCACGGTAGT
ACTCTGACTG TCGATGGGGA ACACGATTTG GACCCTGAGA AATTAAGCTA CCATAGTCAA
CGGATCTCCG AAGTCACCCG CTGA
 
Protein sequence
MMEESVVSPE RCATIRREYS QQTGFDPLVE ELDIEKQELR HHLYGDCSHD IAIEPIDPPV 
SHQLDADQCQ EIRNLFADGF DTETLEQRFE TRWRPIARHL TGECSHNNDA PTVARSEISD
REPISETDCA VLRERFFDDE ERSIMDVARD VRWSYEAVVQ HVNGNCSHDI TTSSRSTEER
GGNLTKEDCQ NVRELWAQDP EMTLEKVASE IERSEATVEK HIKRACSHSS DELLIDEMQI
FDSILTDEDE QVSDSQAILD AANSSNIDSE EFVDDVITPD SVETTISRTV RNTTLVKELK
GAYDYECQVC DSPRYQGPDK RYAEGHHIKP LGEPHNGPDT PSNILVLCPN HHADFDYGLI
EIDPGTYEIH HEYDDTVHGS TLTVDGEHDL DPEKLSYHSQ RISEVTR