Gene Huta_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0103 
Symbol 
ID8382365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp100855 
End bp102336 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content62% 
IMG OID644971162 
ProductN-6 DNA methylase 
Protein accessionYP_003129025 
Protein GI257051192 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTGA CTCTCGACGA ACTGGACTCT CATCTCTTCA AGTGCGCAGA CATCATCCGT 
GACGCTGTTG ACTCGACAGA GTATAAGGAT TTCATCCTTC CGCTGGTGTA CTACAAGACG
ATCTCGGACA ACTTCGAGGT ACAGCGCGAG AAATACGTCG AAGAGTACGG GGATGAACAC
GCCAACCGGC CTAACATCTA CGACGTACCG TATGTCCCAG ACGGCTACCT GTGGGACGAC
CTACGAGCGG TCAACGAGAA CGTCGACGAA GCGATCAACG ACGCTTTCGA CGCCCTTCGT
GAGGCCAACG ACGGCGAGGT CGAGGGCGTG TTCCGGGCTG ACTACGTCGC CGAGGACGCC
CTTACCGACG ATCGACTCAC TCGTCTGATC GAGCACCTAA GCACCATCGA TCTCGATAAC
GACAGCGTCC CGCCGGATAT GCTCGGTGAG GCGTACATGG ACCTCGTGCG CCACTTCGCC
GAGGAGGAGG GCAAGTCCGG CGGTCAGTTC TTCACCCCGC CCCACATCGT CGAGTTGATG
GTCAGGCTGC TCGCCCCCTT CGAGGATGGC GATACGTTCC ACGACCCGAC TGTGGGATCA
GGCGGGATGC TCGTCGAGGC GGCCACCCAC TACCGTGACG AGCAGGGCGG CGATCCCTCG
AAGCTCACGT TCACGGGCCA GGAGATCAAT CCCGATATCG CCGCCATCGC AAAGATGAAC
CTCTCGATCC ACGGCCTCAG CGGGCGGATC GAGCGCGAGG ACTCTCTCCT GCGACCGCAG
TTCACCGAGA ACGGCGAACT GACCAAGTTC GACTACGTGC TCGCGAACTT CCCGTTCTCG
GCGGACTGGC AGAAGGACGA ACTCCAGGAC GACACCTACG GGCGCTTTGA CTGGCACGAG
AAGCTCCCCC GCGCCGACCG GGGCGACTAC GCGTTCATCA TGCACATGGC GGAACAGTTG
AACGAGACCG GCCAGGCGGC CATCGTCATC CCCCACGGCG TGCTGTTCCG CAAGCACGAG
TCCCGCTACC GGGAGCCGAT GCTGGAAAAC GATCTGGTCG AGGCCATCGT CGGCCTGCCC
GAGAACCTGT TCCAGAACAA CTCGATTCCC TCGGCCATCC TCTTGTTGAA CACCGACAAG
CCCGCCGAGC GCGAGGGCGA GGTGCAGTTC ATCCACGCCG CCGACGAGGC CTTCTATCGG
GAACTCTCGA ACCAGAACGA ACTCACCGAC GAGGGCGTGG CCCACGTCGT CGAGAACTTC
CGGGACTGGA CGACCGAGGA GCGCGTCAGT CGGACGGTGT CGATCGAGGA GATCCGGGAG
AACGACTACA ACCTCAACAT CGCGCTGTAC GTCGATACGA CCGAACCCGA GGAAGAGATC
GACGTGGCTG AGGAACTGGC GACGCTCCGG GAGTTGCAGG CCGAGCGCGA CGAGATCGAG
GCGCGGATGG ACCAGCACAT GGAGGCGCTG AACTATGAGT GA
 
Protein sequence
MSLTLDELDS HLFKCADIIR DAVDSTEYKD FILPLVYYKT ISDNFEVQRE KYVEEYGDEH 
ANRPNIYDVP YVPDGYLWDD LRAVNENVDE AINDAFDALR EANDGEVEGV FRADYVAEDA
LTDDRLTRLI EHLSTIDLDN DSVPPDMLGE AYMDLVRHFA EEEGKSGGQF FTPPHIVELM
VRLLAPFEDG DTFHDPTVGS GGMLVEAATH YRDEQGGDPS KLTFTGQEIN PDIAAIAKMN
LSIHGLSGRI EREDSLLRPQ FTENGELTKF DYVLANFPFS ADWQKDELQD DTYGRFDWHE
KLPRADRGDY AFIMHMAEQL NETGQAAIVI PHGVLFRKHE SRYREPMLEN DLVEAIVGLP
ENLFQNNSIP SAILLLNTDK PAEREGEVQF IHAADEAFYR ELSNQNELTD EGVAHVVENF
RDWTTEERVS RTVSIEEIRE NDYNLNIALY VDTTEPEEEI DVAEELATLR ELQAERDEIE
ARMDQHMEAL NYE