Gene Huta_0102 details

Gene Information

Locus tag: Huta_0102
Symbol:
ID: 8382364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 99534
End bp: 100862
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 51%
IMG OID: 644971161
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003129024
Protein GI: 257051191
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
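
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Under those assumptions the listed values agree with each other.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(99534, 100862)
print(length)                           # 1329
print(expected_protein_length(length))  # 442
```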


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAGG AGGCGACGCT GGACGAGTTC GTAGATGAGC AGGAAGCAGG AGGAAATCAT 
TCTGGAGACG TTAGTGTTGG GGATTTACAG CAATTCGAAT CTTCCCCGAT TGAGTCATGG
AATCTTGTTA GGCTGGGTGA GATTCTAACC TTAGAGTACG GTGATAATCT TCCATCAGAT
AGTCGAGAAA GTGGAACCGT ACCTGTTTTT GGCTCTAATG GCCAGGTAGA CACGCATTCT
GAGGCCGCTG TAGAGAAACC AGGCATCATA TTGGGGCGAA AGGGTTCAAT TGGTGAGATT
GATTTCAGCG ATAGACCGTT TTGGCCTATC GATACGACAT ACTATATCAC AAGCGAGGAG
ACGAGCCAAA ACCTGCGTTT CCTGTATTAC CTCCTTCAGA ACATCCAACT GGAACGGTTA
AACGCTGCAT CTGCCATACC TGGATTAAAC CGAAATGATG CGTACGGCCT GAAAGCACTC
ATGCCTCCGG CCGAAGAACA GCGCAAAATC GCCAGCGTGC TCTATACCGT CGATCAGGCG
ATTCAGAAGA GCGAAGAGAT AATCGAGCAA ACTGAACGGG TCCGTCGTGG TACTGAACAA
GATGTCCTTT CGAGGGGCGT TCGTGAAGAT GGGACGCTCA GGCCCGACGA CGATGTCGCA
TATCGAAGCA GTTGGGTCGG CGACATTCCC TGTGACTGGG ATGTCAAACA GTACAGCAAA
CTGATTTCAG ATTCCTCCGT CGGTATCGTC GTTAAGCCTT CCCAGTATTA CGACGACGAC
GGAACAGTCC CGATTCTTCG CTCGAAAGAT ATCTCCAGAG ATGGCATCGT TGATGGGGAT
TTCGAGTATA TGTCGGAAGA GTCGAACGCC GAAAATGAAA ACAGCCGATT GCAGGAAGGT
GACGTAATAA CGGTGAGGTC GGGGGACCCC GGCCTTTCTT GCGTCGTCGA CGGTGAATTT
GATGGGGCAA ACTGTGCAGA TTTACTCATT TCCACGCCGG GACCGAAATT GGACCCCCAC
TACGCCGCTA TGTGGATTAA TTCCTTTGCA GGGAGAAAGC AGATCGACCG GTTTCAGGCT
GGTCTGGCAC AGAAGCACTT CAACCTCGGG GCCCTCCGTA AGCTTCGAGT CGGAGTGCCA
TCGCTCGATG AACAGAAGCG GATCGTCGAA AAGGTGTCAT CAATATCAGA ATCTCTCGAA
AGTCAGAGAG AGTCCAAAAG GCAACTCCAG CGCCTCAAAC AGGGCCTCAT GCAAGACCTC
CTCTCGGGCA AGGTCCGCAC CCACGACACA GACATCGAGA TCGTAGACGA CGCACTCCAG
CATGGCTAA
 
Protein sequence
MSEEATLDEF VDEQEAGGNH SGDVSVGDLQ QFESSPIESW NLVRLGEILT LEYGDNLPSD 
SRESGTVPVF GSNGQVDTHS EAAVEKPGII LGRKGSIGEI DFSDRPFWPI DTTYYITSEE
TSQNLRFLYY LLQNIQLERL NAASAIPGLN RNDAYGLKAL MPPAEEQRKI ASVLYTVDQA
IQKSEEIIEQ TERVRRGTEQ DVLSRGVRED GTLRPDDDVA YRSSWVGDIP CDWDVKQYSK
LISDSSVGIV VKPSQYYDDD GTVPILRSKD ISRDGIVDGD FEYMSEESNA ENENSRLQEG
DVITVRSGDP GLSCVVDGEF DGANCADLLI STPGPKLDPH YAAMWINSFA GRKQIDRFQA
GLAQKHFNLG ALRKLRVGVP SLDEQKRIVE KVSSISESLE SQRESKRQLQ RLKQGLMQDL
LSGKVRTHDT DIEIVDDALQ HG
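
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two listings relate, the Python below strips that whitespace, translates codons, and computes a GC fraction. The helper names are hypothetical, and the code assumes an unambiguous A/C/G/T sequence. It uses the standard codon assignments, which translation table 11 shares with the standard code for amino acids (the two differ in permitted start codons), so this is not a full implementation of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGAGTGAGG AGGCGACGCT GGACGAGTTC"))  # MSEEATLDEF
print(translate("CATGGCTAA"))                         # HG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content fields.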