Gene Huta_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2202 
Symbol 
ID8384496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2254862 
End bp2256115 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content58% 
IMG OID644973271 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003131102 
Protein GI257053269 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTACA ACTACAGGTA TCGGCTCTAT CCGACGGACG ACCAACGGGA GGCGTTAGCG 
TGGACGCTCG ATACCTGTAG ACAGGTCTAC AACCACTTCC TGAACCGGCT CAACGAAGCC
GACGACGTGC CTTCGGAATA CGCCCAAAAG AACACACTCC CTGACCTGAA ACGCGAATGG
TCCGACCTGA AGCAGATCCA TTCGAAGGTG CTTCAGGTCG TCGTAGAACG GCTCTACAAC
AACCTTCGTA CACTCTCAGG GCAAAAGGAG AACGGGTACA ACGTCGGTGC GCTTCGTTGG
AAGGGCGCGG GGTGGTACAA GTCGTTCACC TACAGCCAAA GCGGGTTCAA GCTCATTGGA
ACCGACACCC GACGGGATCG GCTTCGACTG AGCAAGATCG GTGAGATACC AATCGCGTAC
CACCGCGAGA TTCCCGAGAA CGCGACCATC AAGCAGGTCT GCATCAAACG GAACGCTTCG
GGGAAATGGT ACGCGACGTT CGGCATTGAG ATCGACGAAC AACCCGAAAA ACCCGCCCCC
GAAACCATCG ACCCCGAAGA TGCTGTCGGT ATCGACGTGG GTATCCTGAA GTACGCTCAC
GACACCGACG GGACCGCCGT GGAATCGTTG GACCTCTCGG ACGAACGTGA GCGCCTACGA
CGAGAACAGC GGAAGCTCTC GCGAAAAGAG AAGAGGTCGA ACAACTACGA AAAACAACGG
ATGGTCGTCG CCCGCTGGCA CGACCAGATT GCGAACAAAC GCCGCGACTT CCTGCACAAG
CTCGCCCACT ACTACGTCGA GACCTACGAC GTGGTGGCCG TCGAGGACCT GAACGTTCGC
GGCATGATGG AGCAAGACCG AAACAGTCGA AACACAGCAC ATTCCGCGTG GCGAACCTTC
ATCGAGATAC TGCGATACAA GGCTGAGAGC GCCGGTACGC ACCTCGTTGA AGTCAACCCA
CGTGGAACTA CCAAGGAGTG TAGCAACTGT GGCGTTGAAA CCGAGAAACC CCTGTGGGTG
CGCGAGCACT CATGTCCGTC GTGCGGATAC GAAGACGATA GGGACGCCAA CGCCGCGAAG
AACATCCTTC AGCGTGCTTT TTCTGAATTA GGCATGGGAC AGGCCGAATC CGCGCCCCTG
GAGACTGCGA CCGCTACGGA TACCCGTGTG GTATCTGCAA GTCGCGTCAT CGAACGGGGA
AGCCCCGCCC TCAACGAGCG AGGTCGTCAG ACCGAGCGCA GTAGGACGGG GTAG
 
Protein sequence
MNYNYRYRLY PTDDQREALA WTLDTCRQVY NHFLNRLNEA DDVPSEYAQK NTLPDLKREW 
SDLKQIHSKV LQVVVERLYN NLRTLSGQKE NGYNVGALRW KGAGWYKSFT YSQSGFKLIG
TDTRRDRLRL SKIGEIPIAY HREIPENATI KQVCIKRNAS GKWYATFGIE IDEQPEKPAP
ETIDPEDAVG IDVGILKYAH DTDGTAVESL DLSDERERLR REQRKLSRKE KRSNNYEKQR
MVVARWHDQI ANKRRDFLHK LAHYYVETYD VVAVEDLNVR GMMEQDRNSR NTAHSAWRTF
IEILRYKAES AGTHLVEVNP RGTTKECSNC GVETEKPLWV REHSCPSCGY EDDRDANAAK
NILQRAFSEL GMGQAESAPL ETATATDTRV VSASRVIERG SPALNERGRQ TERSRTG