Gene Huta_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0072 
Symbol 
ID8382332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp74566 
End bp75906 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content58% 
IMG OID644971130 
Producttransposase 
Protein accessionYP_003128994 
Protein GI257051161 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.198369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGCGTA CCAACACGTT CGCCGTGCGA CCGCTTTCCG ACAATGGAGA GCGACTGCTA 
CGGGACTTGT TGGACGCTTC CGCCGCTCTC TGGAACGAGG TTAACTACGG ACGCCTCATG
CGGTACAACG ACGAAGGCGG CTACGAAAAC GAAGACGTGT GGGACGCCGA TACAGGCCGA
CTCGAAGGCA AATACAAAGG CGTCCTCGGT GCGTCCACCG CCCAACAGGT GATACGGAAG
AACTCCGAAG CATGGCGCGG ATTCTTCGAG AACAAGAAAG CGTATCACGA CGAGTCGGAT
ACGTCCGTCA CTGAACACCC GGAACCACCG GGATTCTGGG GCAACAAAGA CGAGGGCCGC
AAACTCCATA CCGTCATCCG CAACACGTCG TACACCGTCG AATGGGGCGA TCGCTCCCGA
CTCGAAATAC TGGTCGGGAG TGAATTGAAA GACCGATACG ACCACACTGG GCGTCTCCGT
CTGGAAATCA CTGGCGACCC GAACTGGCCC GAGTACGAGA AACAGGGTCG GTTAGACCTG
TGGTACGACG AGACTGATAG CACGTTCAGG GCTTCGCAAC CCGTGACTGT TTCTGACGAG
ATACGGGATA CTCCACTGGC CGATGAAAAG GCCGCTCTGG ACATTGGTGC AAACAATCTC
GTCGCCTGTA CCACCACAAC CGGTAAGCAA TATCTGTACG AGGGCCGCGA GTTGTTCCAG
CGATTCCGCG ACACGACGCG AGAAATCGCC CGGTTACAGT CGAAACTCGA TGAAGGGCGA
TACAGTAGCA AGCGTATCCG GCGGCTGTAC CAGAAACGGA CTCGTCGCCG GGACCACGCT
CAGGAAGCGT TGTGTCGTGA CCTGTTGGAA CGACTGTACG CCGAAGGCGT GGACACACTG
TATATCGGCG GGTTGACCGA CGTACTGGAC ACGCATTGGT CGGTCGAGAC AAACGCCAAG
ACCCATAACT TCTGGGCATT CAAGCAATTC ACCGAGCGAC TGGTCTGTAC TGCCGAAGAA
TACGGTATCA CGGTAGAAGT CCGGTCAGAG GCGTGGACCA GTCAGGAATG CCCGCAGTGT
GGTTCGACAG ACCGAACGAC ACGGCATCAG GACACACTCA CCTGTCCGTG TGGGTTCGAG
GGGCACGCCG ACCTTACAGC GTCGGAAACA TTCCTAGAGC GGCACACAGA GGAAGCGGTC
AGGCCGATGG CACGGCCCGT GCGGTTTGAG TGGGACGACC ACGACTGGTC GGAGTCACCA
CGCTCTCACC GTCCCAAAGA ACAGCGCACA GACCCGAGTA CCGTCCACCG TGACGGGAAT
GTTGCCTCCG GCGAGTCGTA G
 
Protein sequence
MKRTNTFAVR PLSDNGERLL RDLLDASAAL WNEVNYGRLM RYNDEGGYEN EDVWDADTGR 
LEGKYKGVLG ASTAQQVIRK NSEAWRGFFE NKKAYHDESD TSVTEHPEPP GFWGNKDEGR
KLHTVIRNTS YTVEWGDRSR LEILVGSELK DRYDHTGRLR LEITGDPNWP EYEKQGRLDL
WYDETDSTFR ASQPVTVSDE IRDTPLADEK AALDIGANNL VACTTTTGKQ YLYEGRELFQ
RFRDTTREIA RLQSKLDEGR YSSKRIRRLY QKRTRRRDHA QEALCRDLLE RLYAEGVDTL
YIGGLTDVLD THWSVETNAK THNFWAFKQF TERLVCTAEE YGITVEVRSE AWTSQECPQC
GSTDRTTRHQ DTLTCPCGFE GHADLTASET FLERHTEEAV RPMARPVRFE WDDHDWSESP
RSHRPKEQRT DPSTVHRDGN VASGES