Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0072 |
Symbol | |
ID | 8382332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 74566 |
End bp | 75906 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644971130 |
Product | transposase |
Protein accession | YP_003128994 |
Protein GI | 257051161 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.198369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCGTA CCAACACGTT CGCCGTGCGA CCGCTTTCCG ACAATGGAGA GCGACTGCTA CGGGACTTGT TGGACGCTTC CGCCGCTCTC TGGAACGAGG TTAACTACGG ACGCCTCATG CGGTACAACG ACGAAGGCGG CTACGAAAAC GAAGACGTGT GGGACGCCGA TACAGGCCGA CTCGAAGGCA AATACAAAGG CGTCCTCGGT GCGTCCACCG CCCAACAGGT GATACGGAAG AACTCCGAAG CATGGCGCGG ATTCTTCGAG AACAAGAAAG CGTATCACGA CGAGTCGGAT ACGTCCGTCA CTGAACACCC GGAACCACCG GGATTCTGGG GCAACAAAGA CGAGGGCCGC AAACTCCATA CCGTCATCCG CAACACGTCG TACACCGTCG AATGGGGCGA TCGCTCCCGA CTCGAAATAC TGGTCGGGAG TGAATTGAAA GACCGATACG ACCACACTGG GCGTCTCCGT CTGGAAATCA CTGGCGACCC GAACTGGCCC GAGTACGAGA AACAGGGTCG GTTAGACCTG TGGTACGACG AGACTGATAG CACGTTCAGG GCTTCGCAAC CCGTGACTGT TTCTGACGAG ATACGGGATA CTCCACTGGC CGATGAAAAG GCCGCTCTGG ACATTGGTGC AAACAATCTC GTCGCCTGTA CCACCACAAC CGGTAAGCAA TATCTGTACG AGGGCCGCGA GTTGTTCCAG CGATTCCGCG ACACGACGCG AGAAATCGCC CGGTTACAGT CGAAACTCGA TGAAGGGCGA TACAGTAGCA AGCGTATCCG GCGGCTGTAC CAGAAACGGA CTCGTCGCCG GGACCACGCT CAGGAAGCGT TGTGTCGTGA CCTGTTGGAA CGACTGTACG CCGAAGGCGT GGACACACTG TATATCGGCG GGTTGACCGA CGTACTGGAC ACGCATTGGT CGGTCGAGAC AAACGCCAAG ACCCATAACT TCTGGGCATT CAAGCAATTC ACCGAGCGAC TGGTCTGTAC TGCCGAAGAA TACGGTATCA CGGTAGAAGT CCGGTCAGAG GCGTGGACCA GTCAGGAATG CCCGCAGTGT GGTTCGACAG ACCGAACGAC ACGGCATCAG GACACACTCA CCTGTCCGTG TGGGTTCGAG GGGCACGCCG ACCTTACAGC GTCGGAAACA TTCCTAGAGC GGCACACAGA GGAAGCGGTC AGGCCGATGG CACGGCCCGT GCGGTTTGAG TGGGACGACC ACGACTGGTC GGAGTCACCA CGCTCTCACC GTCCCAAAGA ACAGCGCACA GACCCGAGTA CCGTCCACCG TGACGGGAAT GTTGCCTCCG GCGAGTCGTA G
|
Protein sequence | MKRTNTFAVR PLSDNGERLL RDLLDASAAL WNEVNYGRLM RYNDEGGYEN EDVWDADTGR LEGKYKGVLG ASTAQQVIRK NSEAWRGFFE NKKAYHDESD TSVTEHPEPP GFWGNKDEGR KLHTVIRNTS YTVEWGDRSR LEILVGSELK DRYDHTGRLR LEITGDPNWP EYEKQGRLDL WYDETDSTFR ASQPVTVSDE IRDTPLADEK AALDIGANNL VACTTTTGKQ YLYEGRELFQ RFRDTTREIA RLQSKLDEGR YSSKRIRRLY QKRTRRRDHA QEALCRDLLE RLYAEGVDTL YIGGLTDVLD THWSVETNAK THNFWAFKQF TERLVCTAEE YGITVEVRSE AWTSQECPQC GSTDRTTRHQ DTLTCPCGFE GHADLTASET FLERHTEEAV RPMARPVRFE WDDHDWSESP RSHRPKEQRT DPSTVHRDGN VASGES
|
| |