Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2389 |
Symbol | |
ID | 8384688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2435500 |
End bp | 2437023 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973462 |
Product | hypothetical protein |
Protein accession | YP_003131288 |
Protein GI | 257053455 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.573673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGTG ATGATACGGA CGAACCGACG GGAGAATCGA CTACCAGTGC CACGACCACC GATTCCGGGG GGCGATCACG CGATCGTCCG TCCGTCTCGG CCCAAACCCG TCGGCGGTTC CTCCTGACGG GGGCCGGCGT AGGGTTGGGT GCGCTCGCAC TCAACGCGAG CGGGCCGGCC TCGGCGGCGA CGGTCGAGGA GGTCTGTAAC TCCGACGACT ACGGGTCGAT CGACGTCGCC GACGGGTTCA CCCTGGTGGA CAACCAGTGG GGGAACTCAA ACGCCGATCA GTGTGTCTGG CTCAACGACG ACGGCAGTTA CGGCTATGAC TTCGACGCCG CGGGTGGCAG TGGGATCAAC TATCCCGAGG TGATCTGTGG GACGAAACCC TGGGGGACTG ACACCGGGGT GGCGGAGTTC CCGATTCGGC GTCGGGACGT TGACGAACTC GTTATCGACG TCGAGGCCGA GTACTCCGAG TCCGGCGGCG AGTGGGACTG GGCCGAGGAG TGGTGGCTGA TGGACCAGCC ACCGAGCCAG GAGACCGGGA CCCACCAGTA CGAGATCATG CTTCTCCTGG ACTGGAACGA CCAGCACGAC CACGGCGCGG TCGAGGCCGA GAACGTCTGG ACCGACCGAT TCGGCAACAC CGTCGATCAT TGGACGACCT ACAACTCCGG CGGAACGAAT GCGACGTTCT ACATCTTCCG AATCCAGGGC GGTCACGACG GCGGCCGGAT CGACCTGACC GAGATCGTCG ACTATCTGAC CGCGGAACAC GGCGTCGACG AGAGTCTCTG GCTCTCGGGT GTCGAACTCG GCAACGAGTA CTGGGAGGGC AGTTCCGGCG AGACTACCTA CAACACCTTC GACGTCACGA TCAACGGGTC GACCTACGAA AGCGGGAGCG GGACCGACAC GCCGACCCCG ACTGAGACGC CGACTCCGAC TGAGACGCCG ACCCCGACCG AGACGCCGAC GGACACTGAA ACCGAGACGC CGACGGACAC TGAAACCGAG ACGCCGACAG ACACTGAAAC CGAGACGCCG ACAGACACTG AAACCGAGAC GGAGACGCCG TCGGGTGACG CGCTCGTCGT CAACGATTAC GACGGCGATC CGGCGTGGTC GAGCAATCGC AACGATCTCG GGCAGTGGTG CGGGGCCGGC TCCTTCGAGA ACGGGAGCGG CGACGTGCAG GACGGCGCAC TCGTTCTGGA GTACGACAAC GCCGGCTGGT TCCAGGAGCA AATCAATCAA GACCTTTCGG GGTATTCGGA CCTCGTCTTC GTCCTCAGCG GCGCCGATGG CGGCGAAGAA GACGACTTCC TGCTTGACGT CGGTGGCGCT CGCGGGCTTC TCTCGGCGTT CAGCGACGAT GCCATCGGGA CGTCGGCTTC GACGGTCACC GTCGACATGG AGTCGGCCGG CATCGATCCG TCAGCCGGGG GACTGTCGGT CCGATTGAAC TTCTGGCAGG GCGGCAGTGG CACGCTCGAA ATCGACGAGA TCCGATTCGA ATAG
|
Protein sequence | MTRDDTDEPT GESTTSATTT DSGGRSRDRP SVSAQTRRRF LLTGAGVGLG ALALNASGPA SAATVEEVCN SDDYGSIDVA DGFTLVDNQW GNSNADQCVW LNDDGSYGYD FDAAGGSGIN YPEVICGTKP WGTDTGVAEF PIRRRDVDEL VIDVEAEYSE SGGEWDWAEE WWLMDQPPSQ ETGTHQYEIM LLLDWNDQHD HGAVEAENVW TDRFGNTVDH WTTYNSGGTN ATFYIFRIQG GHDGGRIDLT EIVDYLTAEH GVDESLWLSG VELGNEYWEG SSGETTYNTF DVTINGSTYE SGSGTDTPTP TETPTPTETP TPTETPTDTE TETPTDTETE TPTDTETETP TDTETETETP SGDALVVNDY DGDPAWSSNR NDLGQWCGAG SFENGSGDVQ DGALVLEYDN AGWFQEQINQ DLSGYSDLVF VLSGADGGEE DDFLLDVGGA RGLLSAFSDD AIGTSASTVT VDMESAGIDP SAGGLSVRLN FWQGGSGTLE IDEIRFE
|
| |