Gene Information |
Locus tag | Huta_2507 |
Symbol | |
ID | 8384809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2579092 |
End bp | 2580162 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973581 |
Product | Protein of unknown function DUF1119 |
Protein accession | YP_003131404 |
Protein GI | 257053571 |
COG category | [S] Function unknown |
COG ID | [COG3389] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
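The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 2579092
end_bp = 2580162

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1071, matches "Gene Length"
print(protein_length)  # 356, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields reported for Huta_2507.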
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0629064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGGT ACCGCGGACT CACGCTTTCG ATTTCGGTGA TCGTCTCGAT CTTCCTGTTC GTTCAACTCG GCGCGCTTGC GCTGGTCGAT CCGTTCAAGA CTGCCGGACT CCAGGCGGTC GAGGATCCCC AGAACCCGGT CAATAGCCTG CTGTACATCG CGGCGATCCT CGTGATGACC GGCGTGATGC TCGCCGCGTT CAAGTACGAG GTCCAGTGGG CGATCCGTGG CCTGATCGTC GCGACCGGCG CGTACATCGC CCTGCTCGTG TTCTCGATCC TGCTGCCGCC CGTCGTGACG CTGCCAGTCG GGGACGGCCT CCACGGGCTT GCGTGGGTCG GCGCGATCGG CCTCGGCGTC GCACTGTACG CCTATCCGGA GTGGTACGTC ATCGATGCCA CGGGTGCCGT CATGGGTGCG GGAGCGGCCG GCCTGTTCGG TATCACCTTC GGTGTGTTCC CGGCCCTTGT CTTGCTCTCC GTCCTGGCTG TCTACGATGC CATCAGCGTC TACGGCACCG AACACATGCT GACGATCGCT TCGGGCGTGA TGGATCTCAA AGTCCCCGTC GTACTCGTCG CGCCGATGTC CGTCGGCTAC TCCTTCCGGG AGGATACGGC AGGGCTCGAC GAGGAGTCCG ACAATGAGCA AGCGGATCCG ACTGCGGACG ACGCCACCAC TGAGCCGGAG GACACCGACG TAACTGCCGA ATCCGGATCA GCCGAGGCTG CTGAGGGCGA CAGCGCCGAT CCACTCGAAG ACCGTGAGGC GCTGTTCATC GGTCTCGGCG ACGCGATCAT TCCGACGGTG CTGGTCGCAT CAGCCGCATT CTTCGCGGAT GCGTCCGTTC CGACCGTCGA TATCGGCGCG TTCTCGGTCG CCGTGCCCGC AGCCACTGCC GTGGTCGGGA CGTTCCTCGG GCTGGCCGTG TTGTTACGGA TGGTTCTGGC CGGGCGCGCA CACGCCGGGC TCCCACTGTT GAACGGTGGG GCCATCGCGG GGTACCTCGT CGGGGCACTC GCCAGCGGGA TGACCCTCGT CGAGACGCTC GGACTGGGGC CGTATCTTTA G
|
Protein sequence | MARYRGLTLS ISVIVSIFLF VQLGALALVD PFKTAGLQAV EDPQNPVNSL LYIAAILVMT GVMLAAFKYE VQWAIRGLIV ATGAYIALLV FSILLPPVVT LPVGDGLHGL AWVGAIGLGV ALYAYPEWYV IDATGAVMGA GAAGLFGITF GVFPALVLLS VLAVYDAISV YGTEHMLTIA SGVMDLKVPV VLVAPMSVGY SFREDTAGLD EESDNEQADP TADDATTEPE DTDVTAESGS AEAAEGDSAD PLEDREALFI GLGDAIIPTV LVASAAFFAD ASVPTVDIGA FSVAVPAATA VVGTFLGLAV LLRMVLAGRA HAGLPLLNGG AIAGYLVGAL ASGMTLVETL GLGPYL
|
| |
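The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `unblock` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def unblock(seq_text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a short example.
example = unblock("ATGGCCCGGT ACCGCGGACT")
print(example)               # ATGGCCCGGTACCGCGGACT
print(gc_fraction(example))  # 0.7
```

Passing the complete gene sequence through the same two helpers should give a value near the 65% reported in the table, assuming that field is a plain G+C fraction over the whole gene.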