Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1972 |
Symbol | |
ID | 8384266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1994926 |
End bp | 1996206 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973042 |
Product | Protein of unknown function DUF650 |
Protein accession | YP_003130873 |
Protein GI | 257053040 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.177811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCTTG AGGACTACAT CGAGGGATTC GAGCGCGACG AGGCCGCCGA GAAGCGACGC CTCGCCGAGG AGAAGTCCTA CGCGATCACC GATCACCTCG AAGACGTCGA GCGCCAGCTC GAAGAAACCC TGCAGGGTGA CGCGCTCTTT GGCTCGACCG CGCCCGAGAT CTTCGTCGGG CGGTCGGGCT ACCCGAACGT CTCCTCCGGC GTGCTCTCGC CGGTCGCCGA CGAGGGCGAC CCCACGGACT TTGCGACCAG CGGCCAGTGG TACGCCAACG GCCTGGGGAT CGAGGACGTC CTCCAGCGTC GGACGGGCCT GCTCAACTCC CAGCGCTCGG CGAAGGTGGA CGTGAATGAC GTCTGGGATG GGTTCGTCGG CACCCAGCGT GAAGTCGCCA TCGCCGACCG GCCCGTCGAC GTCGAGATCG GGCTGGACGG GACGCCCGAT TTCGACCTGA CCACTGACGA CATCTCCACG CCCACGGGCC CGCGGGCACG GGCGACCGAA GCCACACTCG CCGAGAATCC CCACGTCCCC CGTCCCGTCG AGAAGACCCT CGAAGACGAC GACTGGCGCG CCGAGGGCGC GATGACCTAT CTCTACCGGA AGGGCTTCGA CGTCTACGAC GTCAACACCA TCCTCTCGGC GGGCGCGCTG GGGCAAGGAG CCAACCGGCG ACTCGTCCCG ACACGGTGGT CGATCACCGC CGTCGACGAC ACGGTCGGGC AGTATCTGCA TGGCCAAATC CGCAACGCGA ACACCATCGA CGAGACCCAG GTCTGGTACA ACGAGTACAT GGGCAACCGC TACTGGATCA TCCTCACGCC CGGCGACTGG GAGTTCGAGC TCGTCGAGAT GAAAGCCCCC GAGAGCGTCT GGAATCCCCT CGGGGAGACC CACTACCTCG CCAGCGCCCA CGAGGGCTAC GAAGGGCGGA CGAGCTACGT CGAGGAGACC GCCGGGGCCT ATTACGCATC CCGACTCGGC GTCCTCGAAC ACCTCGTCGA CATCGATCGA CAGGCCAAGT GTCTCGTGCT CCGGGAGGTG ACCGATGACT ACTGGGCCCC GGTCGGCGTC TGGCAGGTCC GGGAAGGAGT CCGCAACGCC TTCGAGGATC CGGAGGGTCT GCCCGACGCG CTTTCGGGCC GATACGGCGA GGCCGGGAGT TTCCGCGATG CAGTGACCAG CGTGACCGAG CAACTGCCGG TGTCGCTGAC TGCGCTCCGT CGGAAGTCCG AGATGGTCGC CGGCCTCCAG GCGACGCTGT CGGACTTCTG A
|
Protein sequence | MRLEDYIEGF ERDEAAEKRR LAEEKSYAIT DHLEDVERQL EETLQGDALF GSTAPEIFVG RSGYPNVSSG VLSPVADEGD PTDFATSGQW YANGLGIEDV LQRRTGLLNS QRSAKVDVND VWDGFVGTQR EVAIADRPVD VEIGLDGTPD FDLTTDDIST PTGPRARATE ATLAENPHVP RPVEKTLEDD DWRAEGAMTY LYRKGFDVYD VNTILSAGAL GQGANRRLVP TRWSITAVDD TVGQYLHGQI RNANTIDETQ VWYNEYMGNR YWIILTPGDW EFELVEMKAP ESVWNPLGET HYLASAHEGY EGRTSYVEET AGAYYASRLG VLEHLVDIDR QAKCLVLREV TDDYWAPVGV WQVREGVRNA FEDPEGLPDA LSGRYGEAGS FRDAVTSVTE QLPVSLTALR RKSEMVAGLQ ATLSDF
|
| |