Gene Information |
Locus tag | Huta_1548 |
Symbol | |
ID | 8383827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1522100 |
End bp | 1523506 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644972610 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003130456 |
Protein GI | 257052623 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
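The coordinate and length fields in this record are related by simple arithmetic. The sketch below shows that relationship under two assumptions: that Start bp and End bp are 1-based and inclusive (which is consistent with the reported gene length), and that the last codon of this unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1522100, 1523506)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1407 468
```

Both results agree with the Gene Length and Protein Length fields listed for Huta_1548.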
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATAT CTGTGCTTAC AGCAGTGGCA GAGGCCAGTG CGTACCTGTA TACCGTACCG ATCGTCGGCA TCGAGCTTTC AAAAACAGGT GTCACGGCGA TCGGCATCCT CATGATCCTG TTTCTCATCG TCGGATCGGG CTTTTTCTCC TCCTCCGAGA TCGCGATGTT CTCGCTGGGG ACCCACCGGA TCGACCCGAT GGTCGAACAG GGGCTCCGTG GGGCGAAAGC GATCAAGTCA CTCAAGGAGG ACCCCCACCG GTTGCTCGTG ACGATCCTGG TCGGGAACAA CATGGTCAAC ATCACGATGT CCTCGATCTC GACGACCATC GTGGGTTTCT ACTTCGATCC GGGGACGGCA GTCCTCGTCT CGTCGTTCGG GATCACGTCA CTGGTGTTGA TATTCGGTGA GACGGCACCC AAATCCTACG CCGTCGACAA CACCGAGTTA CATGCACGCC GCGTGGCTCC AGTACTGCAG TTCGTCGAGA AACTGCTGTG GCCGCTGATC ACCCTCTTTC ACTACGTGAC CCAGTTCGTC AACAAACTCA CGGGCGGCGG GCCGGCCATC GAGTCGTCGT ACCTCAGCCG GTCGGAGATC CGGGAGATGA TCCAGACCGG CGAGCGCGAG GGAGTCCTCG ACGAGGAAGA GCGACAGATG CTCCAGCGGA CCCTCCGGTT CAACCGGACG ATCGCCAAGG AGGTCATGAC GCCGCGCCTG GACATGGACG CCATCTCGGC CGACTCGTCG GTCGAAGAGG CGATCGCGGA GTGTGTCCAC AGCGGCCACA CCCGGCTGCC GGTCTACGAG GGTGGTCTCG ACAACGTCAT CGGGGTCGTC AACATCCGTG ATCTCGTCCG TGACGCCCAG TACGGCGGGA CAGACGATGT CGAGCTTCAA GACCTCATCG AGCCGACGCT GCACGTCCCC GAAAGCAAGA ACGTCGACGA TCTCCTGACG GAGATGCGGA GCGAACGCCT CCACATGGTG ATCGTCATCG ACGAGTTCGG CACCACAGAG GGACTCGTCA CCATGGAGGA CCTCACCGAG GAGATCGTCG GCGAGATCCT CGAAGGCGAA GAGGAACACC CGATCGAATT CGTCAACGAC GACACCGTGA CGGTCAAAGG GGAAGTCAAC ATCGAGGAAG TCAACGAAGC GCTGTCGATC GACCTCCCGG AGGGCGAGGA GTTCGAGACC ATCGCCGGGT TCATCTTCAA CCGAGCGGGT CGGCTCGTCG AGGAAGGCGA ATCCATCGAG TACGAGGGGA TTCAGATCCG TGTCGAGCAA GTCGAGAACA CCCGGATCAT GAAAGCCCGG ATCACACGAC CGGAAGAGGG AGCGACACTC GAATCGGAAG CCGAAGGCGG GGACGACGAC CACGAGAGTG ACACGAACGA CGCCTGA
|
Protein sequence | MGISVLTAVA EASAYLYTVP IVGIELSKTG VTAIGILMIL FLIVGSGFFS SSEIAMFSLG THRIDPMVEQ GLRGAKAIKS LKEDPHRLLV TILVGNNMVN ITMSSISTTI VGFYFDPGTA VLVSSFGITS LVLIFGETAP KSYAVDNTEL HARRVAPVLQ FVEKLLWPLI TLFHYVTQFV NKLTGGGPAI ESSYLSRSEI REMIQTGERE GVLDEEERQM LQRTLRFNRT IAKEVMTPRL DMDAISADSS VEEAIAECVH SGHTRLPVYE GGLDNVIGVV NIRDLVRDAQ YGGTDDVELQ DLIEPTLHVP ESKNVDDLLT EMRSERLHMV IVIDEFGTTE GLVTMEDLTE EIVGEILEGE EEHPIEFVND DTVTVKGEVN IEEVNEALSI DLPEGEEFET IAGFIFNRAG RLVEEGESIE YEGIQIRVEQ VENTRIMKAR ITRPEEGATL ESEAEGGDDD HESDTNDA
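The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may serve as starts), strips the spacing between 10-character blocks, and translates until the first stop codon. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. No special start-codon handling is included, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGGGATAT CTGTGCTTAC AGCAGTGGCA"
print(translate(first_blocks))  # MGISVLTAVA
```

Passing the full Gene sequence field to `translate` and `gc_percent` would allow comparison with the Protein sequence and GC content fields of the record.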
|