Gene Information |
Locus tag | Huta_2451 |
Symbol | |
ID | 8384753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2527715 |
End bp | 2528695 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644973527 |
Product | Rhomboid family protein |
Protein accession | YP_003131350 |
Protein GI | 257053517 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
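The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 2527715
end_bp = 2528695

gene_length = end_bp - start_bp + 1   # inclusive interval length in bp
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length)     # 981
print(protein_length)  # 326
```

Both values agree with the "Gene Length" and "Protein Length" fields in the record.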
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCT GTGACGTGTG CGGCACGGAG ATGAATATGC CGTATCACTG CAATCACTGC GGGGGAACGT TCTGCTCGGA ACATCGCCTC CCGGAGAACC ACGACTGTCC GGGACTGGAC GACTGGGGTG ACCCGGAAGG GGTCTTCGAC AGCGGGTTCG ACGACAGCGT CTCGGCGGGC AACCAGTCCC GCTCGTCGAA GGGTCTCCTC GAACGGATCG GTATCGACAC TGGGCCAGGT GGTCCGCTGG CGTACTTCCG CGGGAACATG ACCTACGTCT TCCTCGGGCT GATGTGGATT ACCTTTCTCT TTCAGTTTCT CGTTGCCACC GTGGCGATTG GGACGCCCAA CCTTCAGGTT GCAACGCTTT CTTCGGAGTT GTATCGATCG ATTTTCGTCC TGGCTCCACA GCATCCCGAA TACGTCTGGA CGTGGTTCAC GTCGGTACTT TCACACGGGG GATTCGCACA CATTGCATTC AACAGTATCG TGATCTTCTT CTTTGGACGG CTAGTCGAGG ACTACATCGG CTCGCGGGAC TTCACGCTCC TGTTCCTCTC TAGCGGGGCG CTTGCCGGGC TTGGACAGGT CCTCATTCAA CTCTATCAGG GTCTACCGTC CGCAGCGGCG GTCGGGTACT TCCCTGGTGG AGTCGTCGGC GCGTCGGGTG CAGCCATTGC GATCATGGGC GTCTTGACCA TCCTCAACCC CAGTCTCCGA GTGTACGTGT ACTTCATTTT TCCCGTCCCG ATCTGGCTGG TGACGATCGG CCTGGTCGCG ATGAACGTCC TCGGGATGTT CGGTGCGGGC GGCCAGGGCG TTGCCAACGC CGCTCACCTG ATCGGGCTCG CGATCGGTCT CGCCTACGGC CAGCACGTCC GCGATCGGAT CCGGGTCCCG AACCAGCTCC AACTCGGCGG CGGTCGCGGC CCTGGCGGCC CGGGCGGCCC CGGCGGCCCG GGTGGCCGCG GGCCGTTCTG A
|
Protein sequence | MTTCDVCGTE MNMPYHCNHC GGTFCSEHRL PENHDCPGLD DWGDPEGVFD SGFDDSVSAG NQSRSSKGLL ERIGIDTGPG GPLAYFRGNM TYVFLGLMWI TFLFQFLVAT VAIGTPNLQV ATLSSELYRS IFVLAPQHPE YVWTWFTSVL SHGGFAHIAF NSIVIFFFGR LVEDYIGSRD FTLLFLSSGA LAGLGQVLIQ LYQGLPSAAA VGYFPGGVVG ASGAAIAIMG VLTILNPSLR VYVYFIFPVP IWLVTIGLVA MNVLGMFGAG GQGVANAAHL IGLAIGLAYG QHVRDRIRVP NQLQLGGGRG PGGPGGPGGP GGRGPF
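The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, using only the first two blocks of the gene sequence as input. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the record.
raw_prefix = "ATGACGACCT GTGACGTGTG"
prefix = clean_sequence(raw_prefix)

print(len(prefix))               # 20
print(prefix.startswith("ATG"))  # True
print(gc_content(prefix))        # 0.55
```

Applied to the full gene sequence, the cleaned length can be compared against the listed gene length of 981 bp, and the GC fraction against the listed GC content of 62%.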