Gene Information |
Locus tag | Huta_1980 |
Symbol | |
ID | 8384274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2004769 |
End bp | 2005794 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973050 |
Product | hypothetical protein |
Protein accession | YP_003130881 |
Protein GI | 257053048 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | [TIGR00374] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.335213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGCG ATCGGAAGAC GACCCTTGCC GGGTTCGCCG GCGCGCTCGC CGTACTCGGG TTGCTGTTCT GGGCCGTCGG CATCGACGCC TTGGTCGGCC ATCTCTCGCG TGCCCGTCTC CCGCTCGTTC TCGCTGTCGT GACGATGACC GTCGTCTGGC TGTCCTGTTG GGGGATGTCC CTGCACACGG TCCTTGGCGC ACTCGATTCG CCGGTCCCGG CTCACACGGC GATCCTAGTG TTCTCCTCGG CTGTCTTCTC GAACGCGATC ACGCCGTTCG GGCAGGCCGG TGGCGAACCG GTCGCGGCGT TGCTCGTCTC GGAGGCCGCC GACAGCGAGT ACGAGACCGG CCTGGCCGCG ATCGCGAGCG TCGATACGCT GCACTTCGTG CCCTCGATCG GCCTCGCGAC TGTCGGACTG GGGACGTTCG CCGTCGAGTC AGTCAACCTC GACCAGAATC TCTATTTTGC GGCGGCTGCG GTCGGATTGC TCGCGGCAAC TTTCCTCGGT GCGGCACTCC TTGGCTGGCA GTATCGCTAC GAGATCGAAC GCGTCGTCGT CGGGGTGTTG ACGCCGGTCA TCCGTCGTGT GAGTGGGGTG CTCCCCCGCG TAGAACCGCC CGAAGGGAAT GTCATCGAGG ATCGTATCGA GGGATTTTTC ACGGCGATCG ACCGGGTTGC GGGCAGTCGC CGAACGATCT TGCTTGCCTC GCTGTACTCG ACGGCGGGGT GGCTCTCGCT GTCGACGGCG CTGTGGCTGT CGTTGGCCTC GCTCGGCCAC GTCGTTCCGT TCGTCGCCAT GCTGGTCGTC GTCCCGGTCG CCTCGATCGC GGCGATCACG CCACTCCCCG GCGGGCTCGG CGGGATCGAG GCCGCATTTA TTGCCCTGAT CGTCTCGACG ACGGGGCTCG CCGCGTCGGT CGCCGGGGCG GGCGTCGTCA TCTACCGGCT CTCGACGTAC TGGCTCACGC TGTTCATCGG GGGCACGACA GCAGCGATTC TGGGCGAACG GTATCGATCC TCGTAA
|
Protein sequence | MDGDRKTTLA GFAGALAVLG LLFWAVGIDA LVGHLSRARL PLVLAVVTMT VVWLSCWGMS LHTVLGALDS PVPAHTAILV FSSAVFSNAI TPFGQAGGEP VAALLVSEAA DSEYETGLAA IASVDTLHFV PSIGLATVGL GTFAVESVNL DQNLYFAAAA VGLLAATFLG AALLGWQYRY EIERVVVGVL TPVIRRVSGV LPRVEPPEGN VIEDRIEGFF TAIDRVAGSR RTILLASLYS TAGWLSLSTA LWLSLASLGH VVPFVAMLVV VPVASIAAIT PLPGGLGGIE AAFIALIVST TGLAASVAGA GVVIYRLSTY WLTLFIGGTT AAILGERYRS S
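The coordinate, length, and GC content fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2004769, 2005794)
protein_len = expected_protein_length(length)
print(length, protein_len)  # prints "1026 341"
```

Both printed values agree with the Gene Length and Protein Length fields. Passing the full Gene sequence field to gc_percent gives a value that can be compared with the GC content field.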