Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0940 |
Symbol | |
ID | 8383213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 905886 |
End bp | 907112 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644972004 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_003129856 |
Protein GI | 257052023 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCTGT TCGAACTCCA GCGCGAGGGC GACGTCCAGG AACTCATCAG GCTCCTCCGG GAGAGCGACA ACGAGGAGGT TCGGATGCGC ACCGCGTCGA TGCTCGGGGA GTTCGACGAG CATGCGGATC GGCGGGATGT CGTGAGTGCA CTCGTCGAGG CTGCTGAAAC TGACGATTCG GGCATGGTGA CTGCCGCGGC AGTCGACTCC CTCGACGAAC TCGGCCAAGA TGCCATCGAG GCACTCATCG AGTCGATGGC GGGGGTCGAC TTCGAGGGTG GCGAAGCGGA CTGGGTGCGC GCGAAGGCCT TCGTCAACGC ACTCGAAGCC GACGTGCCCG AACTCCGGAT GGCCGCCGCC AACGGCCTCG GGCAGTTCGG TGACACCGAT GCGATCGAAC CACTTGTCGG GCGGTTTACC GATCCTGATC CCCGAGTGCG TGCGCGGGCT GCCAGGGCGT GCGGGTCGAT CGGCGACCCG CGGGCGACGG AGCCGCTTGA ATCCCTGTTG ACGGACGACG CCGGCGTCGT CCGGCGGGAA GCCGCCGAGG CACTCGGACA GATCGGGAAC CGACAGGCAC TCCAGGCCCT GCTCGACCTC TACGACGATC CAGCGGAGCG TGTCCGCCGG ATCGCGGTCA ACGCTTTCGG TAACTTCGAT AACGCCGCGC CCGTCGATGC GCTGGTCAAT GCACTCGGTG ACGATGCGGC GACCGTTCGG CGGACGGCCG TCTATTCGAT CATTCAATTG CTCGCGAACG TCCCCACACA GAAGAGTCAC GAGATCCGCG AGACGATCGT CGACCGGTTG AGCGAGACTG ACGACGACAG CGTCGTAGCT CCACTGGTCG AGATTTTGGA GGAGGGGACC CAGGTCGCAC AGCGCCGGAA CACGGCGTGG CTACTCGGCC GGGTCGTCAC CGAACCTGAC GATCGGGTCC TCGACGCGAT GATCGCGGCG CTCGACGACG ACGATCAGAT GACCTCACAG TTCGCGGCGA CCAGTCTGAC TGAACTTGAA GACGTCAGCG TCGAGCGCCG TCTTCTCGAC GTCGTCACGG ACGATGAGCG CCCGACACAG GCCCGGACCC AGGCGATCTT TGCATTGGGG AAGGTCGGTG GCGACCGGTC GCGAGAGACG CTGGACACGC TGATCGACGA GACCGACAAC GAAGAGATAC GCAAACGCGC GTTCTCGGCC ATCTCAAAGC TCGGAGGGCG ACTGTGA
|
Protein sequence | MSLFELQREG DVQELIRLLR ESDNEEVRMR TASMLGEFDE HADRRDVVSA LVEAAETDDS GMVTAAAVDS LDELGQDAIE ALIESMAGVD FEGGEADWVR AKAFVNALEA DVPELRMAAA NGLGQFGDTD AIEPLVGRFT DPDPRVRARA ARACGSIGDP RATEPLESLL TDDAGVVRRE AAEALGQIGN RQALQALLDL YDDPAERVRR IAVNAFGNFD NAAPVDALVN ALGDDAATVR RTAVYSIIQL LANVPTQKSH EIRETIVDRL SETDDDSVVA PLVEILEEGT QVAQRRNTAW LLGRVVTEPD DRVLDAMIAA LDDDDQMTSQ FAATSLTELE DVSVERRLLD VVTDDERPTQ ARTQAIFALG KVGGDRSRET LDTLIDETDN EEIRKRAFSA ISKLGGRL
|
| |