Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0874 |
Symbol | |
ID | 8383147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 842823 |
End bp | 843872 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644971938 |
Product | hypothetical protein |
Protein accession | YP_003129790 |
Protein GI | 257051957 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGGCC AGGACTTTCA CGCCGAATTC GACTTCGAGT TGGGTGTCTG TCAGTGGGCC GAGCGCGCCT GGCCGCCCGA TAGCGACTGC GAGAACCCAC TGGTCGTCGC CCGGCAACTT GGCACGAAAC GTCGCCGGTG GGACACGATC GTCCTCGAAT GTGATCGCGA TGGTCTCCGT CAGCGTGCCG AGTTCGGCGA ACGCGCTCTC GACTCGGATC TACTCGACGT CATCCCGCAT GCGCCCGCCG AGTGGACCTA CTACCGCGAT GCACTCCCCG ATCCCGGCTA TCCCTGGCGG TACGTCCGGG AGGCGATCCA TCGTGCAGGA GATCGAGGAA TCCTGGCTGT CCGCAAGAGC GGTGGCCGGA TCGAGATCCG CCGGAAGTCC GTGTATCCGT CATGGATCGA GCGTGTGATC GCGATCGAGA ACAAACCGGA TCTCGACGCG AGTGCGGCGC GCCACCTCGC CCCTCAGATC GAACGCGACG TCGCGATGGG ACTGGCCGAC GAAGTGTGGG TCGCCACGGC CGAGACGGGC GAGCGGATCG AACCCGCGTT ACTCGAGAAC GTCCCGGTCG AGGCCGGCAT CCTGGTCGTC GATCCATCGG CTGGCGATGC CGATGTCGCC TGGAACCCCC GGACGCTCGA CCCGTCGAAA CCTGGCGTCC GGATTCTCGA CCGCCCCTCG GGTGGCGACC ACGACGCCTC GGCCGCGCGT TTCGAGTACG TCGACCCGGA GTGGAAGGCA CAAAAACGCC GAGAGATCGC CGAACGCGCC TACGAACGCG GGTGGCGAAG TTACGTCGAG TCGATGCGGC CCGACTGTCG CCATTTTGAG CTGAGTTGGG AAGAGCCCGG ACCGACCCCG TTCTGTGCCG CCAAGGACCG ACGGCAGACT GCCGCGGAGT GTCGCGGTTC GTGCGGTGAA TTCGAACCCG AACCGCCAGT CTGGCGGACG AAGGGATGGC CGATCGAGGG CGGGCCAGGT GCGGCGATCA AGCGACTGCT GGCACGGCGG CGTCGGCGTC AGCGCCCCGG CATGAAGTAA
|
Protein sequence | MTGQDFHAEF DFELGVCQWA ERAWPPDSDC ENPLVVARQL GTKRRRWDTI VLECDRDGLR QRAEFGERAL DSDLLDVIPH APAEWTYYRD ALPDPGYPWR YVREAIHRAG DRGILAVRKS GGRIEIRRKS VYPSWIERVI AIENKPDLDA SAARHLAPQI ERDVAMGLAD EVWVATAETG ERIEPALLEN VPVEAGILVV DPSAGDADVA WNPRTLDPSK PGVRILDRPS GGDHDASAAR FEYVDPEWKA QKRREIAERA YERGWRSYVE SMRPDCRHFE LSWEEPGPTP FCAAKDRRQT AAECRGSCGE FEPEPPVWRT KGWPIEGGPG AAIKRLLARR RRRQRPGMK
|
| |