Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0974 |
Symbol | |
ID | 8383247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 937448 |
End bp | 939223 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644972038 |
Product | hypothetical protein |
Protein accession | YP_003129890 |
Protein GI | 257052057 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTCCC GAACACAGAT CTTCGTGATG GTCGTCGGGC TCGCACTGGT GGGCGTACCA GTCGCCGCGG CGGCCGGCTC CCCGAACACA GCCGCTGCCC CAGACAGCTC TCCAACCAGC ACAGATCTCT CGAACGTGAC GAGTGACGCC ACCGTTGCTG ATCAGCATCC GCCCGACCCC GAGTCGGACG TCCTCGGCTG GGAGAGCGGC TACTGGTACA ACGAGTCGAT TGCCGTCACG CCCGACGATG GGCTGAACGA CTCTGAACTC GACGCGGTGG TCGCCCGCGG GATGGCACGC GTCGAGCAGA TCCGGCGTCT GGAGTTCGAA GAGACGCCGC CCGTCGAAGT GATCAGCCGG GAGAACTACA CCGAGCGTGT CGGCAATCAG ACGGCGAGCG TGACCACCGC ACAGCGTCTC CACCAGAACG TCAAATACGA GGCGCTGTTG ATGATCAACG AGTCGACCGA CGCGATCGCC GCCCAGGAAC GGAATCAGGC TGGTGGTGTC GGTGGGTTCT ACGATCCCTC GAACGGCGAA ATCAAGATCG TCTCCGAGAA CGCCACGACG CCGAAGATGA ACGAAATCAC CCTCTCACAG GAACTGTTCC ACGCGCTGCA GGACCAGCAG TTCAACATCT CGTCGTTCAA CCAGTCGACC CAGGAACTCC ACAACGCCAA AGACGGGATC ATCGAAGGCG ACGGCAACTA CGTCGACCGG CTATACCAGC AGCGATGCGA AGGTGAGTGG CAAGGCGACT GTATCATGCC CGGGGAGTCA CAGACGCCGG CGAACTTCAG TCCGCACATC GGCCTCTACC AGATCACCCT CCAGCCCTAC AGTGACGGCC CGGCATTCGT TCGAGACCTC CACCAGTCCG AAGGCTGGGA CGCGGTGAAC GCGGTCTACG AGAACCCGCC GGCCTCGACC GAGCAGACGA TCCACCCCGA GAAGTACGGT GAAGACGAGC CGTCGCCGGT TCCCATCGAA GATACAAGCG ACGACCGCTG GCGACCACTC GACATTAACG CGAGTATCAA CCACGCCTCC TTCGGCGAGG CTGGACTGTT CGTGACGCTG TGGTATCCGG GCTACGAAAG CAGCACTGCG ACCCAGATCA TCCCGTATCG AGCCCACCTG AACATCGGCG CTGACGGGCT CAACGAACTC GACCCATACA ACTACAACCA CACCTACACC AACGGCTGGG ACGGGGACAA GTTACTGCCG TACGTGACTG AGGAGTCGAG CGAGACCAAC GAGACCGGCT ACGTCTACAG GACTGCCTGG GACTCGCCGA CAGACGCCGA AGAGTTCCAG AACGGCTACG AGCAACTCCT TGCGTTCCAC GGCGCTGAAT CCGTCGACGA TCACGAGAAC GTCTACCGAA TCCCCGACGG CGAAGGGTTC AACGACGCCT TCTACCTCGA TCAAAGCGGC GAGACGCTCA CGATCGTCAA CGCGCCGACC CTTGCAGAAC TGTCGGGCGT AGATGTATCG GCCCCGCAGA TCGAGGAATC CACAGAGACG ACGCCCGACG ACGGAACGAC CACTGAGGCG GACGACGACG AACCGACGAC CGACGACAGC GAGGACGAGA CCACGACGTC CGACGGCGAG ACGACGACGG ACGAACCTGA CGACAGCGAG ACCACCCAGT CGACAGACAC CGACGACTCG ACCGACGCGA CGACGACCAA CGGCCCCGGA TTCGTCGCCA CGAGCGCGCT GATCGGCCTT CTCGCCGTTG CACTCCTGGC GCTCCGGCGA CGGTAA
|
Protein sequence | MVSRTQIFVM VVGLALVGVP VAAAAGSPNT AAAPDSSPTS TDLSNVTSDA TVADQHPPDP ESDVLGWESG YWYNESIAVT PDDGLNDSEL DAVVARGMAR VEQIRRLEFE ETPPVEVISR ENYTERVGNQ TASVTTAQRL HQNVKYEALL MINESTDAIA AQERNQAGGV GGFYDPSNGE IKIVSENATT PKMNEITLSQ ELFHALQDQQ FNISSFNQST QELHNAKDGI IEGDGNYVDR LYQQRCEGEW QGDCIMPGES QTPANFSPHI GLYQITLQPY SDGPAFVRDL HQSEGWDAVN AVYENPPAST EQTIHPEKYG EDEPSPVPIE DTSDDRWRPL DINASINHAS FGEAGLFVTL WYPGYESSTA TQIIPYRAHL NIGADGLNEL DPYNYNHTYT NGWDGDKLLP YVTEESSETN ETGYVYRTAW DSPTDAEEFQ NGYEQLLAFH GAESVDDHEN VYRIPDGEGF NDAFYLDQSG ETLTIVNAPT LAELSGVDVS APQIEESTET TPDDGTTTEA DDDEPTTDDS EDETTTSDGE TTTDEPDDSE TTQSTDTDDS TDATTTNGPG FVATSALIGL LAVALLALRR R
|
| |