Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1085 |
Symbol | |
ID | 8383359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1058937 |
End bp | 1060496 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644972146 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003129997 |
Protein GI | 257052164 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTAAAC GACTCACCCT CGCGTTTTTC GTGAGCAGGC CGAGGGCTTT CGACGTAGAA CAGATCGATA GCCCAGACAC CAACACCTTC AATGTCGTCT CCGACAACGC CTGTATAATG CGAGCCACGC GTCGATTCTG GGCCAGCAGT GCGACGATCC TCGCCCTCGC AGGGCTGGCA CTCCTCTACA CCGCCCCCAC ACTCCTTGCA GGCGTCGTCC TCCTCTCGGC GTGGCTGCTG ACCGAACAGT TTCGCTTCGC CCGCCGTGCA GCCCACACGA TCGACACAAC GACAATCACA CAAACACTCC CACAGCAACG CGTCGCCGTC GAATCGTCGA CAACAGTCAC ACTCACCGTG CAGCGAGAGT CCACAGCGCT GGACATGGAG ATCATCCCAC AAGTACCAAC CGGCGCACGC GGTACCCCAC AACCACTTAC CCTGGACCCG ACCGACCACG ACGCAGAGAC GACCGACGAC CTCTCCTGGC CGATCGCCGG CGCGTTCACC CTTCCGAAAC CCACCATCAC ACTCCGCGAC CGGTTTGGCC TCTTCGAAGA GACGCTTACG CTCGGGACCA CACCCGACCT CGTCGTCGAG CCACATGCCC CCCGGAACCT CCATGTCGGG GCCGGTGGCG ACGAGATCGC CGCGACCTAC GGCAGCCATA GTGCCCAGCG CGGCGGCAGC GGCCTCGACC CGGCGGAACT CCGCAAGTAC GTGCCTGGCG ATCCCTCGAA CCAGATCGAC TGGAAGGCGA CAGCCAGACT CAACGAGACG TACGTCCGGG AGTTCGAGGC CCAGACCGAT CGCGAGACGA TGCTGGTCGT CGATCATCGC GACTCGCTCG CTGACGGCGA CGAGGGCCAG ACGAAGTTCG CGTACCTGCG GGAGGTGCTG CTGGCGATGA CCGACGTCGC CGAGGGCCTC GACGATCCAC TCGGGATCAC GGCGATCGAC GACGACGGAA TTACCGCACA AATCGACCCA TCCCAGACAC GCAATCAGTA CGAATCAGTC CGGACTCGAC TGCACGAACT CACGCCGACA GGGAGCCAGT CCCACGGCGA TCAGAGACAG CAGACCGCCA CACAGGCGAC ACGAGCGAAT ACAGTCGGGA CGGTGCTACA GGGTGACACG ACAGAGTACG GAACGACACT GCAGCCGTAT TACGCCTCAC GCAAAACCCA CGTCGAACGT GTCACCGACG AGCCGTTGTT CCACGCAGTC AATCAGCTGA GCGACAGCGA ACCAACGCGG CTCCTCGTCG TCGCAACTGA TGACACACAC CGACGGGAAC TCCAGGAGGC GGTGAAGCTC GCGGTCCAGC GCGGCAACCA GGTTGTTGCC TTTTTGACTC CGAACGCTCT CTTCGAGAGG TACGCACTCG CAGATATGGA GGCGACCTAC GAAGCATACG TGTCCTTCGA AGAGTTTCGT CGTACACTCG ATCGGCTCCC ACGCACGCGG GTCTTCGAGG TTGGGCCAGG TGATCGGATC GACGCCGCAT TGCGTGCCGG CCGCACACGA CGCGGCGGGG AGGTGGCCCA TGAGTCATAG
|
Protein sequence | MAKRLTLAFF VSRPRAFDVE QIDSPDTNTF NVVSDNACIM RATRRFWASS ATILALAGLA LLYTAPTLLA GVVLLSAWLL TEQFRFARRA AHTIDTTTIT QTLPQQRVAV ESSTTVTLTV QRESTALDME IIPQVPTGAR GTPQPLTLDP TDHDAETTDD LSWPIAGAFT LPKPTITLRD RFGLFEETLT LGTTPDLVVE PHAPRNLHVG AGGDEIAATY GSHSAQRGGS GLDPAELRKY VPGDPSNQID WKATARLNET YVREFEAQTD RETMLVVDHR DSLADGDEGQ TKFAYLREVL LAMTDVAEGL DDPLGITAID DDGITAQIDP SQTRNQYESV RTRLHELTPT GSQSHGDQRQ QTATQATRAN TVGTVLQGDT TEYGTTLQPY YASRKTHVER VTDEPLFHAV NQLSDSEPTR LLVVATDDTH RRELQEAVKL AVQRGNQVVA FLTPNALFER YALADMEATY EAYVSFEEFR RTLDRLPRTR VFEVGPGDRI DAALRAGRTR RGGEVAHES
|
| |