Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1944 |
Symbol | |
ID | 8384237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1968974 |
End bp | 1970311 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973014 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003130846 |
Protein GI | 257053013 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC GCACGGTCGA CCGGATCAAT CGGTCAGACG GCGCGATTGT CACGACGTTT CTGGTCGCCG GCATCGGATT CGTCGCCGGG AGTCCATTCC TGATCGTCGC CGCGACGGTC CCGCTTTGGT ATGCCGCCGC GAGCGTCATC GGCACCCGGG AAGACGCGGA AATTCGCGCC CACCGTGAGA TGGTCCGCAA TGGCGACGGT TGCGATAGCG ACGGGACGGC GACACGCGAG GAGCCGCTCA CCGGTGATCC GGGTGACGTC GTTGCCGTCC GGACGACCGT CGAGAACGTC GGCTCGGAGC CGATCGTCGA CCTTCGGCTC GTCGATGGCG TGCCAGCGGA ACTCCCTGTC GTCGATGGAA CGCCACGGGC CTGCGTGAGT CTCGACGCCG GCGAGACCGT GACGATCGAG TACGACATGG AGCTCCATCG CGGCGAGCAT ACGTTCGAGG CGGTGGATGT CCGGACGCGG GATCTCACTG GAACTGTCGT CGAGACCTGG AACGTTGCGG CGACCGGCGC GGAGGCGCTC AGCTGTCTGC CACCCATCGA AACCGTCCCG CTTCGCGGTG GCGCAAACGA TTTCGCCGGC ACGGTCCCGA CTGACGACGG CGGGAGTGGC GTCGAGTTTT ATTCTGTGCG AGACTACGAA CCGGGCGACG CCATCGGCTC GATCGACTGG CGTCGCTACG CCCGGACGCG GGATCTGACG ACCGTCGAGT ATCGTGCCGA GCGGGCGACC CGGATCGTCT GTCTCGTCGA CGTCCGACAC AATCAGTTCC GCGGGGCCTC TCAGGATCGG CTCGCCGCCG CCGAGTTATC CGCAGACGCT GCGGAGCGAA CCTTCGAGAC GCTCGTCGAG GCCGGTCATC CGACCGGCGT CGTCGCCGTC GGCGATGACA ACCACTCCTC GGTCCCGCCG GGGACTGACC CGGGGACCCG GGAGGCCGCC ACGACGTTGC TCGAAGCGGC GCGATCGAGC AAGCGTCTCA CGAACGTTCC ATTCTGGCAC TTTGGCATGT CAAAAGATCC GTTCACGAAG ATCGAAACCA CGCTGCCCGG CGAAGCCCAA CTCTACCTGT TCTCGTCGTT CGTCGACGAC AGGCCGGTCG AGTTAATCGA ACGGTTACGG ACCCAGGGGT ACACCGTTCG CGTCGTCTCC CCGGACCCGA TAGCCGACGA CAGCACCGAA GGTCGTTTCG AGGCGCTCGT TCGTCGGACC CGACTCGCCC GTGCCAGGTC GGCCGGGGCC CGGGTCGTCG ACTGGGACCG CACGCGACCG CTCGGCATCG TGCTCCGCAA CACGATCGGG ACGGTGACGA CACGATGA
|
Protein sequence | MTDRTVDRIN RSDGAIVTTF LVAGIGFVAG SPFLIVAATV PLWYAAASVI GTREDAEIRA HREMVRNGDG CDSDGTATRE EPLTGDPGDV VAVRTTVENV GSEPIVDLRL VDGVPAELPV VDGTPRACVS LDAGETVTIE YDMELHRGEH TFEAVDVRTR DLTGTVVETW NVAATGAEAL SCLPPIETVP LRGGANDFAG TVPTDDGGSG VEFYSVRDYE PGDAIGSIDW RRYARTRDLT TVEYRAERAT RIVCLVDVRH NQFRGASQDR LAAAELSADA AERTFETLVE AGHPTGVVAV GDDNHSSVPP GTDPGTREAA TTLLEAARSS KRLTNVPFWH FGMSKDPFTK IETTLPGEAQ LYLFSSFVDD RPVELIERLR TQGYTVRVVS PDPIADDSTE GRFEALVRRT RLARARSAGA RVVDWDRTRP LGIVLRNTIG TVTTR
|
| |