Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2154 |
Symbol | |
ID | 8384448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2206397 |
End bp | 2207845 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973223 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003131054 |
Protein GI | 257053221 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.690685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGTGA CGCGACGGTT CTGGGCCGCG GTCGGGGCCG GTGGAGTCCT CTCCGTTCTC GGACTGGTCT TTGCCCGGCC GATCCTGGTT GTTGGTGCGG GGGGGATCTG GGGGCTCGTT GTCGGCGCGC AACTCGTCTT TGTGCTTCGG CTCCTCGCAC TCGATCGATC GATGACGATC ACCCAGACGC TCGACAGCCC GTTCGTCACC ACCCGCGGGA CCGTCCGGTG GTCACTCGAA GCGACCCTCG CTCAACCAAC TCCACTCGAG GTGTCCATCG TGCCGACGTT TCCGGTGACT CTTGACGTCT CGAAACAGCC ACGTGTGACG ATCCCACCGG GAGAGACGGG GGGGTGGGCG GACGCCACGG TGACCGCGAC GGTTGCCGGG ACGACGACGA TCCCACGGCC GACCGTTGCA GTCTCGGGCA CCTGGGGGCT GTTCGGCGAA CAGTTTCGCC GCGGACCGAC GACGGACCTC ACCGTCGAAC CTCGACAGGT CGGTGACGTT CACGTCGGGC AGGGTGGCGA GTCGGTGATC GCGACGCCGG GTGGCAGGCA TCGGACTGGC GAAATCGGAT CCGGTATCAG CCCGGCGGAA GTCCGCGAGT ACGTTCCAGG GGACACTGTC AGTGACATCG ACTGGAAGGC AACGGCACGC CTGGCGTCCC CGCATGTCCG GGAGTTCGAA GTCGAGACCG ACCGGCAGAC AGTCCTGCTT TTCGACCGCC GGAGTCGGCT GGAAAGCGGC CCTGGCGGCG AATCGATGCT CGCATATCTC CGGGAGGTGG CACTCAGGTT TGTCTCGGCC GCGGCGAACC TCGACGATCC GCTCGGGCTC TATGCGATCG GTGACGGCGG TGTGACGGAC GAAGTCATGC CGCGGGCGGA CGAACGGACC TACGAACACA TCCGGTCACG ACTCCAGACC GCGACGCCGA CCGGTGGGGC CGAGACGGCG GATGCTTCGA CAGCGATGGA GGCCGATCTG ATCGGTCCCG GAACCGCCCG GCGGAACGCG ACACGGCTTC GCGAGGCGGC GTCCCCGTAC GCCCGGTCTC TCCACCCGTT CTTCGCGGAC GGCACCCGGT ACGTCCGGCA GATCGCCGAT CGACCGCTGT TTGGGGCTGG GAAAGCCTAC CTGCCCCGGA TCGACGGCGA GATGTGGGTC GTTATATTCA CTGACGACCG TGACCGAACC GAGGTGCGAG AGACAGTCAA ACTGGCCAGA GAGCACGGGA GACGGGTCGT GGTATTTCTT GCGCCGGGGG CGCTGTTCGA GACGGAGCTG GTCGGTGATC TCGATGCTGC GTATACCTCC TACCGTGGGT TCGAGGAATT CAGGCAGACA CTCGCTGGAC TCGATCGGGT CGAGGCGTAC GAAGTTGGGC CAGGTGATCG CGTGGAAGCA CTGCTTTCAA CACGCCGCGA GCAAGGGGGA CGACAATGA
|
Protein sequence | MEVTRRFWAA VGAGGVLSVL GLVFARPILV VGAGGIWGLV VGAQLVFVLR LLALDRSMTI TQTLDSPFVT TRGTVRWSLE ATLAQPTPLE VSIVPTFPVT LDVSKQPRVT IPPGETGGWA DATVTATVAG TTTIPRPTVA VSGTWGLFGE QFRRGPTTDL TVEPRQVGDV HVGQGGESVI ATPGGRHRTG EIGSGISPAE VREYVPGDTV SDIDWKATAR LASPHVREFE VETDRQTVLL FDRRSRLESG PGGESMLAYL REVALRFVSA AANLDDPLGL YAIGDGGVTD EVMPRADERT YEHIRSRLQT ATPTGGAETA DASTAMEADL IGPGTARRNA TRLREAASPY ARSLHPFFAD GTRYVRQIAD RPLFGAGKAY LPRIDGEMWV VIFTDDRDRT EVRETVKLAR EHGRRVVVFL APGALFETEL VGDLDAAYTS YRGFEEFRQT LAGLDRVEAY EVGPGDRVEA LLSTRREQGG RQ
|
| |