Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2536 |
Symbol | |
ID | 8384841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2601746 |
End bp | 2602747 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644973613 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003131433 |
Protein GI | 257053600 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCTG GCATCGTCGT CGCAGCCATC GTCATGGCGT GGCTGTTCGG TGCTCGATCG TTGAACGCCG TGGCCGTCAC GGCGCTAGTC GCTGTCATCG CCGCGGTCGC CCAGGTCCGA CTGGCCGATC GCCCATCCGC AGAGCACTCG AACCCACCGG CTGGCTTCCC GGGGGACACT CGTGACGTCA CCGTGTCGGT CACCGGTACC CAGGGAACCA TCGTCGAAGC CAGCGATCCA CTGGATTCAG GTCTCTCGAG TGCTGGAAAC ACGTTTTCGG CAAGCATTCC GTCGACGGTC ACCTACGAGG TGACGTTGGC CGAGCGTGGA GTGCAGTCGA TCGGTCCACT TTCGGTTCAC CTCCGGGACG TGTTCGGACT GGTCACCCAG GAACTCGAGA TCGGTGATTC GACGAGGGTT ATCGTCTATC CGGAGATATA CGATGTCACT GGCTCGACTA TCCTCACGGC CGAACTGCAA CGCAATCGCC AACCGGAACG CCAGGAGATC GACCAGTTGC GCGAGTACGT CCCTGGCGAC CCGTTACGCG ACATCGACTG GAAGTCCTCC GCGAAGCGTC TCCCCGACCT CGTCGTCAAA GAGTTCATCG GCCGGGAAAC CACGGGAACG ATCAAAATCG CCGTCAGCAC CGACCGTGAC ACCGTCGAAG CGGCGACCAG TGCAGTCGGC AGCGTCGGCC TGTTTTTCGC CAGAGCTGGA CTCGAGGTGG GCGTGACCCT GCCTGACGGG GAACTCGACC CAGCACTGGG AGAGACCCAC CAGGATGAAC TGTTGCTATT GCTGGCCGAA ACGGGGCCAG GCACGATCGG GGAGCACGAC TGGGCAGAGG CCGACGTTCG CATGGTCGGC GAAGACGGAT CGGTCACCGT TGACGTCAAC GGCCGGGTGA CGACCTTCGA AGACATGCGC GGAGCGGACG ACAGCGACCA ATCACCCTCC GACCACGAGG GACCCGAGAC CGCAGCGGTG ACAGCCACAT GA
|
Protein sequence | MAAGIVVAAI VMAWLFGARS LNAVAVTALV AVIAAVAQVR LADRPSAEHS NPPAGFPGDT RDVTVSVTGT QGTIVEASDP LDSGLSSAGN TFSASIPSTV TYEVTLAERG VQSIGPLSVH LRDVFGLVTQ ELEIGDSTRV IVYPEIYDVT GSTILTAELQ RNRQPERQEI DQLREYVPGD PLRDIDWKSS AKRLPDLVVK EFIGRETTGT IKIAVSTDRD TVEAATSAVG SVGLFFARAG LEVGVTLPDG ELDPALGETH QDELLLLLAE TGPGTIGEHD WAEADVRMVG EDGSVTVDVN GRVTTFEDMR GADDSDQSPS DHEGPETAAV TAT
|
| |