Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2419 |
Symbol | |
ID | 8384719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2484965 |
End bp | 2486881 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973493 |
Product | protein of unknown function DUF58 |
Protein accession | YP_003131318 |
Protein GI | 257053485 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGAAC GCTCTCGGCG CCCCGACGAG GAGACGGAGG GGGCCGCCGC AGTCGGATTG GGCCGGTTTC TGAGTGTGCT CGGTGTGTTC GCGGTCGCGG TCGGCTTCCT GCTTTTGGTT TCGCCGGATC TCGGGGAGTC GCTCTCGGTG CAATACATCG TCGTGCTGGC GATCGGTGGG CTGTTGCTCT TGTTCGCCGC TCGCCGGTGG TTCAAACGAC TGACTTCGTC GATCGACACC GCCGAAACCC CGCCTGTCGA GCGTCGAACG ACGGTCTCGG TTCCGGGCGA CGGTTTCGAC AAACTGCTCG TCGACACCGC ATCGAACGCG ATGGGCCGGT TGCAGGTGAA GGCGACGGCC CGCGAGCGCA TTCGTGCGGT TGCCAAAGTG GTGGTTAGTG GGGATCCCGA GGATATCGAC GACCAACTTG CAGCCGGGGC CTGGAGTGAT GATCCCGACG CCAACGCGCT GTTCTCGAAC GGAACTGCCA GCGTCCGGGA TCGGGTCTCG TCGTTCGTCA GCGGGACATC CATCCTGAAA CGGCGGGTTG TCAGCGCAAT CGAAGCCCTG GCCCGTCTCG CTGACGAGGA TGTCGAATGG GAGGCCGAGC CGGTCGTCCC GGAAGTAACG CAAGAGCCCG CCGAGGAAGG CGACCATGCG ACCGGCCGCT GGAACGGACT GACTGCCGTG GCACTGACGG TAGTCGGGTT CGGTGTTCTT CTGGCCCGGC CCGGTTTGGT TCTTTCAGGC GCTGTTCTGT CAGGGCTCGG CGCGTACGCG GTCGCCGGGT CCTCCCCCTC GACGGCGGTC CGCATCTCCC GGGAGATCCA GCCGGCCGCT CCGCGACCCG GCGAACCCGT CGACGTCACC GTCGAAGTCG AAAACATCGG TGAGCAGTTT CTCCCTGACC TCCGGATCGT CGACGGCGTG CCGGCTGATC TCACAATCGA GGCGGACAGC CCACGTCACG GGACCGCGCT TCGCCCCGGC GCGACGATGG AATACACGTA TACAGTCCGT GGGATCCGCG GCAGCCACAC GTTCGAGGAC GCGTTTCTCG TCTCCCGAAA CCTTCCGGGG ACGCTCGAAC GAGTCGAGGA ATTCGGTGTC GATGGCGACC GGACCGTCAC GTACGATGTC TCCTCGGCCC TCGATCTGTC GGTCCCGCTT CGCAAACAGG CCTCGATGCA CGTTGGGCGT GTCTTGACTG ACTCAGCCGG GAGTGGCCTG GAGTTTCACT CGGTTCGGGA ATATCGAAGC GGTGACCCAC TGACACGTAT CGACTGGAGT CGGGCGGCCC GTGGCGAAGG GCTGGCGACG CTGCAGTTTC ACGAGGAGCG AGCGGCGACT GTCGTCCTGT TGATCGACGC TCGCAAGGAG GCCTACGTCG CCAACGACGA CGATTCCCCC TCGGCCGTCG ACCGGAGTGT CCTCGCGGCG GCGAAGCTCG CGTCGGCGTT GCTCGCGGCG GACGATCGAG TCGGCTTGGC CGCCCTCTCG CCCAGACAGT GCTGGCTCGC GCCCGGAGCA GGGCATACGC ACCTCGCACG CCTGCAGGAC GTGCTCGCGA CTGACGGGGC CTTCGCTCCG TCGCCACCGA CGCTCCCGTA CTACCAACGG ATCAACCTGC CTGCGCTCCG GAAACGGTTA TCGTCCGACA GCCAACTGGT CGTGTTCTCG CCGCTGGTCG ACGACGAAGT AGTCGACATC GTCCGCCAAC TCCAGGCTAG CGGCCACCCG GTAACGATCA TCAGTCCGGA CGCTTCTGGC AGTGGAACGC CGGGTCGGAC GCTCGCGCGA CTCGAACGCC GGAAACGACT CTCGGAGCTT CGGGGAGCCA ACGTTCGCGT GGTCGACTGG GACGCCGATG AATCGCTCGC ACTCGCGCTG ACGAACGCCG GACGGCGGTG GTCATGA
|
Protein sequence | MRERSRRPDE ETEGAAAVGL GRFLSVLGVF AVAVGFLLLV SPDLGESLSV QYIVVLAIGG LLLLFAARRW FKRLTSSIDT AETPPVERRT TVSVPGDGFD KLLVDTASNA MGRLQVKATA RERIRAVAKV VVSGDPEDID DQLAAGAWSD DPDANALFSN GTASVRDRVS SFVSGTSILK RRVVSAIEAL ARLADEDVEW EAEPVVPEVT QEPAEEGDHA TGRWNGLTAV ALTVVGFGVL LARPGLVLSG AVLSGLGAYA VAGSSPSTAV RISREIQPAA PRPGEPVDVT VEVENIGEQF LPDLRIVDGV PADLTIEADS PRHGTALRPG ATMEYTYTVR GIRGSHTFED AFLVSRNLPG TLERVEEFGV DGDRTVTYDV SSALDLSVPL RKQASMHVGR VLTDSAGSGL EFHSVREYRS GDPLTRIDWS RAARGEGLAT LQFHEERAAT VVLLIDARKE AYVANDDDSP SAVDRSVLAA AKLASALLAA DDRVGLAALS PRQCWLAPGA GHTHLARLQD VLATDGAFAP SPPTLPYYQR INLPALRKRL SSDSQLVVFS PLVDDEVVDI VRQLQASGHP VTIISPDASG SGTPGRTLAR LERRKRLSEL RGANVRVVDW DADESLALAL TNAGRRWS
|
| |