Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1173 |
Symbol | |
ID | 8383448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1147936 |
End bp | 1149345 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644972232 |
Product | conserved repeat domain protein |
Protein accession | YP_003130082 |
Protein GI | 257052249 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCAT CCGGTGACGA TGCGCCCGAT CGAAGACAGC GGGCGGACCG CGGTCGCGCG GGGCACGTCG AGGCCGGCGT CGTCGATCAC CGAACCGGTC ACTGGACCGG CGTGGGCGCG GTCGCGTTCG TCTTCGGCGG CCTCGGGATC GCCCTCCGGG AGCCAGCCCT GTTGCTCGCC GGCGTCGTCG GTGCGGTCTT TACGGGGTAC GCACGTAGCG CAGAGCCGAT CCCCGTCCGC GATCCGGAAA CCGGCGAGCC GGCGCTGTCG ATCCGTCGAC GACTCGAGCC CGACGCGCCT GAACCCGGCG AGGACGTCAC CGTCACGGTC GAACTCCGCA ACGAGGGGGC GGGCGTCCTC ACCGACGTGC GGATCGTCGA CGGCGTCCCG CCGGCGCTCG ATGTCACTGA CGGCACGCCA CGCCACGGCG CGGTACTTCG GCCCGACGAG GGTGTGACCT ACGCCTATAC GGTGACCGCG ACCCGCGGCG AGCACGACTG GCGACCCGCG CAGATCACGG TCGCGGATCC GAGCGGGGCT GTCGAACGCG AGACGACCGT CGACGCCCCC ACGACGCTGT CCTGTTCGCT CCCGGCGCTC ACCAGCGAAC AACTCCCGCT TCGGGGGTTG ACGACGCCGT ACGTCGGGCG CGTCGACACG GACGAGGGTG GATCGGGCGT GGAGTTCTTC TCGACGCGGG AGTACCGGTC GAGCGATCCG CTCTCGCGGA TCGACTGGCG TCGCCACGCC AAGACCGGGG AGCTCGGCAC GCTCGAATAT CGCGAGGAAC GCGCCGCGAC GGTCATGCTG GTGATCGACA CCCGGCAGGC CGCCTATCGA GCGCCGGAAC CGGGGGCGTA TCACGCCGTC GAGCGGAGCG TCGACGCGGC CAACCGGGTC TTCGCCGCCT TGCTCGACAC CGGCGACCGC GTCGGTGTGA CAACGCTCGG CCCGGCCAGT GAGGAGTGCT GGCTGTCGCC CGGGGCCGGC GACGAACACC GCGCTCACGG CCGTGCGTTG CTGAACTCCC ATCCCGCACT CTCGCCGATC CCACCCGACG AGGGGCTGTT CGAGACGATC CGCGACGACC GCCGTGAGCA ACTCAAGGAC CGGCAGGTCA CCCGGCTCCG CCGTCGGCTC TCCGCGGACA CCCAGGTGTT CGTCTTCTCG CCGTGCTGTG ACGGGTACGT CCCGACCGTC GCTCGCCGGC TCGACGCTCA CGGGCAACTC GTGACGGTGT TGAGTCCCGA TCCGACGACC GACGATACGC CCATTCGCGA ACTCGCCAGC ATGGAACGCG CGGATCGCCT GGACGGCCTC CGTGGCGAGG GGATCCGTGT GCTCGACTGG CGCGACGGGG AGTCCTTCGC GGCGGCGCTC GCCCGCGCGC AGGCACGGTG GTCGGCATGA
|
Protein sequence | MTASGDDAPD RRQRADRGRA GHVEAGVVDH RTGHWTGVGA VAFVFGGLGI ALREPALLLA GVVGAVFTGY ARSAEPIPVR DPETGEPALS IRRRLEPDAP EPGEDVTVTV ELRNEGAGVL TDVRIVDGVP PALDVTDGTP RHGAVLRPDE GVTYAYTVTA TRGEHDWRPA QITVADPSGA VERETTVDAP TTLSCSLPAL TSEQLPLRGL TTPYVGRVDT DEGGSGVEFF STREYRSSDP LSRIDWRRHA KTGELGTLEY REERAATVML VIDTRQAAYR APEPGAYHAV ERSVDAANRV FAALLDTGDR VGVTTLGPAS EECWLSPGAG DEHRAHGRAL LNSHPALSPI PPDEGLFETI RDDRREQLKD RQVTRLRRRL SADTQVFVFS PCCDGYVPTV ARRLDAHGQL VTVLSPDPTT DDTPIRELAS MERADRLDGL RGEGIRVLDW RDGESFAAAL ARAQARWSA
|
| |