Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1880 |
Symbol | |
ID | 8384171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1885223 |
End bp | 1887181 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644972949 |
Product | conserved repeat domain protein |
Protein accession | YP_003130783 |
Protein GI | 257052950 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGAGA GTCTCCGCCG GTTCGCGGTC GTGATCGGCC TCGCGTCGCT GGCCGGCGGG AGCGTGGCAG TCCTAACGGC AGGGGCGGGC GGCACCTCGA GCCTGACGGC CGTCCTGTAC GGGGGTGGCG GCGTGATCGC CGGCGTGATG GCGTTGCTGG TCTACCGGGA CTGGCGGACA TCGAGTCGCA TCGAGCCCCC AGCGGTCGAA CGGACGCCCA CGCCGCCGAC ACCCGGAACC GAATTCGACC GGACGCTGGG ACAGTTTGAC GGGAGTGGAA CCGGATACCT CCCGGATCGC GCCGACATCC ACGATCGGCT TCGCGAACTC GCGGCCAGGA TTCTCGCCCG AAAGCGCGGG ACGACACCGG AGGCCGCACT GGCCGCGATC GATGCCGGCG ACTGGCCGCG TGAGGAGGCG AGTTCCGTGT TCCTCCGGAA CGCCGACGCG TCGGTCCCCG AGTCGCTGTC CGAACGCGCT CGTGGCCTAT TCAGCGACGA CCCCTCGGAC TTCCAGCGGC TGGTCAGGCG GACGGTCGCG ACACTCGAAG GCGAGTCCGA CCTCGTCGAT GAGGCGGGCG AAGGACGGCT CGTCCTCGAC GACGACCGGC GATCCGAGGA GACCCACTCG ACGAGCGATC TCGCGGCGGA GATGGTCGGC GGAAGCCAGG TATACGACGG CGTCACCAGC GCGCGCTTCC GGCCGCTGGT CCCGCTCGGC GTCGCGTTCG TCGGCCTCGG CCTGTTCGTC CAGAACGCCG GGATCATCCT GGCCGGGACC GTCCCGATCG GCGTCCTGGC GTTCGCCCGG TTCGTGACGC CGCCGGACGC CACGCTGACC GTCGAGCGGA CGCTCGATCC CCGCCATCCC GATCCCGGAG ACGAGGTCAC GGTGACGACG ACGGTTCGCA AGGACGGCGA CCGGACGCTG CCCGATCTCC GGGTCGTCGA CGGCGTCCCG GAAGGACTCT CAGTGGTCGA GGGATCACCG CGGCGTGGGA CGGCACTTCG GCCCGGCGAG GCAACGACCG TCGAGTACAC AGTGACTGCC CGCCGAGGGA CACACGAGTT CGGTCCTGTC TACGCGATCG TCCGGGACTA TGCGGGCCGG TCGGCACGCA CGCAACTCGT TGAGGCCGTC GAAGACGCCG ACACGTTGCT GTACTGTATT CCGGTGCTAC AGGCGACACC GGTGCAGGTC CCACTGTTCG AACACGCAAG CGAGTCACTC GGCCGGATCC CCGCCGAGGG CGGGGACGGC GTGGCGTTCT ACGCGACCCG GGAGTATCGA TCGGGCGACC CGACGAATCG CATCGACTGG AAACGCCTGG CGCGGTCGCC GAGCGAGGAA CTCACGACGA TCGAGTTCCG GGAGGAACAC GCCGCGTCGG TCGCCATCGC CATCGAGACC GCCGGATCGT CGTATACGGC ACCCGCGCCG GACGAGCCGA CAGCACTCGA ACGGTCGATC GCGGCCGCTC GACGGCTATG TGGGTCGTTG TTGGGGACCG GCGACCGGGT CGGGATCGCG GCACTCGGAC CGACGCCGGT GTGGATCGCG CCCGGGACCG GGTACGACCA CCGCGAACGC GTCGAGCGCG TGCTGGCGAC CGACGACGCG TTCCCGGCGA CGACACCCGA CAGCGATACG TTCAGCCCGC GCTGGGTCCG GGAGTTCCAC CGCCGGTTCC CCGAGGAGAC CCAGGTCGTG CTGGTTTCGG CACTCACGGA CCCCACCTAC CACTTCGTGA TCCATCGACT CCGCGCGTAC GGCCATCCGG TCACGGTGCT GAGTCCGGAC GTGACGACGG GCGAAACCGT CGGCGAACGA CTCGTCCGCC TCGAACGGCG CCACCGGATC GAGGCGTTAC GGGAGGCCGG CGTCCGGGTC ATCGACTGGG CAGACGAGGA AGAACTGGGC GTTGCACTCA CGCGGGCCGG ATCGAGGTGG TCGGCATGA
|
Protein sequence | MNESLRRFAV VIGLASLAGG SVAVLTAGAG GTSSLTAVLY GGGGVIAGVM ALLVYRDWRT SSRIEPPAVE RTPTPPTPGT EFDRTLGQFD GSGTGYLPDR ADIHDRLREL AARILARKRG TTPEAALAAI DAGDWPREEA SSVFLRNADA SVPESLSERA RGLFSDDPSD FQRLVRRTVA TLEGESDLVD EAGEGRLVLD DDRRSEETHS TSDLAAEMVG GSQVYDGVTS ARFRPLVPLG VAFVGLGLFV QNAGIILAGT VPIGVLAFAR FVTPPDATLT VERTLDPRHP DPGDEVTVTT TVRKDGDRTL PDLRVVDGVP EGLSVVEGSP RRGTALRPGE ATTVEYTVTA RRGTHEFGPV YAIVRDYAGR SARTQLVEAV EDADTLLYCI PVLQATPVQV PLFEHASESL GRIPAEGGDG VAFYATREYR SGDPTNRIDW KRLARSPSEE LTTIEFREEH AASVAIAIET AGSSYTAPAP DEPTALERSI AAARRLCGSL LGTGDRVGIA ALGPTPVWIA PGTGYDHRER VERVLATDDA FPATTPDSDT FSPRWVREFH RRFPEETQVV LVSALTDPTY HFVIHRLRAY GHPVTVLSPD VTTGETVGER LVRLERRHRI EALREAGVRV IDWADEEELG VALTRAGSRW SA
|
| |