Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2764 |
Symbol | |
ID | 8385070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2835009 |
End bp | 2838263 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644973839 |
Product | PKD domain containing protein |
Protein accession | YP_003131658 |
Protein GI | 257053825 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTCT GGGGCGACGA GCGCGCGCAA GCGATTCAAA TCGGTGCTGT ACTGCTGTTC GCTGTTCTCG TGATCTCGTT CTCGCTGTAC CAGGCGTTCG TCGTCCCGAA CCAGAACGCG GGGATCGAGC TCAACCACAA CTCTGAGGTC CAATCACAAC TCCAGGAGCT GCGCAACGCC ATTCATGGGA TCGCGGGCGG GAACCCGGGT TCGGGAGTTA CGGTCAATCT GGGCACGACC TATCCCACTC GAGTGGTCGC ACTGAATCCT CCACCGCCGT CCGGACGGCT GTATACGGAT GGGACGAGGA ACCCGTCAGT GAATTTCACG ATCGTCAATG CGACCGCTGA CGGGGAGACC GGTGATCTCT GGAACGGGTC AGCCCGACCG TACAACACGG GGGGAATCAT GTACGCACCC GATTATAACG AGTACCGGTC CCCGCCGACG ACGATCTACG AGAACTCGCT GGTCTACAAC CAGTTCGAGA GCCAAACGCT GTCGTTGACG GATCAGTCGC TGATCTCCGG AGATCGGATC TCGATCGTAA CTCTCAACGG CTCGTTCGAT CGGTCCGGAT CCCGGTCGAT GACCGTCGAT CCTGAGCCGA TCAGCACCTC CGACCGGACG ATATCGGTGG CCGATGGAAC CGGCCCGATC ACTCTCAACA TCACCACGCG ACTCTCGGTC GATCGATGGC AGACGCTGTT GAACGACGAG GACAACGTCC TGACGGCCGA TATCCGAACT GCCCCCGATA CTGACTCGGT TCCGGATCCG TTTCGACGCG TGCTGATCCC ATTGAACCGG AGCGAGACCT ACACTCTCGG GATGACGAAA GTCGGGGTCG GCTCGGGAGT GAGTGAGGAA TCCGGTGAGT ACCTCACACT AGTTGATCGG CCGAGTCGGT CGATAAGAAA TGGGTCGAGT CACTTCATCA CGCTTGAAGT TCGTGATCGA TTCAACAATC CCGTATCGGG CTTTCCAGTG AACGCTATGG TGTCCGGAGG CGGGCAGTTC GAAAGCGGCG GAACAAGCGC GTCAACCATC ACAAATCCCG CAGGGCAAGC AGTCTTCAAC TACACTGCAC CGGCAGGCGA GCAATCACCA ACCCTCCAGT TCAATATTAC TGAGGGGACA CCCCAAGCTT ACGAGGAAGT GTCGTTCGGT CTCGAAACGT ACCTGACTGG CGAGGGTGAA AGTGGTAACG CAACAGCCTC GGTAGACGCG ACCTATGGAG GACTTTCAGA TTTCGCTTCG CAGTATGATG GGGCCACGAA CAGTGCTGAA GAGCTTTTCA CACAAGCGAA CGGTAAGTGG ACGAATGTCA ATCAGACTGG GAGTCTGAAC CTCAGCGCTG CCGAACCAAT CTTGCAAGAA AGTACGGGGC ACGACCGACT TGATATCGCC TTCACGCTGC GGGGCGGGTC CACTCCACAG TACAATTACT CCCTCGCGGT CACTGCCGAG ATCGCGGCTG ACGGCAGTAT CAACAGTCGA TCGGTGTCAC TCCGAAACAC GACTGGGACG ACAAATTCCC AAAGCAGCGG CTCACTGTCC GATAGTGCCG TTCGAAAATT GCTCGATGGC GGATCCGTCA ATCTCCTCGA TTCGGGCTCA TATTTGAGCT CACCGTCGTG GCTATCGGAA ATTCAGCGGC TGGAGACGGC GTCTCCGATC ACCTGGCTGA CGAACCGGAT GGAGGGACGC GTCAACGTGA CGTGGCAGAA CGCTCCGCCG GTCGCGGATG CTGGTTCGCC GAGCGACATC GAGGAGGGAT CGAGCCAAAC ACTCGATGCG AGTGCAAGCT CCGATCCCGA CTTGGATACG CCATTCACCT ACTCTTGGAC GGTTATCAGT GGACCAGGGA GTATTTCAGG TACTGGTCCG ACCCCGACAT ACAACGCACC AGCAGACGTG GGCAGCGACC AATCTGTCAC CATCGAGGTC ACGGTTACTG ACGCCGACGG CGATTCGAGT ACGGATCAAG CCACCTTCAC CGTAGAGGAT GTCCCTGATA ATATCTCACC GACTGCTGAA GCCGGAACAT ATGCAGACAT CGACGAGGGA GGGAGTATTA GTCTCGACGG ATCAAGTAGT TCCGACGCGG ACGGGACTAT TTCGACCTAC GATTGGGCTG TCGCTACCGG GCCAGGCTCG ATCTCCGACT CAGATAGCTC AACACCGGGT GCGACATACA ACGCACCGTC GGACGTGACA AGCGACCAGT CCGTCACCGT CGAACTGACC GTCACGGATA ACGAGAGTGC CACCGACACC GACACCGCGA CGTTCACCGT TCGCGATAAT GCGGAGCCGG CAGCGAATGC TGGTGGCCCG TACACGGTAA ACGAAAGTGA TTCAGTGACC CTCGATGGGA CTGCATCATC AGACGATACC GGGATTACGA GTTACTCTTG GGCAATTACG GACAACGCCT CTGCGGGGAG TCTTTCGAAC AGTAACACGG CGTCGCCAGA ATTCATCGCC ACTTCGACAA GTGGGGGTAC GATTGTCACG GTCGAACTTA CGGTGACGGA CGACGCTGGA CAGACCGATA CCGATACTGC TACGATCACG ATCAACGACC CGCCCGTCGC GAACTTCACG TACTCGCCGA GTTCACCTGG ACCAGGCGAG CAGATCTCCT TCGACGGATC AAGTTCGAGT GACTCTGACG GCAATATCAG CACGTACGAG TGGGACTGGA CTGGCGATGG GACGACGGAC GATACCGGTC AGACTGCGAC CCACACATAC GATAGCAGTG GGACCTACGA CGTGACGCTC CGGGTGACCG ACAATGATGG GGCTCAGAAC ACGACGACCC GGACGATCTC AGTCACGCAA GCAAATATTC TACAGAACGT GGAAGCAGAA GCCCAGACCT CAGGAAACAG CGGAAAAGCG TCTTTCGTGC TTAGCAATAA CGGATCCGCT ACCGTGACCG TGTCTGGTAT ACTCTTCAAT AGTTCCAATA GTTCGACGTA TGTGGAAGCA AACAATCAGA ATACGCCGGA GATTGAGGAC GATAAGGCAT CAACGCCAAG TCCTATCCTC AATGTCCCCG GACAAATGAA CTTCAACCAA CAGTACTCGT TCAATAACTC CGTTCCGATT GATCCGGGGG AGTCGATTCA ATTCAACATT GATCGGTTCA GAAATGATGG GAACATGCAA AGTGCAACAC TGACCTTTAC CATCTACCTA AACGACGGAA GTCAGGAAAC CTACGACATC ACATTCTCTA ATTAA
|
Protein sequence | MSFWGDERAQ AIQIGAVLLF AVLVISFSLY QAFVVPNQNA GIELNHNSEV QSQLQELRNA IHGIAGGNPG SGVTVNLGTT YPTRVVALNP PPPSGRLYTD GTRNPSVNFT IVNATADGET GDLWNGSARP YNTGGIMYAP DYNEYRSPPT TIYENSLVYN QFESQTLSLT DQSLISGDRI SIVTLNGSFD RSGSRSMTVD PEPISTSDRT ISVADGTGPI TLNITTRLSV DRWQTLLNDE DNVLTADIRT APDTDSVPDP FRRVLIPLNR SETYTLGMTK VGVGSGVSEE SGEYLTLVDR PSRSIRNGSS HFITLEVRDR FNNPVSGFPV NAMVSGGGQF ESGGTSASTI TNPAGQAVFN YTAPAGEQSP TLQFNITEGT PQAYEEVSFG LETYLTGEGE SGNATASVDA TYGGLSDFAS QYDGATNSAE ELFTQANGKW TNVNQTGSLN LSAAEPILQE STGHDRLDIA FTLRGGSTPQ YNYSLAVTAE IAADGSINSR SVSLRNTTGT TNSQSSGSLS DSAVRKLLDG GSVNLLDSGS YLSSPSWLSE IQRLETASPI TWLTNRMEGR VNVTWQNAPP VADAGSPSDI EEGSSQTLDA SASSDPDLDT PFTYSWTVIS GPGSISGTGP TPTYNAPADV GSDQSVTIEV TVTDADGDSS TDQATFTVED VPDNISPTAE AGTYADIDEG GSISLDGSSS SDADGTISTY DWAVATGPGS ISDSDSSTPG ATYNAPSDVT SDQSVTVELT VTDNESATDT DTATFTVRDN AEPAANAGGP YTVNESDSVT LDGTASSDDT GITSYSWAIT DNASAGSLSN SNTASPEFIA TSTSGGTIVT VELTVTDDAG QTDTDTATIT INDPPVANFT YSPSSPGPGE QISFDGSSSS DSDGNISTYE WDWTGDGTTD DTGQTATHTY DSSGTYDVTL RVTDNDGAQN TTTRTISVTQ ANILQNVEAE AQTSGNSGKA SFVLSNNGSA TVTVSGILFN SSNSSTYVEA NNQNTPEIED DKASTPSPIL NVPGQMNFNQ QYSFNNSVPI DPGESIQFNI DRFRNDGNMQ SATLTFTIYL NDGSQETYDI TFSN
|
| |