Gene Huta_2764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2764 
Symbol 
ID8385070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2835009 
End bp2838263 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content56% 
IMG OID644973839 
ProductPKD domain containing protein 
Protein accessionYP_003131658 
Protein GI257053825 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTTCT GGGGCGACGA GCGCGCGCAA GCGATTCAAA TCGGTGCTGT ACTGCTGTTC 
GCTGTTCTCG TGATCTCGTT CTCGCTGTAC CAGGCGTTCG TCGTCCCGAA CCAGAACGCG
GGGATCGAGC TCAACCACAA CTCTGAGGTC CAATCACAAC TCCAGGAGCT GCGCAACGCC
ATTCATGGGA TCGCGGGCGG GAACCCGGGT TCGGGAGTTA CGGTCAATCT GGGCACGACC
TATCCCACTC GAGTGGTCGC ACTGAATCCT CCACCGCCGT CCGGACGGCT GTATACGGAT
GGGACGAGGA ACCCGTCAGT GAATTTCACG ATCGTCAATG CGACCGCTGA CGGGGAGACC
GGTGATCTCT GGAACGGGTC AGCCCGACCG TACAACACGG GGGGAATCAT GTACGCACCC
GATTATAACG AGTACCGGTC CCCGCCGACG ACGATCTACG AGAACTCGCT GGTCTACAAC
CAGTTCGAGA GCCAAACGCT GTCGTTGACG GATCAGTCGC TGATCTCCGG AGATCGGATC
TCGATCGTAA CTCTCAACGG CTCGTTCGAT CGGTCCGGAT CCCGGTCGAT GACCGTCGAT
CCTGAGCCGA TCAGCACCTC CGACCGGACG ATATCGGTGG CCGATGGAAC CGGCCCGATC
ACTCTCAACA TCACCACGCG ACTCTCGGTC GATCGATGGC AGACGCTGTT GAACGACGAG
GACAACGTCC TGACGGCCGA TATCCGAACT GCCCCCGATA CTGACTCGGT TCCGGATCCG
TTTCGACGCG TGCTGATCCC ATTGAACCGG AGCGAGACCT ACACTCTCGG GATGACGAAA
GTCGGGGTCG GCTCGGGAGT GAGTGAGGAA TCCGGTGAGT ACCTCACACT AGTTGATCGG
CCGAGTCGGT CGATAAGAAA TGGGTCGAGT CACTTCATCA CGCTTGAAGT TCGTGATCGA
TTCAACAATC CCGTATCGGG CTTTCCAGTG AACGCTATGG TGTCCGGAGG CGGGCAGTTC
GAAAGCGGCG GAACAAGCGC GTCAACCATC ACAAATCCCG CAGGGCAAGC AGTCTTCAAC
TACACTGCAC CGGCAGGCGA GCAATCACCA ACCCTCCAGT TCAATATTAC TGAGGGGACA
CCCCAAGCTT ACGAGGAAGT GTCGTTCGGT CTCGAAACGT ACCTGACTGG CGAGGGTGAA
AGTGGTAACG CAACAGCCTC GGTAGACGCG ACCTATGGAG GACTTTCAGA TTTCGCTTCG
CAGTATGATG GGGCCACGAA CAGTGCTGAA GAGCTTTTCA CACAAGCGAA CGGTAAGTGG
ACGAATGTCA ATCAGACTGG GAGTCTGAAC CTCAGCGCTG CCGAACCAAT CTTGCAAGAA
AGTACGGGGC ACGACCGACT TGATATCGCC TTCACGCTGC GGGGCGGGTC CACTCCACAG
TACAATTACT CCCTCGCGGT CACTGCCGAG ATCGCGGCTG ACGGCAGTAT CAACAGTCGA
TCGGTGTCAC TCCGAAACAC GACTGGGACG ACAAATTCCC AAAGCAGCGG CTCACTGTCC
GATAGTGCCG TTCGAAAATT GCTCGATGGC GGATCCGTCA ATCTCCTCGA TTCGGGCTCA
TATTTGAGCT CACCGTCGTG GCTATCGGAA ATTCAGCGGC TGGAGACGGC GTCTCCGATC
ACCTGGCTGA CGAACCGGAT GGAGGGACGC GTCAACGTGA CGTGGCAGAA CGCTCCGCCG
GTCGCGGATG CTGGTTCGCC GAGCGACATC GAGGAGGGAT CGAGCCAAAC ACTCGATGCG
AGTGCAAGCT CCGATCCCGA CTTGGATACG CCATTCACCT ACTCTTGGAC GGTTATCAGT
GGACCAGGGA GTATTTCAGG TACTGGTCCG ACCCCGACAT ACAACGCACC AGCAGACGTG
GGCAGCGACC AATCTGTCAC CATCGAGGTC ACGGTTACTG ACGCCGACGG CGATTCGAGT
ACGGATCAAG CCACCTTCAC CGTAGAGGAT GTCCCTGATA ATATCTCACC GACTGCTGAA
GCCGGAACAT ATGCAGACAT CGACGAGGGA GGGAGTATTA GTCTCGACGG ATCAAGTAGT
TCCGACGCGG ACGGGACTAT TTCGACCTAC GATTGGGCTG TCGCTACCGG GCCAGGCTCG
ATCTCCGACT CAGATAGCTC AACACCGGGT GCGACATACA ACGCACCGTC GGACGTGACA
AGCGACCAGT CCGTCACCGT CGAACTGACC GTCACGGATA ACGAGAGTGC CACCGACACC
GACACCGCGA CGTTCACCGT TCGCGATAAT GCGGAGCCGG CAGCGAATGC TGGTGGCCCG
TACACGGTAA ACGAAAGTGA TTCAGTGACC CTCGATGGGA CTGCATCATC AGACGATACC
GGGATTACGA GTTACTCTTG GGCAATTACG GACAACGCCT CTGCGGGGAG TCTTTCGAAC
AGTAACACGG CGTCGCCAGA ATTCATCGCC ACTTCGACAA GTGGGGGTAC GATTGTCACG
GTCGAACTTA CGGTGACGGA CGACGCTGGA CAGACCGATA CCGATACTGC TACGATCACG
ATCAACGACC CGCCCGTCGC GAACTTCACG TACTCGCCGA GTTCACCTGG ACCAGGCGAG
CAGATCTCCT TCGACGGATC AAGTTCGAGT GACTCTGACG GCAATATCAG CACGTACGAG
TGGGACTGGA CTGGCGATGG GACGACGGAC GATACCGGTC AGACTGCGAC CCACACATAC
GATAGCAGTG GGACCTACGA CGTGACGCTC CGGGTGACCG ACAATGATGG GGCTCAGAAC
ACGACGACCC GGACGATCTC AGTCACGCAA GCAAATATTC TACAGAACGT GGAAGCAGAA
GCCCAGACCT CAGGAAACAG CGGAAAAGCG TCTTTCGTGC TTAGCAATAA CGGATCCGCT
ACCGTGACCG TGTCTGGTAT ACTCTTCAAT AGTTCCAATA GTTCGACGTA TGTGGAAGCA
AACAATCAGA ATACGCCGGA GATTGAGGAC GATAAGGCAT CAACGCCAAG TCCTATCCTC
AATGTCCCCG GACAAATGAA CTTCAACCAA CAGTACTCGT TCAATAACTC CGTTCCGATT
GATCCGGGGG AGTCGATTCA ATTCAACATT GATCGGTTCA GAAATGATGG GAACATGCAA
AGTGCAACAC TGACCTTTAC CATCTACCTA AACGACGGAA GTCAGGAAAC CTACGACATC
ACATTCTCTA ATTAA
 
Protein sequence
MSFWGDERAQ AIQIGAVLLF AVLVISFSLY QAFVVPNQNA GIELNHNSEV QSQLQELRNA 
IHGIAGGNPG SGVTVNLGTT YPTRVVALNP PPPSGRLYTD GTRNPSVNFT IVNATADGET
GDLWNGSARP YNTGGIMYAP DYNEYRSPPT TIYENSLVYN QFESQTLSLT DQSLISGDRI
SIVTLNGSFD RSGSRSMTVD PEPISTSDRT ISVADGTGPI TLNITTRLSV DRWQTLLNDE
DNVLTADIRT APDTDSVPDP FRRVLIPLNR SETYTLGMTK VGVGSGVSEE SGEYLTLVDR
PSRSIRNGSS HFITLEVRDR FNNPVSGFPV NAMVSGGGQF ESGGTSASTI TNPAGQAVFN
YTAPAGEQSP TLQFNITEGT PQAYEEVSFG LETYLTGEGE SGNATASVDA TYGGLSDFAS
QYDGATNSAE ELFTQANGKW TNVNQTGSLN LSAAEPILQE STGHDRLDIA FTLRGGSTPQ
YNYSLAVTAE IAADGSINSR SVSLRNTTGT TNSQSSGSLS DSAVRKLLDG GSVNLLDSGS
YLSSPSWLSE IQRLETASPI TWLTNRMEGR VNVTWQNAPP VADAGSPSDI EEGSSQTLDA
SASSDPDLDT PFTYSWTVIS GPGSISGTGP TPTYNAPADV GSDQSVTIEV TVTDADGDSS
TDQATFTVED VPDNISPTAE AGTYADIDEG GSISLDGSSS SDADGTISTY DWAVATGPGS
ISDSDSSTPG ATYNAPSDVT SDQSVTVELT VTDNESATDT DTATFTVRDN AEPAANAGGP
YTVNESDSVT LDGTASSDDT GITSYSWAIT DNASAGSLSN SNTASPEFIA TSTSGGTIVT
VELTVTDDAG QTDTDTATIT INDPPVANFT YSPSSPGPGE QISFDGSSSS DSDGNISTYE
WDWTGDGTTD DTGQTATHTY DSSGTYDVTL RVTDNDGAQN TTTRTISVTQ ANILQNVEAE
AQTSGNSGKA SFVLSNNGSA TVTVSGILFN SSNSSTYVEA NNQNTPEIED DKASTPSPIL
NVPGQMNFNQ QYSFNNSVPI DPGESIQFNI DRFRNDGNMQ SATLTFTIYL NDGSQETYDI
TFSN