Gene Information |
Locus tag | Huta_2147 |
Symbol | |
ID | 8384441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2192235 |
End bp | 2196905 |
Gene Length | 4671 bp |
Protein Length | 1556 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644973216 |
Product | PKD domain containing protein |
Protein accession | YP_003131047 |
Protein GI | 257053214 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
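
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the record does not state either convention, but both agree with the values shown. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS span includes one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Huta_2147 record.
gene_len, prot_len = cds_lengths(2192235, 2196905)
print(gene_len, prot_len)  # 4671 1556
```

Under these assumptions the computed values match the reported Gene Length (4671 bp) and Protein Length (1556 aa).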
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.927191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAAAG ATGATACATT TCTCTCTGAT GACTTTGAGG AGTATGCTAT TGGTTCGTTC CCTGGCCGAT GGCAACAGAG TGGAAACAGC GATCAAAAAA TCGTTGATGA ACCCGTCACA AGCGGCGACA AAGCGCTTCA GCTTGTAGGT AGTTACGGCA GTTGTTGGCA GGCATTAGCG CACCGGGAAC TTGGTAGCTC TTTGCCTAAG GACGAGTCAG TCATCGTCCG CGGAGAGATC GATCCGACTG GTGAGGGAGG AGGCGGCTGT CATGGCACAA AGAATGGAAG CATTGACATA CGAGAGAATC CATCGGCTTG GCCAAATGTC GGATTCAAAA GAAAGCTGAT TCAATTCGAT GCAGAGGGGT CCGTTCGAGG CGGCGGTATT GATCTCGGAT CATGTGAGAT TGGTGTGTAT AATAGTTTCA AAATTGAATA CTACTGGGAT TCAAACGGAG AAGAAGTGAA TCTTCAGTAC CAAATCAATG GCGAATTCAG AGGGGAGACG ACTGTTGATG TCCAGACCAA TGATAGTGGG GATGTTGTTG AATCCAAGCT TCAATACCTC ACTGTAGTTA GTGGTGACTA TAAAATACAT GTGGACCGCC TTTCAATCGT TTCTGGCTCA GATATTATAA CACGTCCAGA TCTCAATTCA GAAATTTCCG TTTCGACCGC TACTACGTCT CCTGGCAACC AGATTACCTT TGATGCGAGT AAGTCGACGG GCCAAATTGA TACGTATAAA TGGGAGTTCA GCGATGATAC TTCAGCCACA GGATTAACCG TCACCCGTAG CTTCGATACT ACGGGAATAT ATTCTATCAC TTTGACTACC GTAGATGGTG ATGAGACGGA TACGACGACA TCAGAGGTTG CCGTTGTCAG GGAAACGGAA CTCAATTTCT CGGCCCAACC AGCACAGCCG GTAACCGGAC AATCCGTCAC ATTCGAAGGA ACCGGAGGCA ATTCATACAC TTGGGATTTC GGTGACGGGA GAGGGGCAAC TGGTCAGCAA GTCACCCACA CCTATGACGA GTCTGGCGAC TATCGCGTGA CACTTGCGAC AGAGCAGGAT ACAGTCACTC GTACTATCTC GGTGAATTCA GCGAACATAG AGATAGCCAA TCTCGATCGA GATATCGGCG GAACATTGCT TCCGAAACTT GGTATCGATG AGGGTGTCGA AGCGACCATT AACACGGCTG ATGGACAGGA TATCGATCGT GTGGAATTCT ATTTTGCTGG TCAAAAAGCC ATCTCCAAGA GTCCGCCATA CGATGCGTCC CTCACGATCG ACGACATCGA ACCACCTGCC ACTCCGCTCA CGGTCCGTGC GGTCACGACT GATGGAATGC AACAAGACTT TCAACAGGAC ATTCCTATTC AGCGACTCCC TGACTGGCTT GAATTCTTGC TGAAGTCTAC GGACACACTC GGCGTTGCAC TGACCGACGA AGAAATTGAG ATCACATATA CTCCCCTATC GAATCTCGCC CCGGGGATTG ACATTCCAAA TGATCTTCTT GGAGGAGATG AATCTCCCCG CGAGAACGGT GACGGCGATT ATGATTTCGG CGTCGCGATG GGCGGTATCT ACGATCCCCG GACCAACGAG GCCGAACTGA CTGCCGCGGG GAACATCACT GCCGAAGTGA TGGCACTCGC ATTCGAAATC AAGGTCGCAT TGATTGGGGA AATCGAAGCA ACGACGCTTG AACTGCAGAG TGCGCAAGCA GATATTGACA GCCTCCTATC ATTTGATGTC GCACCACCCA CAATACCCGT TCCAGTCAGC 
ATTCCGATTC CCGGGACTGA CAGTGCTATC GGTATCGTGC CAACAATCCT CATTTCCGCC GACGGAACGT TCGATTTCAA TGCAGACCTT TCGTTCGATA CTGGCACAGT GGAACCAGGT GTCGAACTTC AAGTGTCGAT CGGTATCGCG CTTGAGCTCC CTGCCCTTCC TGACGGGGAA CTCAAAGGCG TTCCGAGTGG TGGAATTGAT GGTCAATTCG ACGTTGGCAC TGACGACTGG AACATTCAGG CCACATTTTT CCTCGCGGGA AAAGTCGTCC TTGATCCACC GTTCCTTCCC GGAATCACAC TGGAGCAAGA TCCGATCTGG GATCAGCCAC TCACAGGGAG TACTACAGAA CAAGTAACAG CGACCACAGT GTCTCGACCA CGTGTCTGTC ACAAGTCTGC GGGAGGCCCG CAACCGCTCC CAGAGATCGA TAGTGTGGAC ACTGTTTCCG TTGATACTGA CACAATTTCA CCGCGGGAGT CGTTCCGTCT CACTGACCGG CCTTACGAGG ACATCGAACC GGTACTCGCA AGCCCGGCAG AGAATACACA AGTCGTGATC TGGGGGCAAC AGGCCGAGAA CAAACCAGCT GATGCAGGTC ACGATCTTGT CGGCCGGTGG TACGAGAACG GTTCCTGGAG CAAGACGTTT GCAATTACCG ACGACACATA TGCAAATGCC AGTCCAGTTT GTGCAGCGAT GTCGACTGGT GAGATTCTTC TCGCCTGGAA ACGCCTACCC GAAGACCTCA CGGCAAGTGG AGAGCAACTG GACACGCTTG CTGCGGTCAA CAACACACGA GACAAAGCGG AAATTGCCTA CAGCATTTAC GACGGGAATT CGTGGTCGGA TCCGACGCTG CTCACAAGCA CGAGCGTTAC CCAACGACGT CCAACGGTAG CCAAAGCGGA CGGTGAATGG CATCTAGCCT GGGAGTCGTT CGACCGCGAG ACGGGTTCGA CGACTGTTCG CTCGACAGTT GTCTCCCCAG ACGGAACGAC AGCGCCCATC TCAGAGTGGT CCGGGGCTGC AAGCCCGGAT CTGGGACGGC GCAACGATGG CACTGTCGAT CTCGCATATC TTGTCTGGGA TGGAGCGCAG GTCACGGGTG TAACTCACAC AATCCGTGAT GGAACGGCGA CTATGTCCGA CCAAACCTAT AGTGCGACGG CTGCTGATAC GGTCGTCGTT TCGAATGGAC GGGTCCTTTG GGCGACGAAT ACAAACCGAG ACCCGACCTT GGTCGAAGGA ACAGACGGGA CGACAACCGA ACTGTCTCTT CGTGAGGATG TTGCGGAGGT CCGGGAACTG TCTTTTTCGT CGCGGTCAGA CGAGGCGATA CTCTCCTATA TCTCGACGCT GGAAGGAAAA GACACTCGCG ATCAAGTGTA TCGACTCGAT CGTGGCAATG GCTGGATTTT TGATCGGAGA ATCACCGAGA ATATCAAAGA TGGGCTTCGG GTTCGGTACA GTGATCTAGT TTTTGCCGGG AGCGCTTCGT TCCTCTCGGC CTACGCTGTC CGTGATACAG GGACTGATGC GGTGAGTGAC GTGTTTGCGA CCCTTCAGGA GTTCGGTCCT GCATACGCCA TTGATGGATC GATCGATAGT GGTACAGCGG GCGGAGAAAC GACATTGTCC TACACGCTGG AGAACCGCGG CGACGTTGAC GGGGCAGAAC AGGTCACTGT TAGAATACTC CGTGATGGAA CAGAAATCAA GGCGATCACG CACAAGCCCT TGGAGTCCGG CGGAACGCTG GCTCGCAATT GGACAGTCAC GGTCGGTGAT GGGGGCGAGT 
TCGAACTCCA GTTGGATGTG CCGGAACCGT CGCTTGAGAC AGAACCACAC AGTGTCGAGT TGATTGCTGC GACTGTCCAG CTACGTGTTG ATACAGTGGC TGCGACGCGG ATCGGACCGA GCGAAGCGAC AATCGCCGTG ACAGTCACGA ACCATGGCGG TGCGGTCGCT GAGAACGTAC CTGTGGAACT GAGCGATGCC GCTGGTCCGG TAGGCACACC AATGCTCGAT CGAATCGAGC CAGAAACGAC CACCACTGTC GAGACTGTCA TTGACCCAGT GTCGCTGGAC AACTCCGACA CCCATACTGT TCGACTTGAT CCAAATGAGA GACTCCCGCC ACAGGCAGAA ACAACTTCGC TACGCCGAAC GTACTTGGTT CGTCCAGACC TTCGTGTTGA GGATATCCGC TACCGAGAGG ACAATGATCG ATTCGTTCGA GTGCTCGTGT CGAACCATGG TCCAGGCGAG GGAACAGCGA GCCTTACGAT TAGAGACGGC ACCGACACGG AACTGGCGAC GACAGATATC TCATTGCCGC CGGCGAAGAC AGAAAACGGG AGTACTGTTG CAGTGCATCG CGCCATTGAT TTACAGGTTC CAACCCTGGA ACCGGCGCAA ACGGTTGCCA TCGAGGCGGA GCCAGACGTA AGTAATCTGC ACCAGGAGAC ACTTTCCCGC GTCGAGACAG CCGAGCCGAT TTTGCCGGGA GAGTACAGAG GCGATGTTGG TGATCTCACT GTGAATGCCT CTAATGAGAC CATTCCAGTC GGCGGAGAAG CATCAATCGA TGTTTCTGCG GACAATGTCG GGCAACTTAT AATCGAAAAT ATCTGGACGG ACTGGACAGT GTCTGTCGAT GTACCAAGCG ACATCGTCGA GAAATCTGTC GAAGCCAACG GCAGAGTGAC ACTCACTTGG CCAAGTACGA AGAGTGCTGT TTCGCCAACG CTGGCAATAT CAGTACCTGA CCGTTACATC GGTGGGACAT ACGAAGTGAA CCTCACTGCA ACAAACGCAA GCGATGTCGC CGAAACCACT TCGATGCTCG TCATAAAATA A
|
Protein sequence | MAKDDTFLSD DFEEYAIGSF PGRWQQSGNS DQKIVDEPVT SGDKALQLVG SYGSCWQALA HRELGSSLPK DESVIVRGEI DPTGEGGGGC HGTKNGSIDI RENPSAWPNV GFKRKLIQFD AEGSVRGGGI DLGSCEIGVY NSFKIEYYWD SNGEEVNLQY QINGEFRGET TVDVQTNDSG DVVESKLQYL TVVSGDYKIH VDRLSIVSGS DIITRPDLNS EISVSTATTS PGNQITFDAS KSTGQIDTYK WEFSDDTSAT GLTVTRSFDT TGIYSITLTT VDGDETDTTT SEVAVVRETE LNFSAQPAQP VTGQSVTFEG TGGNSYTWDF GDGRGATGQQ VTHTYDESGD YRVTLATEQD TVTRTISVNS ANIEIANLDR DIGGTLLPKL GIDEGVEATI NTADGQDIDR VEFYFAGQKA ISKSPPYDAS LTIDDIEPPA TPLTVRAVTT DGMQQDFQQD IPIQRLPDWL EFLLKSTDTL GVALTDEEIE ITYTPLSNLA PGIDIPNDLL GGDESPRENG DGDYDFGVAM GGIYDPRTNE AELTAAGNIT AEVMALAFEI KVALIGEIEA TTLELQSAQA DIDSLLSFDV APPTIPVPVS IPIPGTDSAI GIVPTILISA DGTFDFNADL SFDTGTVEPG VELQVSIGIA LELPALPDGE LKGVPSGGID GQFDVGTDDW NIQATFFLAG KVVLDPPFLP GITLEQDPIW DQPLTGSTTE QVTATTVSRP RVCHKSAGGP QPLPEIDSVD TVSVDTDTIS PRESFRLTDR PYEDIEPVLA SPAENTQVVI WGQQAENKPA DAGHDLVGRW YENGSWSKTF AITDDTYANA SPVCAAMSTG EILLAWKRLP EDLTASGEQL DTLAAVNNTR DKAEIAYSIY DGNSWSDPTL LTSTSVTQRR PTVAKADGEW HLAWESFDRE TGSTTVRSTV VSPDGTTAPI SEWSGAASPD LGRRNDGTVD LAYLVWDGAQ VTGVTHTIRD GTATMSDQTY SATAADTVVV SNGRVLWATN TNRDPTLVEG TDGTTTELSL REDVAEVREL SFSSRSDEAI LSYISTLEGK DTRDQVYRLD RGNGWIFDRR ITENIKDGLR VRYSDLVFAG SASFLSAYAV RDTGTDAVSD VFATLQEFGP AYAIDGSIDS GTAGGETTLS YTLENRGDVD GAEQVTVRIL RDGTEIKAIT HKPLESGGTL ARNWTVTVGD GGEFELQLDV PEPSLETEPH SVELIAATVQ LRVDTVAATR IGPSEATIAV TVTNHGGAVA ENVPVELSDA AGPVGTPMLD RIEPETTTTV ETVIDPVSLD NSDTHTVRLD PNERLPPQAE TTSLRRTYLV RPDLRVEDIR YREDNDRFVR VLVSNHGPGE GTASLTIRDG TDTELATTDI SLPPAKTENG STVAVHRAID LQVPTLEPAQ TVAIEAEPDV SNLHQETLSR VETAEPILPG EYRGDVGDLT VNASNETIPV GGEASIDVSA DNVGQLIIEN IWTDWTVSVD VPSDIVEKSV EANGRVTLTW PSTKSAVSPT LAISVPDRYI GGTYEVNLTA TNASDVAETT SMLVIK