Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5045 |
Symbol | |
ID | 8745912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | - |
Start bp | 33641 |
End bp | 36922 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 646515658 |
Product | KAP P-loop domain protein |
Protein accession | YP_003406605 |
Protein GI | 284176329 |
COG category | [R] General function prediction only |
COG ID | [COG4928] Predicted P-loop ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCCTG AGAATGAAGA TCTCTACTTA GCGGATCGAG CTATTACCTC AGAGACCGAT GATGAATTTG GACACAAAGA GTACGCAGAC AGTTTAGAGA GGATTCTACG AAATGCTAAT CCACCGTGGC ACATCGGTAT CTTCGGAGAG TGGGGTTCCG GTAAAACCTC TATAATCCGG ATGTTGTATA ATCGTCTCAG GGCAAAAGAA GAGTTTGAAG ATACTGTTTT TGTGGAATTT GATGCTTGGT CTCACGCGGA AGGATCCATC CGAACTGAGC TACTTCTTGA ACTAGATGAA AGAATAGGTA AGGAGGTAAA CGGAGAAGAA TCAGATGGCA TTCTTGGCGA AGATGAGATC ACAGGCAGGC TATACGACGT GGAAGAGGAA GAAGAGGTTT CGAGGCCTGC TAACCCAAAA GAAGTTATAG TGAATTTCTG GGAAGATAGT CCAATACTCA CAATCAGTTT TGTCGTTATT GCCGGTGCTG CTGTACTACT ACAATATCTT GGCCAGTCGA CACTGGCATC AGTGGCTGTT ACTGGATTAC TACTCCCGAT TCTCGGGTAT ATTCTCCAGC AGTTGGATAC GGTGGCTCAG ACGATTCAAA AAAAGTTTTT GCATCCAAGA AAAGAGTGGT CTGGTGCGTA CCAGCGAATA TTCGAGGATA TAATCGAAAA ATCGAACGCA GAAACCATCG TAATCTCTAT CGATAATTTA GATAGGTGCG AGAGTTCGAC CGTCTACGAC GTTCTTGTGT CTCTCAAGAC GTTTATGGAA AATGATAATT GTATATATCT TATTCCCTGT GATGACAAAG CCCTAGAGAG CCATATCAAG TCAATTGATA AAGGTGAATA CTTTGAGGAA GGGAAGAATG AACGAGAGTT TCTCCGGAAG TTCTTCCAAA CCCACATCCG GATTCCACCG TTCTTGCCGG AGGATATTGA GGAATATACT GAAACGGAGA ATCAGAGGCT TTCCAATCCA TTCGAAGAGG AAGCTCTCGA CGTACTCACT AATGCATACA TCGACAATCC ACGGAGAATC AAGCATGCAC TCAACCGTCT TTCAACACTC AGAGTTTTAG CAGAGGAGAT CGAGGAAACC GGTGGGTTAA CTGAGGGTAG AATCACCAAT AATATCCCAT TCTTAGCGAA GGTATCCATT TTAGAAGAGG AATACCCAGA GTTCTATGAC ACACTCTCGG AGAATCCTCA GTTGCTGGGA GATATTAACA ACTACTTCCG AGACCAGCTT GGTGATTCAA GCGACCAGAA CCGAATCGAA GCGGTTCTGA AACCAGATGA TGGTTCAGAG AAGAACGGCA ATCAAGAATC TCGTCTAGAG GCGTTTCTCC GCTCTACTCG CCGTATCCAT ATTGAGAATC CAAATCCGTT TCTGAACTTG TCTGAACCGT CTTATGCAAC CAGTCTTACC GATGTCGACG GCTTCCTGCA AAATCTTAGG ACTGGGCAGG AAGAAGAGGT TCGGAAAGAA CTTGAACAGG TGGATGACTA TGGTCCATAT GTTGACGCAA TTGAGAATAC GGTCCAAGAG TACACTACAA AGAGGAGGGA ACAACCACTG TTCGGGACTA TAGACACCAC AATCGCGGCC TTCGACGAAT TTGATCAGCG TTCACAAGGG AGATTGATCC GACTCCTTGG GGAGTCTCTA ACCGTCGAAC CTGGGAAGGG ATTCTTGGTT GATCTGGAAC CTAATTCAGT GTTCCCAATT ATTATTCAAA TGCGCGACCC CGACTCAGAG TCACTACTGG CAGAGTATGC TGACCTTCTA GTTGATGGCA AGGACCTGCG CAAGAATATA TTAGAAGCCT TTGTGAACTA TGCTGAAGAC ATTCCGCAGA GCATTGCTCG TAAGGCGTCT AACCAAGTCG AGTCACTCGG AGATGACAAA TTCAAGACCG CACTTGACTG TATAAACTCT TTTGAGGAAG CAAAGAACTG CATCATTCGT ACAAAGACGT TGGAACGGGC CGTATCGATG ATAGAACTCC AAGATAACAA GAATGAGTTC ACAGATACAG AGTACTATAC TCGTTTTGAG GATGTGGCAA GTCAGAAAGC AAGAGGAGCA TACGCAACGC ACCTTCTTGA TCTGAGACAG GATTATTCAA GCAACCAAGA ACAGCAAGTC GATCAAGAGC TAGCAGTCCG GTTGCTAGAT ATTAAGGGAC ATATTCTTCA ATCGAGCGGG GAGAAGGTTT TCGAAGGTAT TCGTGACATC GTGAAAAATA GCGGAAATAG ACAGCCCGTC GATCTTGTCG AGATTTGTTT CCATTTCTAC GATTCGTTCC CTACCACGAC ACAAGACGAG TTTCATAGTT GGTTAAGCGA ACTATTCCGT AACTGGAATG CCAATAATAC GGAGGAACTA TTCGATCTAA GTGACGAGTA TCGGGTCTCT ATCGTAGAGA CAGAGGAAGA AGTTAATTCA ATTCTACAGC AGATCCCGAA CCCGATAGAC AACGAATCCT TTGTCAAGAA TAAGATTATA CCTGTCATTC CCGAGGAGTT CAGCCAAGAT CTCCATGAGA AGGTGAAGAA TCTTATAAAG AACAACAATA ATAATCACAG TCAGCTTGGA ATCGACACCT TCACCGAATA TTCAGACCGG TTCGAGCCTA TCTGGGGGGA AGTTATTGAC ATTTGTGGTA ATCAAGCCAA CCGCGAAAAT AACATTAATC GAAAGAGGCG CTACTTGGAG GTTGGAGCAA AGATCTTCTC GAAGCTTAAT GGGCCGGAAC AGGAGAGCTA CATCAGTCAG ATAGATAATC TACTCAGTGG TAACCAGAAC GAGTACCAGT TGTACCGAGA TCTCTGGGAG AAGATAGAGT CTGACGTTGA CTCAGACAGG AAGGAGACCG TAGCAGAAGA CGTTCGCAAT GAGCTGGCGC AAGCTTTCAG TCGGAATCAG AATCCTGGAC ATCTTGATGC TTTGATCGAG GTGATGAGTT CTATTACAGG ATATCTGGTA GAGGAGAACG GTCAGCAGTT CATGGACCGC ATTAGCAAGC AGCTCACACA GAACAATTTG AATGACCGTC AAAAGGCCAG GGTATTAGGC CATATATCTG AGTTTGATGG GTTCTTCGGA AAGGAGGATC AGATTCTGGA CCGGCTAGAG AATCTCTTGA AGCGGTCCAA CCACAATAAT GTCTCACAGA ACGCTGAGGT GCTGCTTGAT AAATTTGAAA ACCTCGGAAA AGTTGATGAA GCAAGAATCG ACCAGATTAG AGAAGATTAC CTTTTAAATT GA
|
Protein sequence | MPPENEDLYL ADRAITSETD DEFGHKEYAD SLERILRNAN PPWHIGIFGE WGSGKTSIIR MLYNRLRAKE EFEDTVFVEF DAWSHAEGSI RTELLLELDE RIGKEVNGEE SDGILGEDEI TGRLYDVEEE EEVSRPANPK EVIVNFWEDS PILTISFVVI AGAAVLLQYL GQSTLASVAV TGLLLPILGY ILQQLDTVAQ TIQKKFLHPR KEWSGAYQRI FEDIIEKSNA ETIVISIDNL DRCESSTVYD VLVSLKTFME NDNCIYLIPC DDKALESHIK SIDKGEYFEE GKNEREFLRK FFQTHIRIPP FLPEDIEEYT ETENQRLSNP FEEEALDVLT NAYIDNPRRI KHALNRLSTL RVLAEEIEET GGLTEGRITN NIPFLAKVSI LEEEYPEFYD TLSENPQLLG DINNYFRDQL GDSSDQNRIE AVLKPDDGSE KNGNQESRLE AFLRSTRRIH IENPNPFLNL SEPSYATSLT DVDGFLQNLR TGQEEEVRKE LEQVDDYGPY VDAIENTVQE YTTKRREQPL FGTIDTTIAA FDEFDQRSQG RLIRLLGESL TVEPGKGFLV DLEPNSVFPI IIQMRDPDSE SLLAEYADLL VDGKDLRKNI LEAFVNYAED IPQSIARKAS NQVESLGDDK FKTALDCINS FEEAKNCIIR TKTLERAVSM IELQDNKNEF TDTEYYTRFE DVASQKARGA YATHLLDLRQ DYSSNQEQQV DQELAVRLLD IKGHILQSSG EKVFEGIRDI VKNSGNRQPV DLVEICFHFY DSFPTTTQDE FHSWLSELFR NWNANNTEEL FDLSDEYRVS IVETEEEVNS ILQQIPNPID NESFVKNKII PVIPEEFSQD LHEKVKNLIK NNNNNHSQLG IDTFTEYSDR FEPIWGEVID ICGNQANREN NINRKRRYLE VGAKIFSKLN GPEQESYISQ IDNLLSGNQN EYQLYRDLWE KIESDVDSDR KETVAEDVRN ELAQAFSRNQ NPGHLDALIE VMSSITGYLV EENGQQFMDR ISKQLTQNNL NDRQKARVLG HISEFDGFFG KEDQILDRLE NLLKRSNHNN VSQNAEVLLD KFENLGKVDE ARIDQIREDY LLN
|
| |