Gene Htur_5045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5045 
Symbol 
ID8745912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013748 
Strand
Start bp33641 
End bp36922 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content45% 
IMG OID646515658 
ProductKAP P-loop domain protein 
Protein accessionYP_003406605 
Protein GI284176329 
COG category[R] General function prediction only 
COG ID[COG4928] Predicted P-loop ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCCTG AGAATGAAGA TCTCTACTTA GCGGATCGAG CTATTACCTC AGAGACCGAT 
GATGAATTTG GACACAAAGA GTACGCAGAC AGTTTAGAGA GGATTCTACG AAATGCTAAT
CCACCGTGGC ACATCGGTAT CTTCGGAGAG TGGGGTTCCG GTAAAACCTC TATAATCCGG
ATGTTGTATA ATCGTCTCAG GGCAAAAGAA GAGTTTGAAG ATACTGTTTT TGTGGAATTT
GATGCTTGGT CTCACGCGGA AGGATCCATC CGAACTGAGC TACTTCTTGA ACTAGATGAA
AGAATAGGTA AGGAGGTAAA CGGAGAAGAA TCAGATGGCA TTCTTGGCGA AGATGAGATC
ACAGGCAGGC TATACGACGT GGAAGAGGAA GAAGAGGTTT CGAGGCCTGC TAACCCAAAA
GAAGTTATAG TGAATTTCTG GGAAGATAGT CCAATACTCA CAATCAGTTT TGTCGTTATT
GCCGGTGCTG CTGTACTACT ACAATATCTT GGCCAGTCGA CACTGGCATC AGTGGCTGTT
ACTGGATTAC TACTCCCGAT TCTCGGGTAT ATTCTCCAGC AGTTGGATAC GGTGGCTCAG
ACGATTCAAA AAAAGTTTTT GCATCCAAGA AAAGAGTGGT CTGGTGCGTA CCAGCGAATA
TTCGAGGATA TAATCGAAAA ATCGAACGCA GAAACCATCG TAATCTCTAT CGATAATTTA
GATAGGTGCG AGAGTTCGAC CGTCTACGAC GTTCTTGTGT CTCTCAAGAC GTTTATGGAA
AATGATAATT GTATATATCT TATTCCCTGT GATGACAAAG CCCTAGAGAG CCATATCAAG
TCAATTGATA AAGGTGAATA CTTTGAGGAA GGGAAGAATG AACGAGAGTT TCTCCGGAAG
TTCTTCCAAA CCCACATCCG GATTCCACCG TTCTTGCCGG AGGATATTGA GGAATATACT
GAAACGGAGA ATCAGAGGCT TTCCAATCCA TTCGAAGAGG AAGCTCTCGA CGTACTCACT
AATGCATACA TCGACAATCC ACGGAGAATC AAGCATGCAC TCAACCGTCT TTCAACACTC
AGAGTTTTAG CAGAGGAGAT CGAGGAAACC GGTGGGTTAA CTGAGGGTAG AATCACCAAT
AATATCCCAT TCTTAGCGAA GGTATCCATT TTAGAAGAGG AATACCCAGA GTTCTATGAC
ACACTCTCGG AGAATCCTCA GTTGCTGGGA GATATTAACA ACTACTTCCG AGACCAGCTT
GGTGATTCAA GCGACCAGAA CCGAATCGAA GCGGTTCTGA AACCAGATGA TGGTTCAGAG
AAGAACGGCA ATCAAGAATC TCGTCTAGAG GCGTTTCTCC GCTCTACTCG CCGTATCCAT
ATTGAGAATC CAAATCCGTT TCTGAACTTG TCTGAACCGT CTTATGCAAC CAGTCTTACC
GATGTCGACG GCTTCCTGCA AAATCTTAGG ACTGGGCAGG AAGAAGAGGT TCGGAAAGAA
CTTGAACAGG TGGATGACTA TGGTCCATAT GTTGACGCAA TTGAGAATAC GGTCCAAGAG
TACACTACAA AGAGGAGGGA ACAACCACTG TTCGGGACTA TAGACACCAC AATCGCGGCC
TTCGACGAAT TTGATCAGCG TTCACAAGGG AGATTGATCC GACTCCTTGG GGAGTCTCTA
ACCGTCGAAC CTGGGAAGGG ATTCTTGGTT GATCTGGAAC CTAATTCAGT GTTCCCAATT
ATTATTCAAA TGCGCGACCC CGACTCAGAG TCACTACTGG CAGAGTATGC TGACCTTCTA
GTTGATGGCA AGGACCTGCG CAAGAATATA TTAGAAGCCT TTGTGAACTA TGCTGAAGAC
ATTCCGCAGA GCATTGCTCG TAAGGCGTCT AACCAAGTCG AGTCACTCGG AGATGACAAA
TTCAAGACCG CACTTGACTG TATAAACTCT TTTGAGGAAG CAAAGAACTG CATCATTCGT
ACAAAGACGT TGGAACGGGC CGTATCGATG ATAGAACTCC AAGATAACAA GAATGAGTTC
ACAGATACAG AGTACTATAC TCGTTTTGAG GATGTGGCAA GTCAGAAAGC AAGAGGAGCA
TACGCAACGC ACCTTCTTGA TCTGAGACAG GATTATTCAA GCAACCAAGA ACAGCAAGTC
GATCAAGAGC TAGCAGTCCG GTTGCTAGAT ATTAAGGGAC ATATTCTTCA ATCGAGCGGG
GAGAAGGTTT TCGAAGGTAT TCGTGACATC GTGAAAAATA GCGGAAATAG ACAGCCCGTC
GATCTTGTCG AGATTTGTTT CCATTTCTAC GATTCGTTCC CTACCACGAC ACAAGACGAG
TTTCATAGTT GGTTAAGCGA ACTATTCCGT AACTGGAATG CCAATAATAC GGAGGAACTA
TTCGATCTAA GTGACGAGTA TCGGGTCTCT ATCGTAGAGA CAGAGGAAGA AGTTAATTCA
ATTCTACAGC AGATCCCGAA CCCGATAGAC AACGAATCCT TTGTCAAGAA TAAGATTATA
CCTGTCATTC CCGAGGAGTT CAGCCAAGAT CTCCATGAGA AGGTGAAGAA TCTTATAAAG
AACAACAATA ATAATCACAG TCAGCTTGGA ATCGACACCT TCACCGAATA TTCAGACCGG
TTCGAGCCTA TCTGGGGGGA AGTTATTGAC ATTTGTGGTA ATCAAGCCAA CCGCGAAAAT
AACATTAATC GAAAGAGGCG CTACTTGGAG GTTGGAGCAA AGATCTTCTC GAAGCTTAAT
GGGCCGGAAC AGGAGAGCTA CATCAGTCAG ATAGATAATC TACTCAGTGG TAACCAGAAC
GAGTACCAGT TGTACCGAGA TCTCTGGGAG AAGATAGAGT CTGACGTTGA CTCAGACAGG
AAGGAGACCG TAGCAGAAGA CGTTCGCAAT GAGCTGGCGC AAGCTTTCAG TCGGAATCAG
AATCCTGGAC ATCTTGATGC TTTGATCGAG GTGATGAGTT CTATTACAGG ATATCTGGTA
GAGGAGAACG GTCAGCAGTT CATGGACCGC ATTAGCAAGC AGCTCACACA GAACAATTTG
AATGACCGTC AAAAGGCCAG GGTATTAGGC CATATATCTG AGTTTGATGG GTTCTTCGGA
AAGGAGGATC AGATTCTGGA CCGGCTAGAG AATCTCTTGA AGCGGTCCAA CCACAATAAT
GTCTCACAGA ACGCTGAGGT GCTGCTTGAT AAATTTGAAA ACCTCGGAAA AGTTGATGAA
GCAAGAATCG ACCAGATTAG AGAAGATTAC CTTTTAAATT GA
 
Protein sequence
MPPENEDLYL ADRAITSETD DEFGHKEYAD SLERILRNAN PPWHIGIFGE WGSGKTSIIR 
MLYNRLRAKE EFEDTVFVEF DAWSHAEGSI RTELLLELDE RIGKEVNGEE SDGILGEDEI
TGRLYDVEEE EEVSRPANPK EVIVNFWEDS PILTISFVVI AGAAVLLQYL GQSTLASVAV
TGLLLPILGY ILQQLDTVAQ TIQKKFLHPR KEWSGAYQRI FEDIIEKSNA ETIVISIDNL
DRCESSTVYD VLVSLKTFME NDNCIYLIPC DDKALESHIK SIDKGEYFEE GKNEREFLRK
FFQTHIRIPP FLPEDIEEYT ETENQRLSNP FEEEALDVLT NAYIDNPRRI KHALNRLSTL
RVLAEEIEET GGLTEGRITN NIPFLAKVSI LEEEYPEFYD TLSENPQLLG DINNYFRDQL
GDSSDQNRIE AVLKPDDGSE KNGNQESRLE AFLRSTRRIH IENPNPFLNL SEPSYATSLT
DVDGFLQNLR TGQEEEVRKE LEQVDDYGPY VDAIENTVQE YTTKRREQPL FGTIDTTIAA
FDEFDQRSQG RLIRLLGESL TVEPGKGFLV DLEPNSVFPI IIQMRDPDSE SLLAEYADLL
VDGKDLRKNI LEAFVNYAED IPQSIARKAS NQVESLGDDK FKTALDCINS FEEAKNCIIR
TKTLERAVSM IELQDNKNEF TDTEYYTRFE DVASQKARGA YATHLLDLRQ DYSSNQEQQV
DQELAVRLLD IKGHILQSSG EKVFEGIRDI VKNSGNRQPV DLVEICFHFY DSFPTTTQDE
FHSWLSELFR NWNANNTEEL FDLSDEYRVS IVETEEEVNS ILQQIPNPID NESFVKNKII
PVIPEEFSQD LHEKVKNLIK NNNNNHSQLG IDTFTEYSDR FEPIWGEVID ICGNQANREN
NINRKRRYLE VGAKIFSKLN GPEQESYISQ IDNLLSGNQN EYQLYRDLWE KIESDVDSDR
KETVAEDVRN ELAQAFSRNQ NPGHLDALIE VMSSITGYLV EENGQQFMDR ISKQLTQNNL
NDRQKARVLG HISEFDGFFG KEDQILDRLE NLLKRSNHNN VSQNAEVLLD KFENLGKVDE
ARIDQIREDY LLN