Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4375 |
Symbol | |
ID | 8745003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 637975 |
End bp | 640665 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646514914 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_003405861 |
Protein GI | 284167583 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0553395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCTCC ACGACAGATC CATCAGTCAA GATATTCACG ACCTCGGCGC GCTCCTGGGC AACATTCTCG AGGAGCAGAC GTCTCTCGAG GCGTTTGAGA CAGTCAAATC GTGCCGGCGG GCGGCGATCG ACTACCGATC GGGCGACCTC GACACCCGCG ACCCGCTGAT CGCGGAACTC GAAGGACTGT CGCCTCACAA TCAGCGGATC GTCGCGCGGG CGTTCACGAC CTACTTCGAA CTAATCAACC TCGCGGAGGA ACGGGAGCGG ATCCGAACGA TCCGAACGGG CTCTCACGAG CGGACCCTCG ACGACGGCCT CGAGGACGCG GCGGCGGAAC TCGGCGAGGC CGACGACGAG ACCGTCCAGC GGATTCTGGA CGATATCCTC ATCGAGCCGA CGTTCACGGC ACATCCGACC GAAGCCCGCC GGAAGACCGT CAAATCCAAA CTGAGGGATA TTTCGACGTC CCTCGAGACC CTGGACGAGC GGCTACTGAC CGAGAAGGAG GAGGAACAGG TCTGGCGGGA CATCGACGCC GAGGTGACGA GTCTCTGGCA GACCCCACAG GTGCGAAACC GCCAGCCCGA ACCCGAAGAC GAGGCGCGCA ACGTCCAGTG GTACCTCGAG AACACGCTGT TCGACGTGGT CGGCGAGGTC TACGACGAGC TCGACGATGC GATCGATTCG GAGCTCCCGC GGGACATCGA CGTTCCGAAG CTGTTCGAGT TCCGCTCGTG GGCGGGTAGC GATCGCGACG GCAATCCCTA CGTGACGCCC GAGGTCACGG CGAACGTCTT GGAGCGCCAG CGGGAGGTCG TCCTCGAGAA GTACCGTGAC GAGCTCAAGC GCCTGTCGGG CGTGTTGAGT CAGGACGGGG GTCGGATCGA TGCGGGAGGC GAGTTCGAGG CCTCACTCGA GGTGGACCGC GAGCGGCTGC CCAGCGTCGC CCGCACGGCC GAGGAGCGCT ACCCCGGCGA GCCCTACCGC CAGAAGCTCA AGCTCATGCG CGAGCGGCTC CGTCGCGTCG GGGACGTCCG GCCGGGCGGC TACGACGACG TCGACGAACT CCTCGAGGAT TTGGACGTCA TCGCGACGAG TTTGCGAAAC AACGGCGGGG AGACCGTCGT CGAGGCCCAT GTCGACCCCA TCCGCCGACG GGTCGACACG TTCGGGTTCT CGCTCGCGAG CCTCGATCTG CGCGACCACC AGCAGAAACA CACCGACGCC ATAGCCGAGG CCCTCGAGCG AGAGGGGATC GACTACCGCG GCCTCTCGGA GGCCGAGCGC GTCGAATTCC TGACCGACGC CGTCCTGCAG GACGAGCCCG TGCTCGACCT CGGCGAGACC GACGGACTGA CCGACGACTC GACCCGCGTC CTCGAACTCT TCGACAGCCT CGCCGACTGG CAGACCGAGT ACGGCGTCGA GGCAATCGAC ACCTACTGTA TCTCGATGAC CGACGAACCC AGCCACGTCC TCGAGGTGCT GTTCTTGGCC GATCAAGCCG GCGTCGTCTC GCTGCCCGAA CACTCGGGGA TCGACATCGT CCCGCTGCTT GAGACCGAGT ACGCCCTCTC GGGAGCGCGA CGCATCATGG GGACGCTATT CGAGAACGAA GCCTACAGTC AGGCCCTCGA GGCCCGCGGG CGAACCCAGG AGATCATGCT GGGCTATTCG GACTCGAACA AGGAGAACGG CTTCCTGGCG GCCAACTGGT CGCTGCACAA GAACCAGCGC CGGCTGGGCG AGATCTGCGA CGACCACGAC GTGACGATGC GGCTGTTCCA CGGCCGCGGC GGTTCGATTT CCCGCGGCGG CGGTCCGATG AACGAGGCGC TGCTGGCCCT GCCGAACAAC ACGATCACCG GCCAGGTCAA GTTCACCGAG CAAGGGGAAG CGATCGCCGA AAAGTACGGC AATCCCCGCA TCGCCGAACG CAACATCGAG CAGATGCTCA ACGCCCAACT CCGGGCGCGA AAGCAGGCGA TCGACCAGCC CGAGGAGGAC GTTCACGCAG AGTGGATCGA CGCCATGGAA ACGATGGCCG ACGCCGCCCG ACAGGAGTAC CGCGACCTCC TCGAGAGCGA TGGCTTCGTC CGGTACTTCG AGCAGGCGAC GCCGATCACG GTCATCGAGG ACCTGAACCT GGGCTCGCGT CCGGCCTCCC GCAGCGGCGA GCGGACCGTC GAGGACCTGC GGGCGATCCC GTGGGTGTTC TCCTGGACCC AGTCGCGGTG TATCCTCCCG GGCTGGTACG CCCTCGCGAC CGGTATCGAA GCGTACTTAG ACAACGGTGG CTCGATGGAT ACCCTTCAGG AGATGTACGA CGAGTGGCCG TTCTTCCGGA CGACGCTGGA CAACGCCGCC CTCTCGTTGT CCCGCACAGA ACTCGAGATC GCCGAACAGT ACGCGGGGAT GGCCGATGCG GCGCTCCGCG ATCGGTTCTT CCCGCGCGTG ACCGACGAGT ACGAGCGGGC GACCGAACTG ATCACGGAAA TCGGACAACG CGATCGCCTC CACACTCGCG ACTGGCTCGG CGAGAATCTG GAGCAACGGA ACCCCTACGT CGATCCGCTG AACATGCTGC AGGTACATCT GCTCGACAGG AACCACCGCA CGAATATCGA GGAGCGGACG CTCCGACTGA CGGTCAAAGG GATCGCCGCC GGGATGAAAA ATACGGGCTA A
|
Protein sequence | MELHDRSISQ DIHDLGALLG NILEEQTSLE AFETVKSCRR AAIDYRSGDL DTRDPLIAEL EGLSPHNQRI VARAFTTYFE LINLAEERER IRTIRTGSHE RTLDDGLEDA AAELGEADDE TVQRILDDIL IEPTFTAHPT EARRKTVKSK LRDISTSLET LDERLLTEKE EEQVWRDIDA EVTSLWQTPQ VRNRQPEPED EARNVQWYLE NTLFDVVGEV YDELDDAIDS ELPRDIDVPK LFEFRSWAGS DRDGNPYVTP EVTANVLERQ REVVLEKYRD ELKRLSGVLS QDGGRIDAGG EFEASLEVDR ERLPSVARTA EERYPGEPYR QKLKLMRERL RRVGDVRPGG YDDVDELLED LDVIATSLRN NGGETVVEAH VDPIRRRVDT FGFSLASLDL RDHQQKHTDA IAEALEREGI DYRGLSEAER VEFLTDAVLQ DEPVLDLGET DGLTDDSTRV LELFDSLADW QTEYGVEAID TYCISMTDEP SHVLEVLFLA DQAGVVSLPE HSGIDIVPLL ETEYALSGAR RIMGTLFENE AYSQALEARG RTQEIMLGYS DSNKENGFLA ANWSLHKNQR RLGEICDDHD VTMRLFHGRG GSISRGGGPM NEALLALPNN TITGQVKFTE QGEAIAEKYG NPRIAERNIE QMLNAQLRAR KQAIDQPEED VHAEWIDAME TMADAARQEY RDLLESDGFV RYFEQATPIT VIEDLNLGSR PASRSGERTV EDLRAIPWVF SWTQSRCILP GWYALATGIE AYLDNGGSMD TLQEMYDEWP FFRTTLDNAA LSLSRTELEI AEQYAGMADA ALRDRFFPRV TDEYERATEL ITEIGQRDRL HTRDWLGENL EQRNPYVDPL NMLQVHLLDR NHRTNIEERT LRLTVKGIAA GMKNTG
|
| |