Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4459 |
Symbol | |
ID | 8745088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 45183 |
End bp | 46883 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646514996 |
Product | thiamine pyrophosphate protein domain protein TPP-binding protein |
Protein accession | YP_003405943 |
Protein GI | 284172561 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.325037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCGA CCGACTCCGA GGGCGGCCCG CCGACCGCGG CCGGAAGCGA ACAGCTCTAC GACGCGCTGG TCGACGCCGG GATCGACCTC CTCGTCGGAC TTCCCGGAAC GCAGACGCTG CCGCTGGATC GAACCGTCGA CCGGCGGGAC GACATCCGGT ACGTGATGGC CCGCCACGAG ACCGCGATCC CTCACATCGC CTGGGGGCAC TATGAGGCCG GCGGCGACGT CGCCGCGACG CTGACCGTTC CCGGCCCCGG CGATACGAAC GCGATGCACG GGCTGAAGAA CGCGCTCGAC GACCGCGTTC CCCTGGTCCA CATCGCCGCG GACGCCGATC CCGAGGACCG CGGGAAGGGA CCGATCCACG AGATCGAACC CGACACCTAC GACAACGTGG TCAAGGAGAA CTACTCGATC GAGCGCCCGG TCGAGTTCCA CCGAGCGATC CGGTCGGGAA TCGAAACCGC GCTGACGCCG CCGTGCGGCC CGGTCCGACT CGGCGTCCCG AAGCCGCTCC TCGACGCCGA GTTCTGCTCG CCGCCGGTGA CGGTCGATCC GCCGACGAGC CGGTTTGACG GTGACGCGGA GTACGAGACC GCGCGGCAAC TACTCGCCGA CGCCGAACGA CCGGTCGTCT ACCTCGGCGT CGGCGCCCGT CGGACGGGTG ATCCGGACGC CGTTCGGGCC CTCGTCGAGA CGCTCGACGC TGCCGTCGTC GCCTCCTACA AGGGCAAGGG CGTCTTCCCC GAGGGCGATC CGCGCTGGCT CGGCGTGACG GCCAGTCACC TCCCAGCGGG AGCGGAACGC GCGCTCGAGG CCGCCGACGT CGTACTCGCG CTCGGCACTC GGTTCGACGG TGTAGTGACC GCCGACTGGT CGCTTCCCAT GGGCGACGCA CTCGTCCACG TGACCCTCGA CTCGAGTCGG ATCGACGTCG CCTACGACTC GGACGTGGCG ATCGTCGACG ACGTCGGCAG CGCGGTCGAT CGGCTCCGAG ACGGCCTGGG GTCCAGAGAG CGACCCGACG GCGCGTGGGA CGGGGCCGCC GTCGGTCGAC GCGTCCGCGC GGAGTACGAC GACCGGCTCG AGGACCGCGG CCTGCTCGAG GACGACGCGC CGATCGCGAC GGCCGGCGCG CTTCGGACCC TTCGCAAAGC GCTGCCACGC GAAACCGTCG TGACGACGGA CATCGGCGGG TTCCGGCTCT GGGCCAAGCA GACCTTCGAG ACGGAGGAGC CGGAAGCCTA CGTCACGTCC GGCTCCTGGG CGGGAATGGG CGTCGGTATC CCGGCCGCGA TCGGCGCGAA ACTCGCCAGG CCGGAGCGGC CGGTCGTCGC GCTGACGGGC GACGGCGGGG CCATGATGTG CCTGCAGGAA CTGCACACGG CCGCGGCGTA CGACTTGGAC GTGCTCACGA TTCTCTTCAA CAACGAGGAC TACGGGATCA TCAGCAAGTC ACCCGCCATC GACCAGTACG CCGAGGGCCA CCGATTCGAC TGGTCCTCGC CCGATTTCGC CGCGATCGCC GAGGGGTTCG GCTGTCGTGG ACAGACGGTA CAGACGCTCT CGGGGCTCGA GGACGCCGTC GAGGCCGCGC TCGCGCGGGC TGACGGACCG GAGCTGATCG ACGTTCGCGT CGACCCGGAC GAGCCGACGG CCGCGTCGTT CGCCGACTAC GACTCCGAAC TCGAGTTCTG A
|
Protein sequence | MNPTDSEGGP PTAAGSEQLY DALVDAGIDL LVGLPGTQTL PLDRTVDRRD DIRYVMARHE TAIPHIAWGH YEAGGDVAAT LTVPGPGDTN AMHGLKNALD DRVPLVHIAA DADPEDRGKG PIHEIEPDTY DNVVKENYSI ERPVEFHRAI RSGIETALTP PCGPVRLGVP KPLLDAEFCS PPVTVDPPTS RFDGDAEYET ARQLLADAER PVVYLGVGAR RTGDPDAVRA LVETLDAAVV ASYKGKGVFP EGDPRWLGVT ASHLPAGAER ALEAADVVLA LGTRFDGVVT ADWSLPMGDA LVHVTLDSSR IDVAYDSDVA IVDDVGSAVD RLRDGLGSRE RPDGAWDGAA VGRRVRAEYD DRLEDRGLLE DDAPIATAGA LRTLRKALPR ETVVTTDIGG FRLWAKQTFE TEEPEAYVTS GSWAGMGVGI PAAIGAKLAR PERPVVALTG DGGAMMCLQE LHTAAAYDLD VLTILFNNED YGIISKSPAI DQYAEGHRFD WSSPDFAAIA EGFGCRGQTV QTLSGLEDAV EAALARADGP ELIDVRVDPD EPTAASFADY DSELEF
|
| |