Gene Information |
Locus tag | Tneu_1925 |
Symbol | |
ID | 6164868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1697040 |
End bp | 1698725 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641669088 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001795286 |
Protein GI | 171186367 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
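The coordinates and lengths listed above agree with one another, which a short calculation can confirm. The following is a minimal Python sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1697040, 1698725)
print(length_bp, protein_length(length_bp))  # prints: 1686 561
```

Under these assumptions the result matches the Gene Length (1686 bp) and Protein Length (561 aa) rows.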
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.906776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAGA TCAAGGTGCG TTCCTCCGCC TGGTACGACG GCGTAGACAA CGCCTCGCAC AGGTCCTACC TAAGGGCCGT CGGCTTCACG GAGGAGGACT TCGCCAAGCC CCTGGTGGGG GTCCTCGCCG CGTGGTCGGA GCTGGGGCCC TGCAACTACC ACACCCTCGA TCTGGCCAGG TACGTGAAGG AGGGAGTTAA GGAGGCCGGC GGCGTTGGGC TGACGGCGCC TACCATCGTC GTAAACGACG GCATAAACAT GGGAACCCCG GGGATGAGGT ACTCCCTAAT TAGCAGGGAC CTAATCGCGG ACACCATAGA GGCGCAGTTC AACGCCCACG GAGTAGACGC TTGGGTGGGC ATAGGCGGAT GCGACAAGAC GCAGCCGGGC ATAATGATGG CCATGGTTAG GCTGGACCTC CCCGCCGTCT ACCTCTACGG CGGCACGGCG GAGGCTGGGT GGCTCGGCGA GCGGGAGCTC ACCATAGAGG ACACCTTCGA GGCAGTGGGG TCCTACCTGG CGGGGAAGAT CACGCTGGAG GAGCTCAAGA GGATAGAGGA GCTCTCCTTC CCCACCTACG GCACGTGTCA AGGCCTCTTC ACGGCCAACA CCATGGCGAT GCTTTCGGAG GCCCTCGGCC TAGCCCTCCT CGGCTCTGCG TCGCCCCCCG CCACCTCGGC CAGACGCAGG GCCTACGCCG TGGCTTCCGG ACGCGCCGTT TTAAAAGCCG CCGAACTCGG CGTAACCCCC CGCAGGGTGG TCACATACGA CGCCATCTAC AACGCCGCGG TGACGCTGTT TGCCACGGCG GGCTCCACCA ACGCGATACT CCACCTCCTG GCCATAGCCC ATGAGGCGGG GGTGAAGTTC ACCCTAGACG ACTTCGACGA GATAAGCAGG AGGGTACCCG TCATAGCCGC CCTACGGCCG GCGGGGCCCT ACGCCATGCA GGACCTCGAC AGAATAGGCG GCGTGCCGAG GATACTCAAG AAGCTCTACA AAGCCGGCTT CCTCAGGCCG GAGGCCCTCA CAGTCGAGGG CGAGACCATA GGCAAGCTCC TGGAGAGGTG GCAACCCCCC GCCGTGCCCG AAGACGGCAT CCTCTACAGC GTGGAGAAGC CCTACAAGCC CTACTCCGGC ATCAGAATCC TCAGAGGCAA CCTAGCCCCA GACGGCGCCG TGATGAAGAT AGGCGCCGCC GACAAGCTGA AGTTCGAGGG GACGGCCAAG GTATACAACG GAGAGGCCGA GGCCTTCAAG GCGGTGGCCG CCGGCGAGAT AAAGCCCGGA GACGTCGTGG TGATAAGATA CGAGGGGCCG AAGGGCGCCC CAGGCATGCC AGAGATGCTT AAAGTCACCG CCGCCATCGT CGGCGCTGGC CTCGGCGAGG CGGTAGCCCT GGTGACAGAC GGCCGCTTCT CAGGAGCCAC CCGCGGCATA ATGGTGGGAC ACGTCGCCCC CGAGGCGGCC GTGGGAGGCC CCATAGCCCT AGTAGAAAAC GGAGACAAAA TAGCCATAGA CGGCGAGACA GGCCGCATAA CCCTACAGAT CCCCCAGGAG GAGCTGGAGA GGAGAAGGAA AAACTGGACG CCGCCGCCCC CCAAATACTC CGGAGGACTC CTCGCCAAAT ACGCCGCACT GGTCCAACAG GCGGACAAAG GCGCGGTCAC CACACCACCA CGATAA
|
Protein sequence | MVKIKVRSSA WYDGVDNASH RSYLRAVGFT EEDFAKPLVG VLAAWSELGP CNYHTLDLAR YVKEGVKEAG GVGLTAPTIV VNDGINMGTP GMRYSLISRD LIADTIEAQF NAHGVDAWVG IGGCDKTQPG IMMAMVRLDL PAVYLYGGTA EAGWLGEREL TIEDTFEAVG SYLAGKITLE ELKRIEELSF PTYGTCQGLF TANTMAMLSE ALGLALLGSA SPPATSARRR AYAVASGRAV LKAAELGVTP RRVVTYDAIY NAAVTLFATA GSTNAILHLL AIAHEAGVKF TLDDFDEISR RVPVIAALRP AGPYAMQDLD RIGGVPRILK KLYKAGFLRP EALTVEGETI GKLLERWQPP AVPEDGILYS VEKPYKPYSG IRILRGNLAP DGAVMKIGAA DKLKFEGTAK VYNGEAEAFK AVAAGEIKPG DVVVIRYEGP KGAPGMPEML KVTAAIVGAG LGEAVALVTD GRFSGATRGI MVGHVAPEAA VGGPIALVEN GDKIAIDGET GRITLQIPQE ELERRRKNWT PPPPKYSGGL LAKYAALVQQ ADKGAVTTPP R
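The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal Python sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo string is only the first three blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence above, used only as a demo.
demo = clean_sequence("ATGGTAAAGA TCAAGGTGCG TTCCTCCGCC")
print(len(demo), round(gc_fraction(demo), 3))
```

Passing the full Gene sequence value through the same two helpers should give a figure comparable to the GC content row, though how the database rounds that value is not stated in the record.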