Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2269 |
Symbol | |
ID | 5055399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2031202 |
End bp | 2032896 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640469821 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001154465 |
Protein GI | 145592463 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.248787 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAGC TCAGGATAAG GTCGTCTCAG TGGTACGACG GCGTTGATAA TGCGCCTCAC CGACCGTATC TACGGGCGGT GGGGCTCACG GAGGCCGACT TCGCCAAGCC ACTCGTCGGC GTGTTGGTGT CTTGGTCTGA GCTGGGGCCA TGCAACTTCC ACAACCTGGA GCTGGTGAGG TACGTCAAGG AGGGGGTCAA GGAAGCTGGG GGCGTCGGCC TGGCGGCGCC TACGATTGTG GTTAACGACG GCATAAATAT GGGCACGCCG GGGATGCGCT ACTCGCTGAT CAGCCGGGAC CTCATCGCAG ACACCATTGA GGCGCAGTTC AACTCCCACG GAGTAGACGC CTGGGTGGGC ATCGGCGGCT GTGACAAGAC CCAGCCGGGC ATCATGATGG CGATGGTTAG GCTCGACCTC CCGGCGGTGT ATCTCTACGG AGGCTCGGCC GAGGCGGGGT GGCTCGGCGA GCGGGAACTC ACCATAGAGG ACGCGTTCGA GTCGGTGGGG GCGTACTTGG CGGGGAAGAT AACTCTCGAT GAGCTGAAGA GGGTAGAGGA GCTGTCTTTC CCGACATACG GCACTTGCCA GGGGATGTTC ACCGCAAACA CCATGGCGAC TCTCGGCGAG GCGCTTGGGC TATCCCTCTT GGGCTCGGCC TCCCCTCCCG CCACCTCGGC AAGGCGGCGG AAGTACGCGG TGGAGAGCGG CAGGGCGGTG CTCAAGGCGG CTGAGCTGGG CGTGACGCCG AGGAAGGTGG TCACCTACGA CGCGTTGTAC AACGCCGCGG TGACGCTGTT CGCCACTGCT GGTAGCACCA ACGCAATTCT CCACCTCCTC GCCATCGCCC ACGAAGCCAA CGTGAAGTTC ACTCTCGACG ACTTCGACGA GATTAGCAGA AGAGTTCCCG TCATAGCGGC GCTGAGGCCC GCCGGGCCTT ATGCCATGCA GGACTTAGAC AGGATAGGGG GCGTCCCCCG GGTGTTGAAG AAGCTGTATA AGGCCGGCTT GCTGAGGCCC GAGGCGCTGA CAGTGGAGGG GGAGCCCATA GGCAAGTTGC TGGAGCGCTG GGAGCCGCCG GCGGTGCCCG AGGCCGGCAT ACTCTACGAC GTGGAGAAGC CATACAAGCC GTATTCCGGC ATCCGCATCC TCAGGGGCAA TCTGGCGCCC AGCGGCGCCG TGATGAAGAT AGGCGCGGCC GACAAGCTGA GGTTCGAGGG GAGGGCGAAG GTGTACGACT CAGAGGCCGA GGCCTTCAAA GCGGTAGCCG CCGGAGAGAT TAAGCCGGGC GACGTGGTGA TTATCCGCTA CGAGGGGCCT AAGGGCGCGC CAGGCATGCC TGAGATGCTT AAGGTCACGG CTGCCATAGT CGGCGCGGGG CTGGGCGATG CGGTGGCGCT GGTCACAGAT GGGAGGTTCT CGGGGGCCAC CCGCGGCATT ATGGTGGGCC ACGTGGCGCC GGAGGCCGCC GTTGGCGGGC CTATAGCCCT AGTCCAGAAC GGCGATAGGG TGATAATAGA CGGCGAAGCC GGCCTCATAA AGCTGGAGGT GTCCGAAGAG GAGCTGGAGA AGAGGAGGAA GGCGTGGGCC CCCCCGCCGC CGAAATATAA AGGCGGCCTT TTAGCCAAAT ACGCCGCATT GGTACAACAA GCCGACAAGG GAGCGGTTAC GTCACCTTCT GCTTGGGGGA CTTAG
|
Protein sequence | MVKLRIRSSQ WYDGVDNAPH RPYLRAVGLT EADFAKPLVG VLVSWSELGP CNFHNLELVR YVKEGVKEAG GVGLAAPTIV VNDGINMGTP GMRYSLISRD LIADTIEAQF NSHGVDAWVG IGGCDKTQPG IMMAMVRLDL PAVYLYGGSA EAGWLGEREL TIEDAFESVG AYLAGKITLD ELKRVEELSF PTYGTCQGMF TANTMATLGE ALGLSLLGSA SPPATSARRR KYAVESGRAV LKAAELGVTP RKVVTYDALY NAAVTLFATA GSTNAILHLL AIAHEANVKF TLDDFDEISR RVPVIAALRP AGPYAMQDLD RIGGVPRVLK KLYKAGLLRP EALTVEGEPI GKLLERWEPP AVPEAGILYD VEKPYKPYSG IRILRGNLAP SGAVMKIGAA DKLRFEGRAK VYDSEAEAFK AVAAGEIKPG DVVIIRYEGP KGAPGMPEML KVTAAIVGAG LGDAVALVTD GRFSGATRGI MVGHVAPEAA VGGPIALVQN GDRVIIDGEA GLIKLEVSEE ELEKRRKAWA PPPPKYKGGL LAKYAALVQQ ADKGAVTSPS AWGT
|
| |