Gene Pars_2269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2269 
Symbol 
ID5055399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2031202 
End bp2032896 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content63% 
IMG OID640469821 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001154465 
Protein GI145592463 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.248787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAGC TCAGGATAAG GTCGTCTCAG TGGTACGACG GCGTTGATAA TGCGCCTCAC 
CGACCGTATC TACGGGCGGT GGGGCTCACG GAGGCCGACT TCGCCAAGCC ACTCGTCGGC
GTGTTGGTGT CTTGGTCTGA GCTGGGGCCA TGCAACTTCC ACAACCTGGA GCTGGTGAGG
TACGTCAAGG AGGGGGTCAA GGAAGCTGGG GGCGTCGGCC TGGCGGCGCC TACGATTGTG
GTTAACGACG GCATAAATAT GGGCACGCCG GGGATGCGCT ACTCGCTGAT CAGCCGGGAC
CTCATCGCAG ACACCATTGA GGCGCAGTTC AACTCCCACG GAGTAGACGC CTGGGTGGGC
ATCGGCGGCT GTGACAAGAC CCAGCCGGGC ATCATGATGG CGATGGTTAG GCTCGACCTC
CCGGCGGTGT ATCTCTACGG AGGCTCGGCC GAGGCGGGGT GGCTCGGCGA GCGGGAACTC
ACCATAGAGG ACGCGTTCGA GTCGGTGGGG GCGTACTTGG CGGGGAAGAT AACTCTCGAT
GAGCTGAAGA GGGTAGAGGA GCTGTCTTTC CCGACATACG GCACTTGCCA GGGGATGTTC
ACCGCAAACA CCATGGCGAC TCTCGGCGAG GCGCTTGGGC TATCCCTCTT GGGCTCGGCC
TCCCCTCCCG CCACCTCGGC AAGGCGGCGG AAGTACGCGG TGGAGAGCGG CAGGGCGGTG
CTCAAGGCGG CTGAGCTGGG CGTGACGCCG AGGAAGGTGG TCACCTACGA CGCGTTGTAC
AACGCCGCGG TGACGCTGTT CGCCACTGCT GGTAGCACCA ACGCAATTCT CCACCTCCTC
GCCATCGCCC ACGAAGCCAA CGTGAAGTTC ACTCTCGACG ACTTCGACGA GATTAGCAGA
AGAGTTCCCG TCATAGCGGC GCTGAGGCCC GCCGGGCCTT ATGCCATGCA GGACTTAGAC
AGGATAGGGG GCGTCCCCCG GGTGTTGAAG AAGCTGTATA AGGCCGGCTT GCTGAGGCCC
GAGGCGCTGA CAGTGGAGGG GGAGCCCATA GGCAAGTTGC TGGAGCGCTG GGAGCCGCCG
GCGGTGCCCG AGGCCGGCAT ACTCTACGAC GTGGAGAAGC CATACAAGCC GTATTCCGGC
ATCCGCATCC TCAGGGGCAA TCTGGCGCCC AGCGGCGCCG TGATGAAGAT AGGCGCGGCC
GACAAGCTGA GGTTCGAGGG GAGGGCGAAG GTGTACGACT CAGAGGCCGA GGCCTTCAAA
GCGGTAGCCG CCGGAGAGAT TAAGCCGGGC GACGTGGTGA TTATCCGCTA CGAGGGGCCT
AAGGGCGCGC CAGGCATGCC TGAGATGCTT AAGGTCACGG CTGCCATAGT CGGCGCGGGG
CTGGGCGATG CGGTGGCGCT GGTCACAGAT GGGAGGTTCT CGGGGGCCAC CCGCGGCATT
ATGGTGGGCC ACGTGGCGCC GGAGGCCGCC GTTGGCGGGC CTATAGCCCT AGTCCAGAAC
GGCGATAGGG TGATAATAGA CGGCGAAGCC GGCCTCATAA AGCTGGAGGT GTCCGAAGAG
GAGCTGGAGA AGAGGAGGAA GGCGTGGGCC CCCCCGCCGC CGAAATATAA AGGCGGCCTT
TTAGCCAAAT ACGCCGCATT GGTACAACAA GCCGACAAGG GAGCGGTTAC GTCACCTTCT
GCTTGGGGGA CTTAG
 
Protein sequence
MVKLRIRSSQ WYDGVDNAPH RPYLRAVGLT EADFAKPLVG VLVSWSELGP CNFHNLELVR 
YVKEGVKEAG GVGLAAPTIV VNDGINMGTP GMRYSLISRD LIADTIEAQF NSHGVDAWVG
IGGCDKTQPG IMMAMVRLDL PAVYLYGGSA EAGWLGEREL TIEDAFESVG AYLAGKITLD
ELKRVEELSF PTYGTCQGMF TANTMATLGE ALGLSLLGSA SPPATSARRR KYAVESGRAV
LKAAELGVTP RKVVTYDALY NAAVTLFATA GSTNAILHLL AIAHEANVKF TLDDFDEISR
RVPVIAALRP AGPYAMQDLD RIGGVPRVLK KLYKAGLLRP EALTVEGEPI GKLLERWEPP
AVPEAGILYD VEKPYKPYSG IRILRGNLAP SGAVMKIGAA DKLRFEGRAK VYDSEAEAFK
AVAAGEIKPG DVVIIRYEGP KGAPGMPEML KVTAAIVGAG LGDAVALVTD GRFSGATRGI
MVGHVAPEAA VGGPIALVQN GDRVIIDGEA GLIKLEVSEE ELEKRRKAWA PPPPKYKGGL
LAKYAALVQQ ADKGAVTSPS AWGT