Gene Pars_0780 details

Gene Information

Locus tag: Pars_0780
Symbol:
ID: 5054999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 697335
End bp: 698696
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 60%
IMG OID: 640468339
Product: 2-methylcitrate dehydratase
Protein accession: YP_001153018
Protein GI: 145591016
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
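
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the coordinates are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 697335
end_bp = 698696

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1362 453
```

Under these assumptions the computed values agree with the reported Gene Length (1362 bp) and Protein Length (453 aa).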


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAGGG TAACTAAAAC CCTAGCCGAG TACGCCCAGT CCATAGACTT CTCTAGGGTG 
CCCCCCGAGA CGCGGCACGA GGTCAAGCGG AGGATTATAG ACTCGCTTGC TGTGGCCTTC
GCCGCATACA CCGCGGAGCC TGTCGCCATA GCCAGGAGGG CCGCGTCCAA ATTCCCCACA
GCCAAAGGCG CCAGGATTGT CGGCACACGC TACCAAGTCA CGCCCGACTG GGCTACTTTC
GTCAACGGGC TCATGATTCG CTACCACGAC TTCAACGACA CATATCTCAG CAAGGAGCCC
CTACACCCCA GCGACTTGAT AGGCGCCGCC CTCGCCGTTG GGGACTACGT CGGGGCCAGG
GGCACTGACT TAATTACGGC CATCGCAATC GGCTACGAGG CCTCTGTCAC TTTCTGCGAC
GGGGGGACTC TGCGGAAGAG GGGGTGGGAC CACGTCAACT TCTTGGGCAT AGGCAGCGCG
CTGGCGGCGG CCAAGCTCCT TGGCCTAGAT ACAACAAAGA CGCAACACGC CCTAGCCATC
TACGCAGTGC CCCACGCCGC GATGCGCCAG ACGAGGGTAG GCGAGCTCTC CATGTGGAAA
GGCGCGGCCG CCGCCAACTC CAGCAGAAAC GCGGTCTTCG CCGCCCTGCT GGCGCAGGAG
GGCTTTACGG GGCCCTATAA GCCCTTCGAG GGGGAGATGG CCTTCTTCAA GCAACTACTC
CAGGGGGACT TCGACTTCTC AGTCCTCAAG CCGCTGGAGG AGGGCAGGCC GCCCCGGAGG
ATACTCGACA CATACATAAA GCCCTACCCC GTCGAGTACC ACGCCCAGAC CGCCGTCGAG
GCCGCGCTGA GGCTTAGGGA GAGGGTTAGG CTCGAGGAGA TAGAAAAGAT CAGAATCGAC
ACATACGAGG CGGCGTACAC CATCATAGGC CCTAAGGACC CAGAGAAGTG GGACCCACAC
ACAAGGGAGA CCGCCGACCA CTCGTTGATG TGGATAACGG CGGCGGCCCT CGTCTGGGGC
CCCATAAAAA TTGAACACTA CAAGGACCTT CGCAACCCAG CTGTCCTCTC TCTGATGAAG
AAGATGGAGG TGAACCTAGA CCCGGAGCTG GACAAGCTGT ACCCACAGGC CTTCCCCACC
GTGATAACCG TCTACACGAG GGGAGGCGCC AAGTACACCG AGCGCGTCGA CTACGCAAAG
GGGCATCCGA AAAACCCCAT GACAGACGCC GAGCTGGAGG AGAAGTTCAA CACACTGACC
CGCGACGTGT TGCCAGAAGA CGCCAGGAAG AGGATCTTGC AAATGCTCTG GCGCCTTGAA
AACTACGACA TAAGCGACTT AGTAGAAGCC CTCGCAGTCT AA
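
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as a string with the 10-base blocks separated by whitespace; `gc_content` is a hypothetical helper, not part of any database tool. For brevity it is applied only to the first 10-base block, so the printed value describes that block and not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First 10-base block of the gene sequence (illustration only).
first_block = "ATGGACAGGG"
print(gc_content(first_block))  # prints: 60.0
```

Passing the full 1362 bp sequence to the same function would give the gene-wide figure to compare against the reported 60%.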
 
Protein sequence
MDRVTKTLAE YAQSIDFSRV PPETRHEVKR RIIDSLAVAF AAYTAEPVAI ARRAASKFPT 
AKGARIVGTR YQVTPDWATF VNGLMIRYHD FNDTYLSKEP LHPSDLIGAA LAVGDYVGAR
GTDLITAIAI GYEASVTFCD GGTLRKRGWD HVNFLGIGSA LAAAKLLGLD TTKTQHALAI
YAVPHAAMRQ TRVGELSMWK GAAAANSSRN AVFAALLAQE GFTGPYKPFE GEMAFFKQLL
QGDFDFSVLK PLEEGRPPRR ILDTYIKPYP VEYHAQTAVE AALRLRERVR LEEIEKIRID
TYEAAYTIIG PKDPEKWDPH TRETADHSLM WITAAALVWG PIKIEHYKDL RNPAVLSLMK
KMEVNLDPEL DKLYPQAFPT VITVYTRGGA KYTERVDYAK GHPKNPMTDA ELEEKFNTLT
RDVLPEDARK RILQMLWRLE NYDISDLVEA LAV
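
The protein sequence can be related back to the gene sequence by translating it with the translation table named in the record (table 11). The sketch below builds the codon table from the compact NCBI-style amino acid string, whose codon assignments for table 11 match the standard code, and translates the first 30 and the last 42 bases of the gene. It assumes the reading frame starts at the first base and stops at the first stop codon. The final sequence line starts in frame because the 22 full lines before it hold 1320 bases, a multiple of 3. `translate` and the constant names are hypothetical.

```python
# Codon table in NCBI compact form: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases and final 42 bases of the gene sequence.
first_30 = "ATGGACAGGG TAACTAAAAC CCTAGCCGAG"
last_42 = "AACTACGACA TAAGCGACTT AGTAGAAGCC CTCGCAGTCT AA"

print(translate(first_30))  # prints: MDRVTKTLAE
print(translate(last_42))   # prints: NYDISDLVEALAV
```

Both results match the corresponding ends of the protein sequence: the first block `MDRVTKTLAE` and the closing residues `NYDISDLVEA LAV`.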