Gene Pars_1483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1483 
Symbol 
ID5054244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1341496 
End bp1342683 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content47% 
IMG OID640469023 
Productimidazolonepropionase 
Protein accessionYP_001153692 
Protein GI145591690 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATAA TAAGGGCGAA GCAACTTGTA ACTGCTTTAA ACATGCCCTG GAGATACAGA 
GACGAAGTCG CCGTAATTAA CGACGCCGCT GTCGTGATTA GAGACGGCGT GATAATAGAC
GTCGGCACGT GGGAAGAAAT AAAACGAAGA CATCCACATG CCAATATTTG GGATTTCGGC
GACAATCTCA TAACGCCAGG CCTAGTGGAC CCACATACGC ACTTACTTTT TGCAGGTTCG
CGAGAAGACG AGCTTGAAAG AAAGCTACAG GGCGAGTCGT ACGAAGAAAT AACGAGAAAA
GGCGGCGGCA TATACAAAAC CGTGAAATAT ACGAAAGAGA CAAGCGACCA AGAGCTGTTG
AATATCCTAC AGAAAAGAAT TCAATTAGCT ACATCTTTTG GCACAACAAC AGTTGAGGTA
AAAACTGGAT ATGGGCTAGA CATAGATCAA GAACTGAGAC TCGCCAGAAT TTTAAAGAGC
GTGAAAAGCC CCATAGACGT AGTCACAACA TTTCTAGTAC ACATCCCGCC GCCGGCAGGA
AGAGAAAATT ATGTAAAAGA AGTGCTTAAG GCCATTCCAC ATGCTGGCAC AACATATGTA
GACGTCTTCT GTGACTCTAT AGCGTTTAAT GTGGAGGAGA CAAGAACTAT TTTGAAGAAG
GCCGCCGAGG CCGGCTACAA GCTTAGGCTA CATGCAGACG AGCTTGAGTA CATAGGATGT
AGCGACTTGG TAGAAGAATT GCCTATAGAC TCAGCCGACC ATCTCCTAAA TACGCCGCCT
GAGAATGTAA GAAAAATCGC AAAATCCGGA ACAGTGGCGA CTCTGCTACC GGTTACTATT
CTTACACTCA GAACGTCTAA AAAACCGCCC ATTGATGAGA TGAGGCGACT TAGAGTTCCC
ATAGCTATAG GAACAGACTT TAGCCCTAAC AGCTGGTGTC TAAACATGCA AACGGCAATA
GAACTCGCAG TATATCTCCT GGGGCTCACC CCCTTAGAGG CGCTGATTGC CGCTACGGCT
AACGCCGCCT ACAGCCTACG TCTTACAGAC AGGGGGATTA TCCAGCCGGG CAAAATTGCA
GACTTGGTAA TATGGGATGT ACCGAACTAC CACTGGCTCG CGTATGAAAT TGGCAGAAAT
AAGGCGAAGC TTGTACTGAA AAAAGGGGAG CCACTACGGT TTCTCTAA
 
Protein sequence
MVIIRAKQLV TALNMPWRYR DEVAVINDAA VVIRDGVIID VGTWEEIKRR HPHANIWDFG 
DNLITPGLVD PHTHLLFAGS REDELERKLQ GESYEEITRK GGGIYKTVKY TKETSDQELL
NILQKRIQLA TSFGTTTVEV KTGYGLDIDQ ELRLARILKS VKSPIDVVTT FLVHIPPPAG
RENYVKEVLK AIPHAGTTYV DVFCDSIAFN VEETRTILKK AAEAGYKLRL HADELEYIGC
SDLVEELPID SADHLLNTPP ENVRKIAKSG TVATLLPVTI LTLRTSKKPP IDEMRRLRVP
IAIGTDFSPN SWCLNMQTAI ELAVYLLGLT PLEALIAATA NAAYSLRLTD RGIIQPGKIA
DLVIWDVPNY HWLAYEIGRN KAKLVLKKGE PLRFL