Gene Pars_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1390 
Symbol 
ID5055899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1253581 
End bp1254486 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content55% 
IMG OID640468933 
Product5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase 
Protein accessionYP_001153602 
Protein GI145591600 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.355607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.367054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTA AGCCGCATGT GAAGTTGCTC ACTTTTAGAA AAAACGTAGT GAAGGTGGGT 
CTTTGGAAGG ATGGGAAAAT ACTTGATTTG CCGGAAGCAT ACAAGGCGGT TTTCGGCGCG
TACGAAGCGC CGGACTTCCT TTACAGCATG AGGAAGCTCA TAGCCGTGGG GGAGCCCGCC
CTTGAAATTA TAAGAAAAAT AGAGGCCGAG GCGAGAGGGC CCTTCTACGC GCCGCATGAG
GTGGTCTGGG AGCCTCCCGT GCAGGATCCC GAGAAGGTGC TCGCAGTGGC TGTCAACTAC
AGATCCCACG GCAAGGAGAT GGGGCACGAG CCCCCGCCCC GCCCCTACTT CTTCCCCAAG
TTGCCGAACG CCCTCGTTGG CCACGAGAGG CCGATAATAA AACACCGTGT AGTGCAGAAG
TTGGACTGGG AGGTGGAGCT TGTGGTTGTC ATTGGGAGAG CCGGCAAGTA CATCGACCCC
GAGAGGGCGC TTGACTACGT CTTTGGCTAT ACAGTAGGCA ACGATGTGTC GATAAGGGAT
TGGCAGTACC CAGCTACTCA GTACGGATTC AACTGGATAT GGGGCAAATC CATGGACACA
GCGGCACCGG TGGGGCCTTG GATTGTTACT AAGGACGAGG TGCCAGACCC CAACAAGCTG
GGGCTTAGGC TGTGGGTCAA CGGCCAGCTG GAGCAAGAGG GCAATACCTC AGACCTAATA
TTCAACGTTC AGCTGTTGAT ACACTGGGCG TCCCAGGGCA TAACCCTCAA GCCAGGCGAC
ATGATATTCA CCGGCACCCC GCCAGGAGTG GGATATCCCA AGGGCAAGTT CCTAAAAGGC
GGCGACATCG TCGAGGCAGA AGTGGAGACT GTAGGTCTGC TTAGGAACTA CGTAATAGAG
GAATAA
 
Protein sequence
MRIKPHVKLL TFRKNVVKVG LWKDGKILDL PEAYKAVFGA YEAPDFLYSM RKLIAVGEPA 
LEIIRKIEAE ARGPFYAPHE VVWEPPVQDP EKVLAVAVNY RSHGKEMGHE PPPRPYFFPK
LPNALVGHER PIIKHRVVQK LDWEVELVVV IGRAGKYIDP ERALDYVFGY TVGNDVSIRD
WQYPATQYGF NWIWGKSMDT AAPVGPWIVT KDEVPDPNKL GLRLWVNGQL EQEGNTSDLI
FNVQLLIHWA SQGITLKPGD MIFTGTPPGV GYPKGKFLKG GDIVEAEVET VGLLRNYVIE
E