Gene Pisl_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0039 
Symbol 
ID4618028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp34968 
End bp35900 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content54% 
IMG OID639783120 
Product5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase 
Protein accessionYP_929566 
Protein GI119871559 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.571239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATATAA ACGACGGCCA ACGTCATGTT GTGAAACTAC TTACATTTAG AAGGGGGGAG 
GTTAGAAAAG TTGGGCTTTT TAAAAACGGC AGGATTTTAG ACTTGCCCGA GGCGTACAAA
GCGGTGTTTA ACACAGAGGA GGCGCCAGAT TTTCTATACG ACATGAGACG CCTTATTGCA
CTAGGCGAGC CTGCGCTTGA GATAGTTAAG AAGTTAGACG AGAGAGCCAG AGGGCCGTTT
TACAAGCCAG AGGAGATAAA GTGGGAGCCG CCTGTGCCAA ACCCAGAGAA AATACTCTGC
GTAGCCGTCA ACTACAGAGA ACACGGCGCC GAGACTGGGA TAGAGCCCCC CGACAAGCCC
TACTTCTTCC CCAAGTTTCC AAATGCCCTA GTGGGCCACG AGGGCTATGT AGTGAAGCAC
AGGGTGGTAC AGAAGCTAGA CTGGGAGGTA GAGCTCGTCG TCGTAATGGG GCGCCCCGGC
AAATACATAG AGCCAGAGAG GGCGCTGGAC TACGTCTTCG GCTACACCGT CGGGCTAGAC
ATGTCTATGC GCGACTGGCA GAACCCAGAC GAGAAGACCG CCAGACAGTA CGGAAAGAAC
TGGATATGGG GCAAGACTAT GGACACCGCC GCGCCTGTGG GCCCGTACAT TGCGACAAGA
GACGAGGTGC CAGACCCCAA CAGACTGGGG CTGAGGCTTT GGGTAAACGG CCAGCTAGAA
CAGGAGGGAA ACACCTCCCA GCTCATCTTC AATATCCAAC AGTTGATATA CTGGGCATCC
CAAGGCATAA CCCTCCGCCC CGGCGACCTC ATTTTCACAG GGACGCCGCC CGGGGTGGGC
TGGGCCAAGG GGAAGTTCTT AAAGGGGGGA GACATCGTAG AGGCCGAGGT GGAGTCTATA
GGCCGTCTCA GAGCGTATAT AATTGAGGAG TAG
 
Protein sequence
MYINDGQRHV VKLLTFRRGE VRKVGLFKNG RILDLPEAYK AVFNTEEAPD FLYDMRRLIA 
LGEPALEIVK KLDERARGPF YKPEEIKWEP PVPNPEKILC VAVNYREHGA ETGIEPPDKP
YFFPKFPNAL VGHEGYVVKH RVVQKLDWEV ELVVVMGRPG KYIEPERALD YVFGYTVGLD
MSMRDWQNPD EKTARQYGKN WIWGKTMDTA APVGPYIATR DEVPDPNRLG LRLWVNGQLE
QEGNTSQLIF NIQQLIYWAS QGITLRPGDL IFTGTPPGVG WAKGKFLKGG DIVEAEVESI
GRLRAYIIEE