Gene Pisl_0249 details


Gene Information

Locus tag: Pisl_0249
Symbol:
ID: 4616871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 243704
End bp: 245125
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 60%
IMG OID: 639783329
Product: succinate-semialdehyde dehydrogenase (NAD(P)(+))
Protein accession: YP_929772
Protein GI: 119871765
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
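
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 243704
END_BP = 245125


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1422
print(protein_length(length))  # 473
```

Under these assumptions the computed values agree with the Gene Length (1422 bp) and Protein Length (473 aa) fields in the record.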


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGATG TAAAAGATTT AATTGTTATA GTTAATCCAG CTACAGAGGA GGTAATCGCC 
GAGCTTCCCA AAGCTACGAG AGAAGACGTG AGAAGGGCCA TAGATGCGGC GTGGGACGCC
TTCGCCAGCT GGTCGGCCCT ACCGCTGAGG AAGAGGACCC GCGTCTTGCT GAAGACCGCC
GAGCTTGCCG AGACCGCCAG GGAGGACCTC CTCAAGACCC TGGTGGCGGA GTCCGGGAAG
CCCATTAAAG ACGCCGAGGC GGAGATCACG AGGGCAATAG AAATCTTCCG CTCCAGCGCG
GAAGAGGCCA AGCTGATCCT AGAGGGGTCG GTCCCCAGGG TAGACGCCTA CGAGTACCCT
ATCGGCAACG AAAACAGACT CGTGGTGGCC GTGAGAGAGC CCGTGGGCGT CGTCGGGGGG
GCCCTCAGCT ACAACAACCC CGTCTCCACT TTCGCCCACA AGGTGGCCCC CGTCATCGCG
GCGGGGAACA CAGTCGTCGT GAAGCCTTCC TCCTACACCC CCCTCACCGC CCTTAAATTC
CTGGAGATTA TGAAGAGGGC TGGGGTGCCC GAGGGCGTGG TAAACGTAGT TGTAGGCAGC
GGGGAGGAGA TCTTCGATGA GCTTATCCAG AGCGACAAGG TCGCTGGGAT AAACTTCACC
GGCAGCACCG CGGTGGGGCT ACAAGTGGCA GCTAAGGCCG CCTCCAGGGG GAAGAAGTTC
ATGATAGCAC CCGGAGGTTC CGACCCGGCC GTGGTGTTTA AAGACGCCGA TTTAGACGCC
GCGGCTAAGA TCATCGCCAG AGCCCGGTAC GAAAACGCGG GCCAGAACTG CAACGCCACC
AAGAGGGTTT TCGTGGAGCG GGAGGTCTAC CCCAAGTTCG TAGAGCTCCT CCTCGGCTAT
GTGAAAGCCA TAAGAGTAGG CGACCCCATG GACTACAGCA CAGACATGGG TCCCCTCATC
TCCGAAAAGA TGGTGAGGGC CATGGACAGC GTAGTGAAAG ACGCCCTCGA GAAAGGCGCC
AAACTGGCGG CGGGGGGCAG GAGGATGAAC AGGAGGGGCT ACTTCTACGA GCCCACCGTC
CTCCTCTTCG ACGGCGACGC CGAGGCTAAG GCGCTTAGGG AGGAGGTCTT CGGGCCGGTT
CTGCCCGTGG TGCCCTTCGA GGGGGAGGAG GAGGCCGTCC GCCTCGCCAA CGCCACCCAG
TACGGCCTAC AGTCTGCTGT CTTCACCTCG GACTACAGGA AGGCGCTTAG GGTGGCGAGA
GCCATAAAGG CGGGGGCGGT CATGATAAAC GACAGCACCA GGGTGAGGTT CGACGCTCTT
CCTTACGGCG GCGTTAAGAT GTCGGGCTTC GGCTGGAGAG AGGGCGTGAG GTCGACCATG
ATCTACTACA CAGAGCCCAA GTTCCTCGTC TTCGGGCTTT GA
 
Protein sequence
MKDVKDLIVI VNPATEEVIA ELPKATREDV RRAIDAAWDA FASWSALPLR KRTRVLLKTA 
ELAETAREDL LKTLVAESGK PIKDAEAEIT RAIEIFRSSA EEAKLILEGS VPRVDAYEYP
IGNENRLVVA VREPVGVVGG ALSYNNPVST FAHKVAPVIA AGNTVVVKPS SYTPLTALKF
LEIMKRAGVP EGVVNVVVGS GEEIFDELIQ SDKVAGINFT GSTAVGLQVA AKAASRGKKF
MIAPGGSDPA VVFKDADLDA AAKIIARARY ENAGQNCNAT KRVFVEREVY PKFVELLLGY
VKAIRVGDPM DYSTDMGPLI SEKMVRAMDS VVKDALEKGA KLAAGGRRMN RRGYFYEPTV
LLFDGDAEAK ALREEVFGPV LPVVPFEGEE EAVRLANATQ YGLQSAVFTS DYRKALRVAR
AIKAGAVMIN DSTRVRFDAL PYGGVKMSGF GWREGVRSTM IYYTEPKFLV FGL
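
The sequences above are printed in groups of ten residues, so they need to be joined before any analysis. The following is a minimal Python sketch, with hypothetical helper names, of how the grouping could be removed and how a GC percentage like the one in the GC content field could be computed. The demo uses only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two groups of the gene sequence above, used only as a small demo.
demo = clean_sequence("ATGAAAGATG TAAAAGATTT")
print(len(demo))         # 20
print(gc_percent(demo))  # 20.0
```

Passing the complete gene sequence to `clean_sequence` and then to `gc_percent` would give the value to compare against the GC content field.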