Gene Pisl_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1949 
Symbol 
ID4618134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1763251 
End bp1764222 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content65% 
IMG OID639785040 
Productalcohol dehydrogenase 
Protein accessionYP_931439 
Protein GI119873432 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0237239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value9.46914e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGCGG CTAGGCTCTA CGGCCCGCGC GACTTGCGCG TCGAGGAGAT TCCGGCGCCC 
AAGCCAGAGC GGGGGTGGGC GCTGGTGAGA ACGCTGGCGG TGGGGATCTG CGGCACAGAC
AAGGCCTTCT ACAGGGGGAC CTACCGGCTG TTTAAGACGC CTCTTGTACC GGGCCACGAG
GCGGTGGGGG TGGCGGAGGG GGGCGAGTTG GACGGGAGGG TGGTGGTGAG CGAGATCAAC
TTCGCCTGTG GTAGGTGTGA GATGTGTAGA GCTGGTCTCT ACACGCACTG CCCCTACAAG
AGGACGCTTG GGATAGACTT CGACGGGGGG ATGGCGGAGT ACTTCGTGGC GCCTCTCGAG
GCCCTCCACC CCGCCGAGGG GCTGGACCCA GCCGCCGCCA CTCAGGTAGA GCCGCTGGCG
GCTGTGTTGA ACGCCCTTGC CCAGGTGCCG CCTCCGCCGG GGGCGAAGGT GGCTATCCTG
GGGACGGGGA ACGTGGCCTA CCTCGCGGCG CAAGTCCTCC GGGGGTTCGA CCCCGTAGTG
GTGGCTAGGC GGGGGAGCGC CAAGGCGCAC CTCTTCAGGG GGCTGGGGCT GGAGGTGGTG
GAGTTGGGCG AGCTGGGTGA GTACATGGCG GAGAACGCGC CGCTGGGGTT CGACGTCGTG
TTTGAGGCCA CTGGCGACCC CTCTGCGATT AATACGGCTA TAGAGATAGC GAGGCCCCGC
GGCGTGATAC ACCTAAAGTC CACCCCCGGC TCCCCCGCCC CCGCCAACCT AACGCCGGCG
GTGGTCAAAG AGCTGAGGAT AGTGGGCACT AGATGCGGCA CATACAGAGA GTTCAGACAC
GCCATCAAGC TCATTAGAGA AGGCATCGTG AAGCCCCTCA TCACCTCCGT AGTAACGGGG
ATACACAACG CGAGAGAGGC TTTCGAGAGG GCCCTCCAAC CCAACGAGGT AAAGGTAGTA
CTGAAGCCCT AG
 
Protein sequence
MLAARLYGPR DLRVEEIPAP KPERGWALVR TLAVGICGTD KAFYRGTYRL FKTPLVPGHE 
AVGVAEGGEL DGRVVVSEIN FACGRCEMCR AGLYTHCPYK RTLGIDFDGG MAEYFVAPLE
ALHPAEGLDP AAATQVEPLA AVLNALAQVP PPPGAKVAIL GTGNVAYLAA QVLRGFDPVV
VARRGSAKAH LFRGLGLEVV ELGELGEYMA ENAPLGFDVV FEATGDPSAI NTAIEIARPR
GVIHLKSTPG SPAPANLTPA VVKELRIVGT RCGTYREFRH AIKLIREGIV KPLITSVVTG
IHNAREAFER ALQPNEVKVV LKP