Gene Pars_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0657 
Symbol 
ID5055605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp584999 
End bp586039 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID640468217 
Productsaccharopine dehydrogenase 
Protein accessionYP_001152900 
Protein GI145590898 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATAC TGCTAATGGG ATGCGGCAAC ATAGGGAAGT ACATCTACAA CGCTCTTTCC 
CAGAGGCACG AAGTAGCCGT GGCCGACAAG GCCGGGGGGT GTCCTTCTAC CATTGCTCGC
GACGCTCTGG AGGTGCCCCT CGGCGGGTAC GATCTTGTAA TCAACGCGTT GCCTGGGAAT
ATTGCGTATA AGGCGTCGCG GCGGGCCTTA GAGGTGGGTG TCGACGTTGT AGATGTATCG
TTCTTCCCGG AAGACCCCTT TGAACTCGAC GAGGTGACAA AGAAAAGCGG GGCTAGGTAC
ATCCCAGATG CGGGGGTTGC TCCTGGGCTT AGCAACGTGT TGGCAGGTAG GTTGGTGGCG
GAGCTGGGCA AGGTTGACGA GCTGGGCATA TACGTGGGGG GCATACCCGA GAGACCCGTC
GGTCCTCTCG GGTATTCAAT AACGTGGAGC CCCCTAGACC TAATTGAGGA GTACACGAGA
CCGGCCAGGG TGAGGAGGAG CGGCGAGTTA GTGTCGGTTG ATCCGCTCAG CGGCGTTGAG
CTCGTCCCCT CGCCTCTTGG GATGCTTGAG GCGTTCTACA CAGACGGCCT ACGCACACTC
CTGAAGACGC TGGACGTCCC TAACATGTAC GAAAAAACGT TAAGGTGGCC AGGCCATATA
GAAAAGATCA AACTTCTTCG CGATTTGGGG TTCATGTCGG AGGAGGGGGA TCCGCCCCCG
CGCCTAGTGA CGGCTAATCT GCTTTCCCGC CTCAAATTCG ATGTGCCTGA TGTGGTATAT
ATGAAGGTTG TAGGGAGCGG CGGCCAGAAG AAAGTTCAAT ATGAAGTCAC CGTCAGGCCT
CGCGCCGGGT GGACTGCGAT GCAGGTGGCG ACTGGTAGCG TCGCCATAGG GATGCTGTAC
GTGATCAAAG ACCTAGATCC AGGCGTGACG CCGCCCGAGT ACATCGGCAT GTCCAACAGG
CTCTTTCCCC GGCTCCTCGC CGCTGTAAGG CAACACGGCG TGGAAATCGT CCAAGAGATA
GTAGAAAGAA GAGCGCTATG A
 
Protein sequence
MKILLMGCGN IGKYIYNALS QRHEVAVADK AGGCPSTIAR DALEVPLGGY DLVINALPGN 
IAYKASRRAL EVGVDVVDVS FFPEDPFELD EVTKKSGARY IPDAGVAPGL SNVLAGRLVA
ELGKVDELGI YVGGIPERPV GPLGYSITWS PLDLIEEYTR PARVRRSGEL VSVDPLSGVE
LVPSPLGMLE AFYTDGLRTL LKTLDVPNMY EKTLRWPGHI EKIKLLRDLG FMSEEGDPPP
RLVTANLLSR LKFDVPDVVY MKVVGSGGQK KVQYEVTVRP RAGWTAMQVA TGSVAIGMLY
VIKDLDPGVT PPEYIGMSNR LFPRLLAAVR QHGVEIVQEI VERRAL