Gene Pars_2089 details

Gene Information

Locus tag: Pars_2089
Symbol:
ID: 5056299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1865120
End bp: 1866454
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 58%
IMG OID: 640469639
Product: D-lactate dehydrogenase (cytochrome)
Protein accession: YP_001154287
Protein GI: 145592285
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID: [TIGR00387] glycolate oxidase, subunit GlcD
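
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1865120, 1866454)
print(gene_len, protein_len)  # 1335 444
```

Under these assumptions the result agrees with the listed gene length (1335 bp) and protein length (444 aa).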


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.491336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTGG GTTTTCTCAG GAAGACGTTT GGCGATAGGT TTGTTGAAGA TTCTTCCGTT 
GCGGCGTTGT ACGTCCACGA CGCCTCTTTT GTGGAGGGGG AAAGCAACGT GCTGGGGGTG
GTCTTCCCCG AGACGGAGGG GGAGGTGGTT GAGCTGGTTA GGTGGGCTAT AAAGCATAAG
GTGCCGTTGT TCCCACAAGG GAGTGCCACC AGCCTCTCGG GCAACGCCGC GGCTACTGCT
AGAGGGCTCG TGGTGAGTTT TGAGAGGATG ACCAAAGTGG AGATAGACCC AGGGGACGGC
GTGGCTGTGG TCGGCCCCGG GGTGAGGATC GAGGAGCTGA ACGTCGAGCT GGCCCGGTAC
GGCTTCTTCT TCCCCGTCGA TCCCGGCTCT GTGAGGAGCG CCACGATTGG CGGGGCTATC
GCCAACGGAG CCGGCGGGAT GAGGGGGGCG AAGTACGGCA CAATAAAGGA CTGGGTGTTG
GGACTGAGAG TGGTGACGGG AAGAGGCGAC GTGTTGAAGG TGGGTTGCAA GACGTTCAAG
TGCCGGAACG GCTATGATCT TGTGAGGCTA TTTGTCGGTA GCGAGGGGAC GCTGGGCCTT
ATTACGGAGG CTGTTTTGAA GCTGGCTCCT GTGCCGGAGT CCGCCGTGGC CGTCTTGGCG
TATTATGACG ACGTGGAGCC GCTTGTAGAG GACGTGGTTA GGGTTAGGGC AAGCAGAATT
TGGCCGCTAT TTGCAGAGTT TTTAGACGCG CCGACTGCCG CCGTGGTGGG GCTTGAGGAG
AGAGACACCC TCTTTCTTGG CGTCGACGTC AATACAGGTG CAGAGGAGAG AGTTTTGAAG
AGACTCCAGT CTATCGTCAG GGGGAGAGTG GCCAGTGTGG CAGTGGGCTG GTCTGAAGCC
ATGAAGCTAC TGGAGCCGCG CAGGAGGCTA TACTCGGCGC AGGTTCACCT CGCTCAGAGA
GGCGGCGGCG TGTTGGTAAT TGAGGACGTT GCGGTGCCCA TTTCGAAGCT CCCAGACGCC
GTGAGGGGGC TTAAAAAGCT GGCGGAGAAA TACGGCGTAC CGCTGTTGCT AGGCGGCCAT
GTAGGCGACG GCAACTTACA CCCAGCTACT TGGTTTAGAA AAGAGGAGGG GCCCGGAAAG
GCGGAAAAGT TTATCAGAGA AATGGCGGAG CTCGTGGTTG GGCTAGGCGG TACAGTGTCG
GCAGAGCACG GGGTCGGGAC CTTGAAGAAA GATCTCATAG CGCTTGAGCT CGGCGATGCG
GTGCTTACAT ATATGAGGGA GCTTAAGAAG GTCTTCGACC CCTACAATAT CCTCAACCCC
GGCAAGATAG CCTAG
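
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in 10-base blocks. This is a small sketch, assuming GC content is the percentage of G and C among the unambiguous bases (A, C, G, T). The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (block separators, line breaks)
    are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G.
print(gc_percent("ATGGATGTGG"))  # 50.0
```

Passing the full gene sequence, spaces and line breaks included, gives the value for the whole gene.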
 
Protein sequence
MDVGFLRKTF GDRFVEDSSV AALYVHDASF VEGESNVLGV VFPETEGEVV ELVRWAIKHK 
VPLFPQGSAT SLSGNAAATA RGLVVSFERM TKVEIDPGDG VAVVGPGVRI EELNVELARY
GFFFPVDPGS VRSATIGGAI ANGAGGMRGA KYGTIKDWVL GLRVVTGRGD VLKVGCKTFK
CRNGYDLVRL FVGSEGTLGL ITEAVLKLAP VPESAVAVLA YYDDVEPLVE DVVRVRASRI
WPLFAEFLDA PTAAVVGLEE RDTLFLGVDV NTGAEERVLK RLQSIVRGRV ASVAVGWSEA
MKLLEPRRRL YSAQVHLAQR GGGVLVIEDV AVPISKLPDA VRGLKKLAEK YGVPLLLGGH
VGDGNLHPAT WFRKEEGPGK AEKFIREMAE LVVGLGGTVS AEHGVGTLKK DLIALELGDA
VLTYMRELKK VFDPYNILNP GKIA
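
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs from the standard code in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position), paired
# with the amino-acid string for the standard code / table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1; '*' marks a stop codon.

    Whitespace is removed first; a trailing partial codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 18 bases and last 15 bases of the gene sequence.
print(translate("ATGGATGTGG GTTTTCTC"))  # MDVGFL
print(translate("GGCAAGATAG CCTAG"))     # GKIA*
```

The two fragments reproduce the first six and last four residues of the protein sequence, followed by the stop codon.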