Gene Pars_2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2266 
Symbol 
ID5055862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2028062 
End bp2029066 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID640469818 
ProductD-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding 
Protein accessionYP_001154462 
Protein GI145592460 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1052] Lactate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.098436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0903762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAT ACAAAGTTGT TGTCTTATCG CCTGTACCTG AGGCATTGAT AAAAATGTGG 
GCTACGCCCA TTGCTCAAAA ATATGGCATA CCCATCGAAG AGATTGAAGT TGTCACCCTC
TTCGAGCCCA ATTACGAGGA GGTAGCTCGC CAGGTTGCCG ACGCAGACGT TGTGGTTGGC
GACTACACCT TCAGAATTAA AATCGACGCC GACTTGTGCC AAAAAATGTC TAAGGTGAAA
CTAATTCAAC AGCCAAGCAC CGGCTACGAC CACATAGACG TCGTAGCTTG CGCAAAGAGG
GGCATCCCAG TGGCGAATAT CGGTGGGGCT AACTCCATCT CTGTGGCCGA GCACACCATA
ATGCTGGCCT TGATGTTGTT GAAGAGAGCT GTGTACGCTC ACCAGAAGCT AGTTAACGGC
CAGTGGACGC AGGGGGAGCT CATGAACACT GTTGGGGAGC TCTATGGCAA AACATGGGGG
ATACTCGGCA TGGGCAGAAT TGGAAAGGAG GTCGCCATAA GGGTCCTAGC TTTTGGTGCC
AAAGTTATTT ACTACGACGT CGTGAGGAGG GAAGATGTAG AAAAACTGGG AGTCGAGTAT
AGGCCTTTCA ACAGATTGCT GGCGGAGAGC GATGTGCTTA GCATCCACGT GCCTCTTACA
GAGAAGACAA GGGGCATGAT CGGGGAGCGG GAGCTTAGGA TGATGAAGCC CACCGCGGTG
CTTATCAACG TCTCGCGCGG CGAAATCACC GACGAAGAGG CACTCGCTAA AGCTGTGCGC
GAGGGCTGGA TTGCCGGGGT GGGGGTAGAC GTATTCTCCG TCGAGCCCCC GCCTCCGGAT
CATCCATTGT TACAAGTCGC AAGGGAGGGC TTCAACGTCA TCGTCACGCC GCATATCGCC
GGCGCCACCA ATGAGGCTAG GATGAGAATT ATCAACGTGA CGCTAGATAA CGTGTTGAGA
GTGCTTGCAG GTCTAAAGCC TGAAAATGTG GTGAATATGC CATGA
 
Protein sequence
MAKYKVVVLS PVPEALIKMW ATPIAQKYGI PIEEIEVVTL FEPNYEEVAR QVADADVVVG 
DYTFRIKIDA DLCQKMSKVK LIQQPSTGYD HIDVVACAKR GIPVANIGGA NSISVAEHTI
MLALMLLKRA VYAHQKLVNG QWTQGELMNT VGELYGKTWG ILGMGRIGKE VAIRVLAFGA
KVIYYDVVRR EDVEKLGVEY RPFNRLLAES DVLSIHVPLT EKTRGMIGER ELRMMKPTAV
LINVSRGEIT DEEALAKAVR EGWIAGVGVD VFSVEPPPPD HPLLQVAREG FNVIVTPHIA
GATNEARMRI INVTLDNVLR VLAGLKPENV VNMP