Gene Apar_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1039 
Symbol 
ID8413912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1173418 
End bp1174422 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content45% 
IMG OID645022628 
ProductD-isomer specific 2-hydroxyacid dehydrogenase NAD-binding 
Protein accessionYP_003180058 
Protein GI257784841 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1052] Lactate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT TGTTTTATGG TGCTCAAAGT TACGACAAAG ATTCTTTCAA CCTTGAACTT 
CCAAAGTATC CTGATGTTTC AATCGACTAC ATCGAGTCCA ACCTTACACC TATGACCGCG
GGCTTTGCCA AGGGTTACGA CGCTGTCTGT GCTTTTGTCA ACGCAGATGC TTCAATTCTG
ACTCTTGAGA TTCTTGCAGG CTTTGACGTA AAGCTTTTAC TCATGCGTTG TGCTGGATTT
GACGCAGTTG ACGTTAATGC AGCAAAAGAT TTAGGCATTA CCGTAACGCG TGTACCTGCT
TACTCTCCTG AGGCAATTGC AGAGCACGCA ATGGGTCTTG CGCTTGCTGC TAACCGTCGT
ATTCACCGCG GTTATAACCG TATTCGCGAG AACAACTTCT CACTCGTCGG TCTTGTCGGA
GAAACACTGC ATGGTAAAAC CGCAGGTATT GTTGGCACTG GTCGTATCGG TGCTGCACTT
TGCCGCATCT GCAAGGGCTT TGGTATGCAC GTTCTTGGCG CAGATCTCTA CCCAAACATG
AGTCTTGTAA ACGATGAACA TGTCGTTGAT GAATATGTAA GCTATGACGA GCTTTGGGAG
CGCGCTGATT TCATTTCTCT TCATGCATTC TTAAATGAAG AGAGCTATCA CATGATTAAC
GATAAAACTA TCGGCAAGAT GAAAGATGGT GTTGTCCTCG TCAATACCGC ACGTGGTGCC
CTTGTCGATA CCAAGGCTCT TATTCGTGGC ATTCTTTCTG GCAAAATTGG TGCTTGCGGC
CTTGACGTTT ATGAAGAGGA AAATCCAAAC GTCTATAAGG ACCGTGCCGC TGAGGTATTT
GACTCAGTCA CCTCTACTCT CTGTTCATTC CCTAACGTAG TTATGACAAG TCACCAGGCA
TTCTTTACTC ATGAAGCACT CTCTCAAATT GCTCAAGTTA CACTTGATAA TGCAACTGCT
TTTGCTAAGG GCACTGATTA CGTTGATAAA AGTGTGGTTT GCTAA
 
Protein sequence
MKILFYGAQS YDKDSFNLEL PKYPDVSIDY IESNLTPMTA GFAKGYDAVC AFVNADASIL 
TLEILAGFDV KLLLMRCAGF DAVDVNAAKD LGITVTRVPA YSPEAIAEHA MGLALAANRR
IHRGYNRIRE NNFSLVGLVG ETLHGKTAGI VGTGRIGAAL CRICKGFGMH VLGADLYPNM
SLVNDEHVVD EYVSYDELWE RADFISLHAF LNEESYHMIN DKTIGKMKDG VVLVNTARGA
LVDTKALIRG ILSGKIGACG LDVYEEENPN VYKDRAAEVF DSVTSTLCSF PNVVMTSHQA
FFTHEALSQI AQVTLDNATA FAKGTDYVDK SVVC