Gene Pars_0490 details

Gene Information

Locus tag: Pars_0490
Symbol:
ID: 5056406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 437290
End bp: 438705
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 57%
IMG OID: 640468052
Product: D-lactate dehydrogenase (cytochrome)
Protein accession: YP_001152737
Protein GI: 145590735
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID: [TIGR00387] glycolate oxidase, subunit GlcD
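
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in one stop codon that is not counted in the protein length. Both are assumptions rather than statements from the record, although they agree with the reported values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(437290, 438705)
print(length_bp)                  # 1416, matching "Gene Length"
print(protein_length(length_bp))  # 471, matching "Protein Length"
```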


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.450377
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCC AAAGCGTGAT TGGGGTATTT GCAAAAAAAG CACGTGAAAT CTTAGGAGAG 
GAAGGCGTTC TTCAAGAAGA AGTGGATCTA CTCGTCTACG AGCAAGACGG CACCTTGGCG
CTTAGAGGGA GAGCTGATGT CGTCGTCTTC CCGCGCACGA CGGAGGAAAT GGCGAAAGTC
GTAGAGCTTG CCTACAACTA CGACATTCCG ATTATAGGCA GGGGCTCGGG CACAAGCCTC
AGCGGCGGGG CGGTTCCCGT CAAGGGCGGC GTGATTGTGA GCACAGCCCG CATGAACAAA
GTTTTGGAGA TAGACTTAGA CAACGAAGTC GCAGTGGTCC AGGCGGGAGT CGTGAACGAC
TGGATAAACT CCTACCTAGC GCGGATGGGC TACCAGTACG CCATAGATCT GGGCTTCCAG
TATATGGCGG ATCCCGCCTC GCAGAGGATC TCCACGATAG GCGGTAACAT AGCCCATAAC
TCGGGTGGCG TTAAATGCTT TAAATACGGC GTGACGGTCA ACCAGATCAG AGGCCTCACA
GTGGTTTTGC CCACGGGCGA GGTGAGGAAG ATAGGCGGCA AGGAGTTCGA ACAAGCCGGT
TACGACTTGA TAGGCCTCTT GGCAGGCTCC GAGGGCACTT TAGCCCTAGT CGCGGAGGCT
GTGCTCAAGA TAGTCCCGAC CTACGAGACT TCGGCTACTA TACTTGCGAA ATTCGACGAT
CTGTCTGTGG CCGGCCGCGC CGTGTCGGCC GTCATAGCCT CGGGCGCCAT GCCGGTGGCC
ATGGAGCTCA TGGACAAGCT TGCTGTCGAG GCCGTCGAGT CAGGCCCCTA CGCTGGCGGC
CTCCCGAGAG ATGCCGAGGC GATCTTATTA ATACAAGTCG AGGGATCCCC GCCCGGAGCC
AAGGAAGAGG CCGCCAAGGT GGCTGAGATC TTGAGGAGAA ACGGCGCAGT GGGGGTCGAA
GTGGTCGAGG ACCCAGCGCG GGCCGCGAAA CTATGGGCCG CCAGGAAGCA AGCGTTCGGC
GCCATGGGCT TCGTAGGTCC CAACTACGTA GTCGAAGACG GCACGATACC CCGGAAGAAG
CTCGCTGAGG CCTTGATGAT AGCGAGGGCC GCCGGCGCGA AGAGGGGGCT TAGAGTTGCC
AACGTGTTCC ACGCAGGAGA TGGGAACCTG CATCCGTTGA TACTTTACGA CGAGAGGAAG
CCCGGCGAGC GGGAAAAAGC CATCGAGGCG GGTGAGGAGA TCCTCGAGGC TTGTGTGGAG
CTCGGCGGAA CGATAACTGG GGAACATGGA GTTGGTTACA TGAAGAAGAA GTTGTTGCAC
AAGATGTATA AGAGGGAGGA GATAGAGCTG ATGAAGGCGA TCAAAGCGAT CTTCGATCCT
AAGGGGCTCA TGAACCCCGG CAAGATATTC CCATGA
 
Protein sequence
MMRQSVIGVF AKKAREILGE EGVLQEEVDL LVYEQDGTLA LRGRADVVVF PRTTEEMAKV 
VELAYNYDIP IIGRGSGTSL SGGAVPVKGG VIVSTARMNK VLEIDLDNEV AVVQAGVVND
WINSYLARMG YQYAIDLGFQ YMADPASQRI STIGGNIAHN SGGVKCFKYG VTVNQIRGLT
VVLPTGEVRK IGGKEFEQAG YDLIGLLAGS EGTLALVAEA VLKIVPTYET SATILAKFDD
LSVAGRAVSA VIASGAMPVA MELMDKLAVE AVESGPYAGG LPRDAEAILL IQVEGSPPGA
KEEAAKVAEI LRRNGAVGVE VVEDPARAAK LWAARKQAFG AMGFVGPNYV VEDGTIPRKK
LAEALMIARA AGAKRGLRVA NVFHAGDGNL HPLILYDERK PGEREKAIEA GEEILEACVE
LGGTITGEHG VGYMKKKLLH KMYKREEIEL MKAIKAIFDP KGLMNPGKIF P
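
The sequences above are printed in blocks of 10 characters, 60 per line, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the blocked gene sequence could be cleaned and its length and GC fraction computed for comparison with the Gene Length and GC content fields. The helper names are hypothetical, and the method the source database uses to compute GC content is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATGCGCC AAAGCGTGAT TGGGGTATTT GCAAAAAAAG CACGTGAAAT CTTAGGAGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases per full line
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` and then `len` and `gc_fraction` gives values that can be compared directly with the 1416 bp and 57% reported in the table.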