Gene Pars_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0646 
Symbol 
ID5055175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp574958 
End bp575989 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content56% 
IMG OID640468206 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001152889 
Protein GI145590887 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.411826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTGGT GGTCTATAGC TCTCTCTCCT CGTCTTTGGT TTTTCCTATT GATGTTCGCG 
CTGTCGGGGG GCGTCTTATT GACGGTAGTT TGGTTTGAGA GAAAGGCAGC TGCGAGGGTT
CAAATGAGGG TTGGGCCGTA TCACGTGTCG CCATGGTCTG GGGGGTATCT GCAACTTCTG
GCAGACGCCT TCAAGTTCAT TATAAGCGAG CCGATTGTGC CCCGCGGAGC GCACAAGGTG
CTATTTGTCT GGGGCCCGCC GCTCTTTGTA ACGCTCGCCT TCGGCGCCTC GCTACTCCTC
CCGCTGACTC CCGAACTTAG GCTTATAAAA GACCCCGCTC TTTTGCCCTA CGGCCTTATC
TTTTCTCTTG TAGTTCTCCT CCTGGTGTCC ATATCTGTGG TCATCATAGG CTGGTCTGTG
AATAACAAAT TTGCCTACGT AGGCGCGGCG CGCGAGGCCC TTCTGGTAGC CGCCTACGAG
CTTCCTCTTA TACTCTCCTT CTTGGCCATG GCGGTGCTGT ACGGCACGTT GAACCCACTG
GAGATTGTGA ACAAGCAGAG CTTGCTTGTG GGCGCCTTGT GGAACCCCCT CGCCTTCCTA
GTCTTCATAA TTGCCACTGC CATGGCCACA GCTAGGTTCC CCTTCGAAAT CGCCGACTAC
GAGGGAGACT TGGCGACAGG GCCTTACAGC GACTACGGTG GGATATTTCT CGTCCTATCC
TTCGCCGGCG GCACCTACTA CGCCACCTTC TCCTACTCCT TCCTCGCATC TCTGCTCTTC
CTCGGCGGGT GGGCCCTCCC AGGCTTTTCG GCAGGCCCCT GGCCCCAGGA TATTATAGGC
AATTTGATAT TGGCAATATG GGTATATGTA AAGGTAGTCG CCCTCATGTT TTTCTTCGCC
TTCCTGAGGG CAGCGATGCC CGTGCTGAGG CTTGACCACA CCCTAGCGCT CGGCTGGCGG
GGTCTCTTGC TCCTAGGCAT GGCCGGAGTT GTGTGGTCCG TAGTACTGAG GCTTGTGGGG
GTGGCGCCAT GA
 
Protein sequence
MDWWSIALSP RLWFFLLMFA LSGGVLLTVV WFERKAAARV QMRVGPYHVS PWSGGYLQLL 
ADAFKFIISE PIVPRGAHKV LFVWGPPLFV TLAFGASLLL PLTPELRLIK DPALLPYGLI
FSLVVLLLVS ISVVIIGWSV NNKFAYVGAA REALLVAAYE LPLILSFLAM AVLYGTLNPL
EIVNKQSLLV GALWNPLAFL VFIIATAMAT ARFPFEIADY EGDLATGPYS DYGGIFLVLS
FAGGTYYATF SYSFLASLLF LGGWALPGFS AGPWPQDIIG NLILAIWVYV KVVALMFFFA
FLRAAMPVLR LDHTLALGWR GLLLLGMAGV VWSVVLRLVG VAP