Gene Pars_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1842 
Symbol 
ID5056206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1647859 
End bp1649178 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content49% 
IMG OID640469388 
Productisocitrate dehydrogenase, NADP-dependent 
Protein accessionYP_001154045 
Protein GI145592043 
COG category[C] Energy production and conversion 
COG ID[COG0538] Isocitrate dehydrogenases 
TIGRFAM ID[TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.282265 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTTG ATATAGAGAA GATAAAACAG CAAATCCCAA ACCTCGTAAC TTTCACAGGA 
AAGTATATAG ATCCGCCGAG CGGAGAATAC GTCAAGTATA CAGGCCCGGG ACAGCTGAAG
GTGCCTGATA AAGTAGTTAT AGGCTATATT GAGGGTGACG GAATTGGGCC AGAGGTTGCT
TACGCCGCAA TTAAGGTCGC CAACGAGGCG GTGGAAAAGG CCTATGGAAA GTCTAGGCAA
ATTACGTGGT ACGAAATCGT GGTTGGGGAG AAGGCAGAAA AGTTATTCGG CAACAGGTTG
CCAGATCAAA GCATAGAGGT GTTGAGAAAG ATTAGAGTAT TCCTCAAAGC CCCTCTCGAA
ACCCCTGTAG GGGGCGGGTT TAGAAGCATA AACGTGACGC TTCGCCAACT CTTCGACCTA
TATGCCAACA TTAGGCCTGT GAAGTACTTC CCCGGCTTGC CATCGCCTCT TAGACGCCCA
GAACTTGTAG ACTTGGTTAT ATTCAGAGAG AACACGGAAG ACGTCTACGC TGGTATAGAG
TGGCCTTATA ACAGCCCAGA GGCGGCGAAA ATTAGGGAAT TTCTACGCAG GGAATTCGGC
GTAAATATCA GAGATGACGC CGGCATAGGG ATAAAGCCTA TTAGCAAATT CGGTACTCAG
AGAATTGCCA GACTTGCGCT AAAGTTTGCC ATTGAGAACA AGAGACGGGT CGTAACCGTT
ATGCACAAGG GAAATATACA GAAATACACA GAAGGGGCCT TCAAAGAGTG GGCATTTGAA
GTGGCTAGAA ACGAGTTCAG AGAACATGTG GTATTTGAAG ACGAGTTGGC TCAGTACGGC
GGCTCTGTAC CACCAGGGAA GGTGCTTGTA AATGATAGAA TTGCCGATAA CATGCTCCAG
CAACTACTTA CGCGCACGGG GGAGTACGAC GTAATACTTG CCCCCAACCT AAACGGCGAC
TACGTCTCAG ACGAGGCCGC GGGCCTTGTG GGAGGACTTG GCGTCGCACC TGGCCTAGAC
GTAGGCGACT GGGGAATGAT GGCAGAGCCT GTACATGGAA CAGCGCCTAA GTACAGGGGC
AAGAACTACG TAAACCCAAC TGCCACAATA CTAGCTCTGG AACTGATGTT CCGCTTCCTA
GGATGGAGAG AGGTTGCTGA GTATATTATG AAAGGCGTCG AGACCGCATA CAGAGAAGGA
TATTTCACTG GCGACCTGGC TAGGCAGATG ACAGATGAGG AGAGAAAAAT GAGAGTCAAA
GAAGTACTCG GCACGCAAGA GTTCGCAGAC AAAGTGGTGG AGATTATAAA AAGACTTTAA
 
Protein sequence
MSVDIEKIKQ QIPNLVTFTG KYIDPPSGEY VKYTGPGQLK VPDKVVIGYI EGDGIGPEVA 
YAAIKVANEA VEKAYGKSRQ ITWYEIVVGE KAEKLFGNRL PDQSIEVLRK IRVFLKAPLE
TPVGGGFRSI NVTLRQLFDL YANIRPVKYF PGLPSPLRRP ELVDLVIFRE NTEDVYAGIE
WPYNSPEAAK IREFLRREFG VNIRDDAGIG IKPISKFGTQ RIARLALKFA IENKRRVVTV
MHKGNIQKYT EGAFKEWAFE VARNEFREHV VFEDELAQYG GSVPPGKVLV NDRIADNMLQ
QLLTRTGEYD VILAPNLNGD YVSDEAAGLV GGLGVAPGLD VGDWGMMAEP VHGTAPKYRG
KNYVNPTATI LALELMFRFL GWREVAEYIM KGVETAYREG YFTGDLARQM TDEERKMRVK
EVLGTQEFAD KVVEIIKRL