Gene Pars_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2233 
Symbol 
ID5056394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2000370 
End bp2001674 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content51% 
IMG OID640469786 
Productmalate dehydrogenase 
Protein accessionYP_001154431 
Protein GI145592429 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.70342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.683923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGAAA AATGGTACCA ACTCTCGGTT GAGACGCACC GGAGATACGG CGGCAAAATC 
TCTGTAATAC CAAAGGTGCC GGTTAGGTCT ATAGAAGATT TCGCAATATA CTACACACCT
GGTATAGCTG AGGTGTCGCG CCAGATTCAC AAAAACCCAG AAATGGCGTT TGAGCTTACC
TCTAGGTGGA ATATTATTGG CGTATTGACA GACGGCACAA GAGTCCTAGG TCTAGGCAAC
ATAGGCCCAG AGGCGGCGTA TCCTGTAATG GAAGGCAAAG CACTTATTTT CAAGTACTTA
GGAGGAGTAG ACGCTATTCC CATTCCTATT AGGGTGCGGA CGCCTGAGGA GTTCATATTT
GTAGCAAAGG CCCTCGAACC GGCGCTGGGA GGTATAAACC TCGAAGATAT AGAGTCCCCC
AAGTGCTTCT ACCTGCTAGA CAAGTTGCGA GAAGAGTTGA AAATCCCGGT GTGGCACGAC
GATCAGCAAG GCACAGCCAC CGCAACGCTC GCGGGACTTA TAAACGCGCT TAAGCTCGTG
GGTAAGAAGT TCAGCGATGT CGTGATCGCC CTTATAGGCG CAGGCGCCTC GAATATATAC
ACTGCCCGCA TCCTTATCAA ATACGGCGCT AAGCCGGGAA ACCTCATCTT GGTAGACAGC
AAGGGGATTC TCCACCCCGA GCGCGACGAC ATAGACAAAA TGATGCTCGA AAACCCGTGG
AAGTATAAAT ACGCCATTGA GACCAACGCA GAGCGGCGTA AAGGCGGCAT TCCCGAAGCT
ATGAAAGGCG CAGATGTAGT TATTGGAGCG TCAAGGCCGG GTCCCGGCGT CATAAAGAAG
GAGTGGGTAG CATCAATGAA CAAAGACGCC ATCGTATTTG CCTTGGCCAA CCCCGTCCCC
GAGATCTGGC CCTGGGAGGC AAAGGAGGCT GGGGCCAAGA TTGTGGCTAC TGGGAGGAGT
GACTTCCCCA ACCAGATAAA CAACTCGTTG ATATTCCCCG CCGTGTTCAG AGGCGCCCTA
GACGTCAGAG CTACTACCAT AACTGATGAA ATGCTCATAG CCGCGGCAGA AGAGGTGGCG
AAATTCGCCG AGGAAAAAGG AATCCACGAA GAGTATATAG TGCCAAAGAT TACAGAGTGG
GAAGTTTATG TAAGAGAGGC GGCAGCCGTC GCGGCGATGG CCTCTTCACA AAGAGTGGCG
AGGATCCCGA GATCTTACAA CGAGGAGCTT GAGATTGCGA GGAGTATAAT ATCAAAAAGC
ATAAAGACTC TTGAAATCTT GATGAGGGAG AAAATAATTG AATAA
 
Protein sequence
MTEKWYQLSV ETHRRYGGKI SVIPKVPVRS IEDFAIYYTP GIAEVSRQIH KNPEMAFELT 
SRWNIIGVLT DGTRVLGLGN IGPEAAYPVM EGKALIFKYL GGVDAIPIPI RVRTPEEFIF
VAKALEPALG GINLEDIESP KCFYLLDKLR EELKIPVWHD DQQGTATATL AGLINALKLV
GKKFSDVVIA LIGAGASNIY TARILIKYGA KPGNLILVDS KGILHPERDD IDKMMLENPW
KYKYAIETNA ERRKGGIPEA MKGADVVIGA SRPGPGVIKK EWVASMNKDA IVFALANPVP
EIWPWEAKEA GAKIVATGRS DFPNQINNSL IFPAVFRGAL DVRATTITDE MLIAAAEEVA
KFAEEKGIHE EYIVPKITEW EVYVREAAAV AAMASSQRVA RIPRSYNEEL EIARSIISKS
IKTLEILMRE KIIE