Gene Pars_2247 details

Gene Information

Locus tag: Pars_2247
Symbol:
ID: 5055067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2014893
End bp: 2015909
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 59%
IMG OID: 640469800
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001154445
Protein GI: 145592443
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
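
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2014893
end_bp = 2015909

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.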


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0921714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.498116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGTGC GGTTCCCCCA GCACAGGCCT AGGCGTCTCA GAGCCAGCAA GCTGATTAGA 
GACGCCGTGG CGGAGACGTC GTTAGATCCC AGCGACTTCA TATACCCCAT TTTTGTCAAG
CCTTCCGGCG AGAAGGAGCA AATACCCTCC ATGCCCGGCC AGTACAGGTG GCCAGTGGGG
GACGAATTGA CAAGGCACGT GGAGGAGGCC CTCGCCTTGG GCGTGAACAA GGTGATTCTC
TTCGGGGTGG TGCCCGACGA GCTTAAAGAC TCGGCTGGGT CCCCTGGCTA CGACCCACAC
GGCGTGGTGC CGAACGCCAT CCGGTTGCTA AAACAGACCT TCGGCGACAA GTTGCTTGTG
TTCGCCGATG TCTGTCTCTG CGAGTACACA GACCACGGCC ACTGCGGCAT TGTAAGAGAG
AGGCGGGGAA GGTGGTATGT GGACAACGAC GAGACTATAA AGCTGTACGC AAAAGAGGCA
GTGACCTACG CCGACGCCGG TGCGGATTTT GTAGCGCCGA GCGGCATGAT GGACGGGCAA
GTAGCCGAAA TAAGAAAAGC CCTAGACGCC CACGGCTTCC ACGACGTGGG GATAATGGCC
TACAGCGCCA AGTACGCCTC TGCGTTCTAT GGCCCTTTCA GAGTGGCGGC GGCCTCTGCG
CCTAAGTTCG GCGACAGGAG GACGTACCAG ATGGACCCCA GAAACGCCTA CGAGGCCGTC
AAGGAGGTTA TGCTCGACTT AGAAGAAGGC GCAGATATCG TCATGGTCAA GCCGGCGCTG
GCATACCTCG ACGTAATCCG CCTCGTGAAG ACGCACTACC CGTGGGCGCC CCTCGCCGCT
TACAATGTGT CGGGGGAGTA CTCCATGGTC AAAGCCGCCG CCTCCCTCGG CTACGTAGAC
GAACGCATCG TCACGTTGGA GATACTAACC GCCATAAAGA GGGCAGGAGC TCAGCTAATC
CTCACCTACC ACGCCCTCGA AGCCGCCAGG TGGTTGAAGG AGGGCGTGCC GTTTTAG
 
Protein sequence
MHVRFPQHRP RRLRASKLIR DAVAETSLDP SDFIYPIFVK PSGEKEQIPS MPGQYRWPVG 
DELTRHVEEA LALGVNKVIL FGVVPDELKD SAGSPGYDPH GVVPNAIRLL KQTFGDKLLV
FADVCLCEYT DHGHCGIVRE RRGRWYVDND ETIKLYAKEA VTYADAGADF VAPSGMMDGQ
VAEIRKALDA HGFHDVGIMA YSAKYASAFY GPFRVAAASA PKFGDRRTYQ MDPRNAYEAV
KEVMLDLEEG ADIVMVKPAL AYLDVIRLVK THYPWAPLAA YNVSGEYSMV KAAASLGYVD
ERIVTLEILT AIKRAGAQLI LTYHALEAAR WLKEGVPF
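
Both sequences above are printed in blocks of 10 residues, 60 per line, so the spacing has to be stripped before the text can be passed to other tools. A minimal sketch of that cleanup follows; `join_blocks` is a hypothetical helper name, not part of any library, and the two sample lines are the first and last lines of the gene sequence as shown in this record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and return
    one contiguous, upper-case sequence string."""
    return "".join(text.split()).upper()


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCATGTGC GGTTCCCCCA GCACAGGCCT AGGCGTCTCA GAGCCAGCAA GCTGATTAGA "
last_line = "CTCACCTACC ACGCCCTCGA AGCCGCCAGG TGGTTGAAGG AGGGCGTGCC GTTTTAG"

first = join_blocks(first_line)
last = join_blocks(last_line)

print(len(first), first[:3])   # length of the first line and its first codon
print(len(last), last[-3:])    # length of the last line and its final codon
```

Applying the same helper to the whole gene or protein block gives a single string whose length can be compared against the Gene Length and Protein Length fields.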