Gene Pars_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0957 
Symbol 
ID5054170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp848460 
End bp849479 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content57% 
IMG OID640468513 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_001153189 
Protein GI145591187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.538451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0143726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGGT TTAAGGTCTA CGTGCTGGGC GCCACGGGTC TAGTGGGGCA GAGATACGTC 
CAGCTACTCG CCTCGCATCC ATGGTTTGAG ATTGTAGGGC TGGCCGCCTC TGAGAAGAGC
GCCGGGAAGA AGCTGTCAGA AACTGGGTGG GTTCTCGAGG AGCCGCCTCC GCCCAGCGTG
GCGGAGATGA GAATAGAGAA AATTGACGTG GAGAAGGTAC CGAGGGTGGA CTTCGTCTTC
TCCGCCCTGC CCAGCGAGGT GGCGGCCAAG GTAGAGCCGG AGCTAGCGGC GAGGGGCTTC
ACGGTGTTGT CCAACTCCAG CAATATGAGG ATGGACCCAG ACGTCCCCCT AGTTATACCT
GAGGTAAACC CTGAAGACTT ATCTCTGGTG GAGAAGCAGA GGGCGACGAG AGGCTGGCGC
GGCGCCGTGG TGAAGAAGCC TAACTGCACC ACGACTATCC TCAACCTGCC CCTGAAGCCC
ATACTAGACG AGTGGGGCAT CGAGAGGATC CACGTGGTCA CCATGCAGGC GCTTTCCGGC
GCCGGCTACT CGGGTGTGCC CTCAGTCGCG ATCGTAGACA ACCTAATCCC CTTCATAAGG
GGCGAGGAGG AGAAGGTGGT GGCAGAGACC AGGAAGATAC TTAAGCAAGA CTTCGAGATC
TTCGCGACGA CTACAAGAGT GCCCGTGTTA GACGGCCACA CAGAGGTTGT GTACGTTGAT
ACTAAAAAAG ACTTCGACAC GGCAACTGTT ACGGAGATAT TTGAGAAATT TAAAGGACTG
CCACAAGAGT TGAAGCTACC AACAGCGCCG CCGCGGCCTA TAGAGATAAG AGCACAGATA
GACAGGCCCC AGCCGAGACT CGACAGGTGG GCCGGGAGAG GAATGGCCGT CGTGGTGGGA
AGGGTGAGAA AACTTGCCCC GCGGAAGCTC GCCTTCGTTA TACTCGGCCA CAACACAGTC
AGAGGCGCCG CCGGTAACTC GATTTTAACT GCTGAGTTAA TTGTCGCGAC AAGGCGTTAG
 
Protein sequence
MDRFKVYVLG ATGLVGQRYV QLLASHPWFE IVGLAASEKS AGKKLSETGW VLEEPPPPSV 
AEMRIEKIDV EKVPRVDFVF SALPSEVAAK VEPELAARGF TVLSNSSNMR MDPDVPLVIP
EVNPEDLSLV EKQRATRGWR GAVVKKPNCT TTILNLPLKP ILDEWGIERI HVVTMQALSG
AGYSGVPSVA IVDNLIPFIR GEEEKVVAET RKILKQDFEI FATTTRVPVL DGHTEVVYVD
TKKDFDTATV TEIFEKFKGL PQELKLPTAP PRPIEIRAQI DRPQPRLDRW AGRGMAVVVG
RVRKLAPRKL AFVILGHNTV RGAAGNSILT AELIVATRR