Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0957 |
Symbol | |
ID | 5054170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 848460 |
End bp | 849479 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468513 |
Product | aspartate-semialdehyde dehydrogenase |
Protein accession | YP_001153189 |
Protein GI | 145591187 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.538451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0143726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGGT TTAAGGTCTA CGTGCTGGGC GCCACGGGTC TAGTGGGGCA GAGATACGTC CAGCTACTCG CCTCGCATCC ATGGTTTGAG ATTGTAGGGC TGGCCGCCTC TGAGAAGAGC GCCGGGAAGA AGCTGTCAGA AACTGGGTGG GTTCTCGAGG AGCCGCCTCC GCCCAGCGTG GCGGAGATGA GAATAGAGAA AATTGACGTG GAGAAGGTAC CGAGGGTGGA CTTCGTCTTC TCCGCCCTGC CCAGCGAGGT GGCGGCCAAG GTAGAGCCGG AGCTAGCGGC GAGGGGCTTC ACGGTGTTGT CCAACTCCAG CAATATGAGG ATGGACCCAG ACGTCCCCCT AGTTATACCT GAGGTAAACC CTGAAGACTT ATCTCTGGTG GAGAAGCAGA GGGCGACGAG AGGCTGGCGC GGCGCCGTGG TGAAGAAGCC TAACTGCACC ACGACTATCC TCAACCTGCC CCTGAAGCCC ATACTAGACG AGTGGGGCAT CGAGAGGATC CACGTGGTCA CCATGCAGGC GCTTTCCGGC GCCGGCTACT CGGGTGTGCC CTCAGTCGCG ATCGTAGACA ACCTAATCCC CTTCATAAGG GGCGAGGAGG AGAAGGTGGT GGCAGAGACC AGGAAGATAC TTAAGCAAGA CTTCGAGATC TTCGCGACGA CTACAAGAGT GCCCGTGTTA GACGGCCACA CAGAGGTTGT GTACGTTGAT ACTAAAAAAG ACTTCGACAC GGCAACTGTT ACGGAGATAT TTGAGAAATT TAAAGGACTG CCACAAGAGT TGAAGCTACC AACAGCGCCG CCGCGGCCTA TAGAGATAAG AGCACAGATA GACAGGCCCC AGCCGAGACT CGACAGGTGG GCCGGGAGAG GAATGGCCGT CGTGGTGGGA AGGGTGAGAA AACTTGCCCC GCGGAAGCTC GCCTTCGTTA TACTCGGCCA CAACACAGTC AGAGGCGCCG CCGGTAACTC GATTTTAACT GCTGAGTTAA TTGTCGCGAC AAGGCGTTAG
|
Protein sequence | MDRFKVYVLG ATGLVGQRYV QLLASHPWFE IVGLAASEKS AGKKLSETGW VLEEPPPPSV AEMRIEKIDV EKVPRVDFVF SALPSEVAAK VEPELAARGF TVLSNSSNMR MDPDVPLVIP EVNPEDLSLV EKQRATRGWR GAVVKKPNCT TTILNLPLKP ILDEWGIERI HVVTMQALSG AGYSGVPSVA IVDNLIPFIR GEEEKVVAET RKILKQDFEI FATTTRVPVL DGHTEVVYVD TKKDFDTATV TEIFEKFKGL PQELKLPTAP PRPIEIRAQI DRPQPRLDRW AGRGMAVVVG RVRKLAPRKL AFVILGHNTV RGAAGNSILT AELIVATRR
|
| |