Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2107 |
Symbol | |
ID | 5055627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1882709 |
End bp | 1883695 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640469659 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001154305 |
Protein GI | 145592303 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.462909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTACA TAGTGGAGTC GTATCAGTCA GGAAAAGCCT TGAAAGAGGA AATAGAGTCG AGAGGAATCC CGGCGTGGTA TGTAGAGCTC TGGGGGAACT ATATAGTGGC GACGCCGCCT GGCACAAAAA TAGACGGACT GAGAACCCCA GTTAAGGCCG TGGTGGAGCT CAAAACCGAC TACCAGCTAG TGTCGCGGCA GTGGAAGCGC GACCCCACCC CAGTTGTCAT AGGAGATAGG GAGATAAGAG AGGGCAAGAT ATTCATAATC GCCGGCCCCT GTTCGGTAGA AACCGAGGAG CAGATATTGA CCACAGCGCG TGCTGTCAAG GAGGCCGGCG CCGACGCTCT CCGCGGCGGG GCTTTTAAGC CTAGGACGAG CCCCTACGCC TTCCAGGGAC TCGGCGAGAG AGGGCTGATC CTCCTCGCCA AGGCCCGCGA AGCCACGGGA CTGCCCATAA CCACAGAGCT TATGGACCCC GAGGACCTGC CCCTCGTAAC TAAGTACGCC GATGCGATAC AGGTCGGCGC CAGGAACATG CAGAACTTCA CCCTCTTGAA GAAGCTCGGC CGAGCGGAGA AGCCCATCTT GTTGAAGCGC GGCTTCGGCA ACACAATAGA CGAGTGGTTA CTGGCCGCTG AGTACGTGGC CTTGCACGGC AACGGCAACA TTGTGCTGGT GGAGCGCGGG ATAAGGACTT ATGACAAGAC GCTGAGGTTC ACCCTCGACG TCGGGGCCAT TGCATTTGCC AAGCAACACA CCCACCTCCC CGTAATCGGC GACCCAAGCC ACCCAGCCGG GGACCGCAGA TACGTCATAC CCCTCGCCCT GGCTATACTC GCCGCCGGCG CCGATGGCCT CATCGTCGAG GTACACCCAG ACCCAGACAA GGCGTGGAGC GATGCAAAGC AACAACTCAC CTTCCAACAG TTCGAAGAAC TCGTAGCCAA GGCGAAGGCG CTGGCCAAAG CTCTGGGAAA AGACTAG
|
Protein sequence | MLYIVESYQS GKALKEEIES RGIPAWYVEL WGNYIVATPP GTKIDGLRTP VKAVVELKTD YQLVSRQWKR DPTPVVIGDR EIREGKIFII AGPCSVETEE QILTTARAVK EAGADALRGG AFKPRTSPYA FQGLGERGLI LLAKAREATG LPITTELMDP EDLPLVTKYA DAIQVGARNM QNFTLLKKLG RAEKPILLKR GFGNTIDEWL LAAEYVALHG NGNIVLVERG IRTYDKTLRF TLDVGAIAFA KQHTHLPVIG DPSHPAGDRR YVIPLALAIL AAGADGLIVE VHPDPDKAWS DAKQQLTFQQ FEELVAKAKA LAKALGKD
|
| |