Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0992 |
Symbol | |
ID | 5055161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 883327 |
End bp | 884232 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468548 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_001153224 |
Protein GI | 145591222 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.673974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00851435 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTAG AAGGGGTCAT CGCGGCTACC GTAACTCCCT TTACAAAAGA CGGGGTCAAC TACGAGGGGC TCAGAATCAT CTTGTCTAGG ATCGTCGAGG CGGGCTACCA CGGGGTTTTC CCCACATCCT CCACCGGCGA GGTGACTAAG CTCACGCCGG AGGAAAGGGT AAAAGTTATG GAGGTGGCCA AGGAGGTGGC GGGTGGGAAG GCCCTAGTCA TCGCGGGGAC TGGGACAGGC GATCATCTAT CTACAATCGA TATGGTCAGG AGGTACAAGG ACGTGGGGGT GGACGTCGTC CTGATCACCC CACCGTACTA CATACAGTAC GATTGGGCCG CCATATACGC CTTCTACAAG AAAGTCCTCG ACAAAACGGA TGTCCCCGTT ATCCTCTACA CAATCCCCCT AGCCACGGGG TACAACATCC CCGTCGAGGT GTTCGAGCTT GTGGCCAACG AGTACAGCCA AGTCGTCGGA GTTAAGGACA GCTCTGGCGA CTTCCGCTAC CACCTCGACC TCATCTACCT CCTTGGGAGG CGTCTGTCTG TGCTCCAGGG GCTGGACATG CTCTTCGTGC CCTCACTCAT AATGGGCGCC CACGGCGGCA TCCTTGCAGG GCCCAACTTC TTAGGCAAAA CAACGCTGGA GCAGTACCGC CTAGTTAAAG AGGGAAAAAC CGCCGAGGCC GTGTCCCTCC ACAACAAGCT TATGCCGCTA TGGCGCTTCA TGGGCGGCTG CGGCTTGGTG GGGAAGCTAG GAGGCAAGTG GCCCACGCTG TACAAGCTTG CAACCCAACT AGTCCACGGC ATAGACATGG GACCGCCGAG GGAGCCCCTT CCGCCGGTGG AGGATAAGGA CAGGAAAGAA CTGGAAAAAA TCTTGAAGGA GCTCGGCCTA ATATAG
|
Protein sequence | MKLEGVIAAT VTPFTKDGVN YEGLRIILSR IVEAGYHGVF PTSSTGEVTK LTPEERVKVM EVAKEVAGGK ALVIAGTGTG DHLSTIDMVR RYKDVGVDVV LITPPYYIQY DWAAIYAFYK KVLDKTDVPV ILYTIPLATG YNIPVEVFEL VANEYSQVVG VKDSSGDFRY HLDLIYLLGR RLSVLQGLDM LFVPSLIMGA HGGILAGPNF LGKTTLEQYR LVKEGKTAEA VSLHNKLMPL WRFMGGCGLV GKLGGKWPTL YKLATQLVHG IDMGPPREPL PPVEDKDRKE LEKILKELGL I
|
| |