Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1666 |
Symbol | |
ID | 4601248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1612873 |
End bp | 1613778 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774439 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_921064 |
Protein GI | 119720569 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.458011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGCCA GGTTCTACGG AGTGATATCC CCATTCATCA CGCCGTTCAG GGAGGACCTC TCGCTGGACA GGGAGGCGGT CGCCTGGCTC GCCAGGTACC AGGCCGAGAA GGGGGTTCAC GGGATCTTCC CGAACAGCAC TACCGGGGAG TTCGTGCACC TATCGAGGGA GGAGGCCGTC GAGGTAACGA GGCTGGTCCT GGAGGCTGTC GGCGGCAAGG TCTGGGTTAT CCCGGGTATC AGCGCTAACT ACACTGAGGA CTCCGTCGCT CTCGGGAGAA CCTTCAAGGA CTTGGGGGTC GACGGCGCCG TGGTTACTCC TCCCTACTTC TTCAAGGTGT CCCCGGAGAG GCTGAAGGTC CACTTCTCGA CTATCCTCGA AAAGGTAGAC CTCCCGATAA TAGTGTACAA CATACCGGCG ACTACGGGGA TCAACATACC GGTGGGGCTC TACCTGGAGC TCGCGAAGGA GCACAGCAAC CTGGCGGGCG CCAAGGCTAC CGTCGAGAGC TTCACCTACT TCCGCCAGCT GGTACAGGTA GTGAAGGCTG AGAGGAAGGA CTTCGCCGTG CTGACAGGGC TCGACGACCT CCTGCTACCG GTGCTGATGA TGGGAGGCGA CGGCGGGATA ATGGCGCTCG CAAACGCCGC CCCGCAGATA CACCGCGAGG TCTACGACGC GTACAGATCC GGGGACCTGA AAAGGGCGTT GGAGGCTTGG CACAAGCTCT TGAGGCTCGT ACGCGTCTAC GACTACGCCA CCTCCTTCCC GACCTCCGTG AAGACTTTGC TGAAAGTCAT GGGTGCCCCG GTAAAGCCGT ACGCTAGGAC GCCTCTCACC CCGGAGACGC GGGAAGTGGA GGAAAAGATA GCGCAGATAG CTAGGGAGCT GGGCCTCAAA ATATAA
|
Protein sequence | MSARFYGVIS PFITPFREDL SLDREAVAWL ARYQAEKGVH GIFPNSTTGE FVHLSREEAV EVTRLVLEAV GGKVWVIPGI SANYTEDSVA LGRTFKDLGV DGAVVTPPYF FKVSPERLKV HFSTILEKVD LPIIVYNIPA TTGINIPVGL YLELAKEHSN LAGAKATVES FTYFRQLVQV VKAERKDFAV LTGLDDLLLP VLMMGGDGGI MALANAAPQI HREVYDAYRS GDLKRALEAW HKLLRLVRVY DYATSFPTSV KTLLKVMGAP VKPYARTPLT PETREVEEKI AQIARELGLK I
|
| |