Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0101 |
Symbol | |
ID | 6164362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 86789 |
End bp | 87598 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641667268 |
Product | hypothetical protein |
Protein accession | YP_001793505 |
Protein GI | 171184586 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00117454 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGTTGCTCG TCCACGCAGA CGACGGCCGC TGGGTAGCAA GGGTTTTGAG GGAGCTGGGC GCCGAGGCGG TGGAGGGCGT GGCGACGTGG GACGAGGTGC GCTTGATCCA CAGCGAGATG TTTGTGAAGG CCGTGGCGAA ACACGAGGCG GAGGTCCTGG CGGCAGCCGC GGTGGCTAAT GCGGCTTTGG CCAAGGGACG CGCCGTCTTC TACGTGGGGG GCACCGCCAT GGCTGGCCTC CACGCGCCCA GGAGCAACCA GCTCTTCAAC GCGGCGGTCT ACATAGCGAA GAGGTCCGGC GCGGCGGTGA TCTCCATAGA CAGACACTTC CCAACGGGGA CGTGGGAGCT CCACCTGGAA CACGGCTTCC CCCTCTACCT GGTGTACGGA GGGGCCGAGG GCCCAACTAG GAGGCAGCTC GCAGGGCGGA GGGAGGCAAC GGCCTTCCCG CTGCCGCCGG GGGTCGGAGA CGGCGGCTTC TGGAAGATCG CCCGCCACGT GCTGGAGCAG TGGGACGGCC CCGTGGTAAT ACAGCTGGGC TTCGACATAC ATAGGGAGGA CCCCACCGGC TACCTCTTCG CCTCGGAGAC GTTCTACCAC AAGCTCGGGA GGGCCCTGGC GGGGAGGGAG TTCTACATAT CGATCGAGTG CCCCTCGACG CCACGGGTGC TGAGGGCGGC CCTGGAGGCG CTTTTGTCCG GCATAACAGG CGGTCCGCCC CCACGGGCTG ACGCCGGCTG GGAAAGCCCG GAGGCGGCTA GGGAGGTGGA CAGGATGCTT AAATCGGCTA GGCGCCCCGC CCGCCGGTAG
|
Protein sequence | MLLVHADDGR WVARVLRELG AEAVEGVATW DEVRLIHSEM FVKAVAKHEA EVLAAAAVAN AALAKGRAVF YVGGTAMAGL HAPRSNQLFN AAVYIAKRSG AAVISIDRHF PTGTWELHLE HGFPLYLVYG GAEGPTRRQL AGRREATAFP LPPGVGDGGF WKIARHVLEQ WDGPVVIQLG FDIHREDPTG YLFASETFYH KLGRALAGRE FYISIECPST PRVLRAALEA LLSGITGGPP PRADAGWESP EAAREVDRML KSARRPARR
|
| |