Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0796 |
Symbol | |
ID | 4601257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 750934 |
End bp | 752115 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773573 |
Product | hypothetical protein |
Protein accession | YP_920201 |
Protein GI | 119719706 |
COG category | [S] Function unknown |
COG ID | [COG1679] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCTCG ATAAGTTCGA GGAGAGGATG CTCGAAGGTG AGCTCGGAGA GGCTGTAGCC CTCGCTATGA GGATAGTTGT GAAGATTGCG GAGATCTTCT CCGCCGAAAG GCTCGTGAAG ATTAAGCATG CACACGTCTC CGGGGTTTCC TACGAGAACA TAGGAGACGA GGGGTTAGAG TTCTTGGAGG GGCTCGCCGC TAAGGGCGGG AGATTCTCCG TCCCTACAAC CGTGAATCCC GGCGCTGTAG ACCTGGAACT GTGGAGGAAG ATGGGCGTAG ACGAGTCGTA TGTGGAGAAA CAGCTCAGGA TAGTGGGCGC GTTTAGAAGG ATGGGGGCGA AGGTTACCTT GACGTGCACC CCCTACCTCT ACGAGGACAT TTCCCCGGGT GACCACCTCG CGTGGTCTGA AAGCAACGCG GTGCTCTTCG CGAACAGCGT TATCGGCGCC AGGACGAACA GGGATGGGGG ACCACTAGCA CTGATGGAGG CTATAGCCGG GCGGGCACCC CTCTCGGGGT TGCACCTCGA CGAGAACAGG AGACCGTCCC TCGTGGTGGA CTTCTCGGAG AGCTCGAGGT ACATCGCGGA AAACGGACTT TTCTCCGTCG CCGGGCTCAT CGTGGGCAGG CTCGCCGGGA ACCGTGTCCC CCTGGTCCGC GGGCTCGGCC TACAGAGAAA AGACGTAGAG GAATTAAAGC TCTTTCTGGC GGCAGTCGGA GCCACCGGGG GGACCGGGAT GGTTCTCATA GATGGAGTTT CGCCCGAAGC CCCCGGGGAC ATGCCAGGAG AGGTTGAGAA AATCGGCGTG GACGACGTCA AGGCGGAGTT AGAGAAGTAC GGGGGCTCCG GGTGGGATGC AGTCGTGCTC GGGTGCCCAC ATCTGAGCTA CGAGGAGGTT GCGTCTATCA TTGAATGGTT CGAGAGAAAA GGTAGGCCCA GGTCCCGGGT GTACCTCTAC ACGAGCAGGG AGGTTGCCTC GAGGCTTCGA AGCGACCGCC TAGAAAAGCT GAATATACAC TTGTTCGCCG ATACGTGCAT GGTGGTTTCC AACCTAGGGG CGTACGCCTC GCGGAGCGTC GCGACGGATT CCGGGAAGGC TGCCTTCTAC CTAGCGTCGA AGGGCTACAG TGTCGCGCTC CTGCCCAGGA GGAAGCTACT GGAGATGCTC GTCCAGGGGT GA
|
Protein sequence | MYLDKFEERM LEGELGEAVA LAMRIVVKIA EIFSAERLVK IKHAHVSGVS YENIGDEGLE FLEGLAAKGG RFSVPTTVNP GAVDLELWRK MGVDESYVEK QLRIVGAFRR MGAKVTLTCT PYLYEDISPG DHLAWSESNA VLFANSVIGA RTNRDGGPLA LMEAIAGRAP LSGLHLDENR RPSLVVDFSE SSRYIAENGL FSVAGLIVGR LAGNRVPLVR GLGLQRKDVE ELKLFLAAVG ATGGTGMVLI DGVSPEAPGD MPGEVEKIGV DDVKAELEKY GGSGWDAVVL GCPHLSYEEV ASIIEWFERK GRPRSRVYLY TSREVASRLR SDRLEKLNIH LFADTCMVVS NLGAYASRSV ATDSGKAAFY LASKGYSVAL LPRRKLLEML VQG
|
| |