Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0203 |
Symbol | |
ID | 4602214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 181490 |
End bp | 182581 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639772957 |
Product | hypothetical protein |
Protein accession | YP_919616 |
Protein GI | 119719121 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00374] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.212885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTG GAAAGAGGGC TTTGGTCATT CTCCTAGAGG TACTGGTAGT ACTGCTAGTG TTCGTCTACA CGCTCTACGG GTTCGACGTT GTGAGCTTCA TAGAGGCTTT GCGGAACGTA GATGCATGGG ACATCCTACC TATAGTTGTC TTCGAGCTGG CCTACTACTT CTCCCACGCG GCCGCCTTCT GGCTTCTCTG CAGGAAGCGC TTCTCGATAT CTATGTGGGA GGCTCTGGGA GGCTCCATGC TGGCATGGCT GGTCGACATA CTTCTTCCAG GAGCATTCGT GGAGGGGGAT ATCGCGCGGG CTATCTTCTT GAAGACGAAG TCGGACTGGC CCTCCGCCGT CAGCTACACT ATCTTCTTCC GCTTCCTCAT CAACGTTACA ATGGTCGTAT TCATAGTGTT CACGTCCCTC CTGGCCATCA ACCTGCTAAA GTTCTACGAG GAGTACCTCG TACTCTACCT CGGCATAGTC GTCGCGACGC TTTTAGCATC GCTTCTACTA GTGCTTGTTT TAACGAAGCC CTTGCTCGTA AAAAACGTCG CAGTCTTCCT GGCTCGGAAA GCTAAGGTGA AACGCTTGGA GAAGTTCGAA AAGGACTTAG AGAACTTTCT AAGCCTCGTA GCGGAGGCTT CTAGGGACTT CAACTTCAGG AACATTCACC TCTGGGGGGC CGTGGCGTTC CTATTCTTGC AGTGGGTCAG CGGGATACTT ACTCCATTCT TCTCGTTGAG GGCCGTGGGC GCCAACGTTA ACTTAGCATT GATAGGACCG GGTTACACTA TCCTAACGCT GTACTCACTA GCGTCCATAG GCATACCGTT CATGGTTGGA AGCATAGACG CTGCACTGCT GTCGCTGTAC CTGCTTCTCG GAGTACCGAG GGAAAAGGCG CTCGCCGCTA CGATAATAGG GAGAAGCGTA ACTATACTCA CGTCGCTCCT CGTGATCTAC CCGATAGGTA TGTTCTTCGC CAAGAAGGTC TTTAGCTCTA GAAACATATC CGTACTGAAG GAAAGCATCA AGAGGGCAGC TGCAGAGTAT GGCTTCGCGT ACACGTTTTC CGAGTCTTCT AGTACCGGCT AG
|
Protein sequence | MKLGKRALVI LLEVLVVLLV FVYTLYGFDV VSFIEALRNV DAWDILPIVV FELAYYFSHA AAFWLLCRKR FSISMWEALG GSMLAWLVDI LLPGAFVEGD IARAIFLKTK SDWPSAVSYT IFFRFLINVT MVVFIVFTSL LAINLLKFYE EYLVLYLGIV VATLLASLLL VLVLTKPLLV KNVAVFLARK AKVKRLEKFE KDLENFLSLV AEASRDFNFR NIHLWGAVAF LFLQWVSGIL TPFFSLRAVG ANVNLALIGP GYTILTLYSL ASIGIPFMVG SIDAALLSLY LLLGVPREKA LAATIIGRSV TILTSLLVIY PIGMFFAKKV FSSRNISVLK ESIKRAAAEY GFAYTFSESS STG
|
| |